Parser Configuration

Content Parser Settings

With this settings you can activate or deactivate parsing of additional content-types based on their MIME-types.
For a detailed description of the various MIME-types take a look at http://www.iana.org/assignments/media-types/.
If you want to test a specific parser you can do so using the File Viewer.

Extension Mime-Type
Microsoft Powerpoint Parser
pps
ppt
application/mspowerpoint
application/powerpoint
application/vnd.ms-powerpoint
application/ms-powerpoint
application/mspowerpnt
application/vnd-mspowerpoint
application/x-powerpoint
application/x-m
GNU Zip Compressed Archive Parser
gz
tgz
application/x-gzip
application/gzip
application/x-gunzip
application/gzipped
application/gzip-compressed
gzip/document
Adobe Flash Parser
swf
application/x-shockwave-flash
application/x-shockwave-flash2-preview
application/futuresplash
image/vnd.rn-realflash
vCard Parser
vcf
text/x-vcard
application/vcard
application/x-versit
text/x-versit
text/x-vcalendar
Audio File Meta-Tag Parser
m4p
m4a
oga
flac
ogg
mp3
wma
audio/mpeg
audio/MPA
audio/mpa-robust
audio/mp4
audio/flac
audio/x-flac
audio/x-ms-wma
audio/x-ms-asf
Comma Separated Value Parser
csv
Microsoft Visio Parser
vdx
vtx
vss
vsd
vst
application/visio
application/x-visio
application/vnd.visio
application/visio.drawing
application/vsd
application/x-vsd
image/x-vsd
zz-application/zz-winassoc-vsd
Generic Image Parser
jpg
bmp
jpeg
wbmp
tif
rle
png
cur
ico
jpe
gif
tiff
image/jpg
image/vnd.microsoft.icon
image/x-png
image/tiff
image/vnd.wap.wbmp
image/x-icon
image/ico
image/cursor
image/x-tiff
image/x-bmp
image/png
image/jpeg
image/vnd.microsoft.cursor
image/x-cursor
image/gif
image/bmp
FreeMind Parser
mm
application/freemind
application/x-freemind
PostScript Document Parser
ps
application/postscript
application/ps
application/x-postscript
application/x-ps
application/x-postscript-not-eps
Commodore 64 SID Audio File Parser
sid
audio/prs.sid
audio/psid
audio/x-psid
audio/sidtune
audio/x-sidtune
Open Office XML Document Parser
xltx
xlsx
ppsx
potx
dotx
docx
pptx
application/vnd.openxmlformats-officedocument.wordprocessingml.document
application/vnd.openxmlformats-officedocument.wordprocessingml.template
application/vnd.openxmlformats-officedocument.presentationml.template
application/vnd.openxmlformats-officedocument.presentationml.slideshow
application/vnd.openxmlformats-officedocument.presentationml.presentation
application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
application/vnd.openxmlformats-officedocument.spreadsheetml.template
Metadata Image Parser
psd
tif
image/tiff
image/vnd.adobe.photoshop
image/x-photoshop
Link Scraper Parser
cpp
jsp
c
py
json
jsonp
js
pl
mf
h
application/json
application/x-javascript
text/javascript
text/x-javascript
text/x-json
text/sgml
Torrent Metadata Parser
torrent
application/x-bittorrent
SVG Image Parser
svg
image/svg+xml
Word Document Parser
doc
application/msword
application/doc
appl/text
application/vnd.msword
application/vnd.ms-word
application/winword
application/word
application/x-msw6
application/x-msword
OASIS OpenDocument V2 Text Document Parser
odg
odf
odi
odp
odm
ods
odt
sxc
otg
ott
ots
otp
odc
sxw
odb
application/vnd.oasis.opendocument.text
application/vnd.oasis.opendocument.spreadsheet
application/vnd.oasis.opendocument.presentation
application/vnd.oasis.opendocument.graphics
application/vnd.oasis.opendocument.chart
application/vnd.oasis.opendocument.formula
application/vnd.oasis.opendocument.database
application/vnd.oasis.opendocument.image
application/vnd.oasis.opendocument.text-master
application/vnd.oasis.opendocument.text-template
application/vnd.oasis.opendocument.spreadsheet-template
application/vnd.oasis.opendocument.presentation-template
application/vnd.oasis.opendocument.graphics-template
application/x-vnd.oasis.opendocument.text
application/OOo-calc
application/OOo-writer
Bzip 2 UNIX Compressed File Parser
tbz
tbz2
bz2
application/x-bzip2
application/bzip2
application/x-bz2
application/x-bzip
application/x-stuffit
Streaming HTML Parser
xhtml
stm
php4
php5
php2
shtm
msg
php3
txt
cfm
php
shtml
tpl
tex
html
asp
phtml
phtm
aspx
htm
text/html
text/xhtml+xml
application/xhtml+xml
application/x-httpd-php
application/x-tex
application/vnd.ms-outlook
text/plain
text/csv
Microsoft Excel Parser
xla
xls
application/msexcel
application/excel
application/vnd.ms-excel
application/x-excel
application/x-msexcel
application/x-ms-excel
application/x-dos_ms_excel
application/xls
ZIP File Parser
zip
jar
apk
application/zip
application/x-zip
application/x-zip-compressed
application/x-compress
application/x-compressed
multipart/x-zip
application/java-archive
application/vnd.android.package-archive
Tape Archive File Parser
tar
application/x-tar
application/tar
applicaton/x-gtar
multipart/x-tar
RSS Parser
rss
xml
XML
text/rss
application/rss+xml
application/atom+xml
Acrobat Portable Document Parser
pdf
application/pdf
application/x-pdf
application/acrobat
applications/vnd.pdf
text/pdf
text/x-pdf
7zip Archive Parser
7z
application/x-7z-compressed
Rich Text Format Parser
rtf
text/rtf
text/richtext
application/rtf
application/x-rtf
application/x-soffice
Android Application Parser
apk
application/vnd.android.package-archive
PDF Parser Attributes

This is an experimental setting which makes it possible to split PDF documents into individual index entries. Every page will become a single index hit and the url is artifically extended with a post/get attribute value containing the page number as value. When such an url is displayed within a search result, then the post/get attribute is transformed into an anchor hash link. This makes it possible to view the individual page directly in the pdf.js viewer built-in into firefox, for reference see https://github.com/mozilla/pdf.js/wiki/Viewer-options

Split PDF
Property Name