Supported Document Formats

The following table indicates the file formats from which GroupDocs.Parser Cloud can extract data.

Document Type | Parse Document by Template | Extract Text | Extract Document Info | Extract Images | Extract Container Items Info
Word Processing
DOC | Microsoft Word Document
DOT | Microsoft Word Document Template
DOCX | Office Open XML Document
DOCM | Office Open XML Macro-Enabled Document
DOTX | Office Open XML Document Template
DOTM | Office Open XML Document Macro-Enabled Template
TXT | Plain text
ODT | Open Document Text
OTT | Open Document Text Template
RTF | Rich Text Format
PDF
PDF | Portable Document Format File
Markup
HTML | Hypertext Markup Language File
XHTML | Extensible Hypertext Markup Language File
MHTML | MIME HTML File
MD | Markdown
XML | XML File
Ebooks
CHM | Compiled HTML Help File
EPUB | Digital E-Book File Format
FB2 | FictionBook 2.0 File
Spreadsheet
XLS | Microsoft Excel Spreadsheet
XLT | Microsoft Excel Template
XLSX | Office Open XML Spreadsheet
XLSM | Office Open XML Macro-Enabled Spreadsheet
XLSB | Office Open XML Binary Spreadsheet
XLTX | Office Open XML Spreadsheet Template
XLTM | Office Open XML Macro-Enabled Spreadsheet Template
ODS | Open Document Spreadsheet
OTS | Open Document Spreadsheet Template
CSV | Comma Separated Values
XLA | Excel Add-In File
XLAM | Excel Open XML Macro-Enabled Add-In
NUMBERS | Apple iWork Numbers
Presentations
PPT | PowerPoint Presentation
PPS | PowerPoint Slideshow
POT | PowerPoint Template
PPTX | Office Open XML Presentation
PPTM | Office Open XML Macro-Enabled Presentation
POTX | Office Open XML Presentation Template
POTM | Office Open XML Macro-Enabled Presentation Template
PPSX | Office Open XML Presentation Slideshow
PPSM | Office Open XML Macro-Enabled Presentation Slideshow
ODP | Open Document Presentation
OTP | Open Document Presentation Template
Emails
PST | Outlook Personal Information Store File
OST | Outlook Offline Data File
EML | E-Mail Message
EMLX | Apple Mail Message
MSG | Outlook Mail Message
Notes
ONE | OneNote Document
Archives
ZIP | Zipped File
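
A client may want to check a file name against the list above before uploading it. The following is a minimal sketch of such a check, not part of any GroupDocs SDK: the names SUPPORTED_FORMATS, CATEGORY_BY_EXTENSION, format_category and is_listed are hypothetical, and the extensions are simply copied from the table. It only tells you whether an extension appears in the list; it does not say which of the individual operations (template parsing, text, document info, images, container items) is available for that format.

```python
from pathlib import Path

# Extensions copied from the table above, grouped by its category headings.
# This mapping is an illustration only; it is not fetched from the service.
SUPPORTED_FORMATS = {
    "Word Processing": ["doc", "dot", "docx", "docm", "dotx", "dotm",
                        "txt", "odt", "ott", "rtf"],
    "PDF": ["pdf"],
    "Markup": ["html", "xhtml", "mhtml", "md", "xml"],
    "Ebooks": ["chm", "epub", "fb2"],
    "Spreadsheet": ["xls", "xlt", "xlsx", "xlsm", "xlsb", "xltx", "xltm",
                    "ods", "ots", "csv", "xla", "xlam", "numbers"],
    "Presentations": ["ppt", "pps", "pot", "pptx", "pptm", "potx", "potm",
                      "ppsx", "ppsm", "odp", "otp"],
    "Emails": ["pst", "ost", "eml", "emlx", "msg"],
    "Notes": ["one"],
    "Archives": ["zip"],
}

# Reverse lookup: lowercase extension -> category name.
CATEGORY_BY_EXTENSION = {
    ext: category
    for category, extensions in SUPPORTED_FORMATS.items()
    for ext in extensions
}


def format_category(filename):
    """Return the table category for a file name, or None if its extension is not listed."""
    # Path.suffix keeps only the last extension, e.g. ".gz" for "a.tar.gz".
    ext = Path(filename).suffix.lower().lstrip(".")
    return CATEGORY_BY_EXTENSION.get(ext)


def is_listed(filename):
    """Return True if the file's extension appears in the table above."""
    return format_category(filename) is not None
```

For example, format_category("Report.DOCX") returns "Word Processing" because the comparison is case-insensitive, while is_listed("photo.png") returns False because PNG does not appear in the table.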