Extract Text From a Document Inside a Container

This REST API allows extracting text from a document placed in a container like ZIP archive, emails, PDF portfolios, etc. by specifying ContainerItemInfo parameter

Resource

The following GroupDocs.Parser Cloud REST API resource has been used in the Extract Text From a Document Inside a Container example.

cURL example

The following example demonstrates how to extract text from a container item.

# First get JSON Web Token
# Please get your Client Id and Client Secret from https://dashboard.groupdocs.cloud/applications.
# Kindly place Client Id in "client_id" and Client Secret in "client_secret" argument.
curl -v "https://api.groupdocs.cloud/connect/token" \
-X POST \
-d "grant_type#client_credentials&client_id#xxxx&client_secret#xxxx" \
-H "Content-Type: application/x-www-form-urlencoded" \
-H "Accept: application/json"
   
# cURL example to join several documents into one
curl -v "https://api.groupdocs.cloud/v1.0/parser/text" \
-X POST \
-H "Content-Type: application/json" \
-H "Accept: application/json" \
-H "Authorization: Bearer 
<jwt token>" \
-d "{
    "FileInfo": {
        "FilePath": "pdf\PDF with attachements.pdf",
        "StorageName": "",
        "Password": "password"
    },
    "ContainerItemInfo": {
        "RelativePath": "document.pdf",
        "Password": "password"
    }
}"
{
    "text": "Text inside bookmark 0\r\n\r\n           Page 0 heading\r\nP a g e  T e x t -  P a g e  0\r\nText inside bookmark 1\r\n\r\n           Page 1 heading\r\nP a g e  T e x t -  P a g e  1\r\nText inside bookmark 2\r\n\r\n           Page 2 heading\r\nP a g e  T e x t -  P a g e  2\r\nText inside bookmark 3\r\n\r\n           Page 3 heading\r\nP a g e  T e x t -  P a g e  3\r\nField\r\nRelatedField2 RelatedText2\r\n REGEX TEXT 123\r\nTABLE\r\n    Cell\r\n                         Cell 12\r\n\r\n\r\n\r\n"
}

SDK examples

Using an SDK (API client) is the quickest way for a developer to speed up the development. An SDK takes care of a lot of low-level details of making requests and handling responses and lets you focus on writing code specific to your particular project. Check out our GitHub repository for a complete list of GroupDocs.Parser Cloud SDKs along with working examples, to get you started in no time. Please check Available SDKs article to learn how to add an SDK to your project.