DIS APIs

This page contains the API endpoints for all document intelligence services.

Extract text from a document

post

Upload a document (digital or scanned) in Base64 format and extract its text. Returns a file_id to track processing status.

Body
file_contentstringRequired

Base64 encoded input document

Example: JVBERi0xLjUKJ...
languagestring · enumRequired

Language of the input document

Example: englishPossible values:
file_typestring · enumRequired

Type of input document

Example: DigitalPossible values:
Responses
200

Text extraction initiated successfully

application/json
post
/v1/document/extract_text

Extract structured information from a document

post

Upload a document (digital or scanned) in Base64 format and extract entities such as PII, NER, or key-value pairs. Optionally specify custom keys to extract.

Body
file_contentstringRequired

Base64 encoded input document

Example: JVBERi0xLjUKJ...
languagestring · enumRequired

Language of the input document

Example: englishPossible values:
file_typestring · enumRequired

Type of input document

Example: DigitalPossible values:
keys_to_extractstring[]Optional

Optional list of specific keys to extract

Example: ["Name","Position"]
Responses
200

Information extraction initiated successfully

application/json
post
/v1/document/extract_information

Summarize a document

post

Upload a document (digital or scanned) in Base64 format and generate a summarized version according to the specified word count.

Body
file_contentstringRequired

Base64 encoded input document

Example: JVBERi0xLjUKJ...
languagestring · enumRequired

Language of the input document

Example: englishPossible values:
file_typestring · enumRequired

Type of input document

Example: DigitalPossible values:
word_count_for_summarizationintegerRequired

Target word count for summarization

Example: 400
Responses
200

Document summarization initiated successfully

application/json
post
/v1/document/doc_summarization

Mask Personally Identifiable Information in a document

post

Upload a document (digital or scanned) in Base64 format and mask sensitive fields like names, positions, and other specified keys.

Body
file_contentstringRequired

Base64 encoded input document

Example: JVBERi0xLjUKJ...
languagestring · enumRequired

Language of the input document

Example: englishPossible values:
file_typestring · enumRequired

Type of input document

Example: DigitalPossible values:
keys_to_maskstring[]Required

List of keys/fields to be masked in the document

Example: ["Name","Position"]
Responses
200

PII masking initiated successfully

application/json
post
/v1/document/PII_masking

Get document status

get

Retrieve the status of a document processing request using its file_id.

Query parameters
file_idstringRequired

The unique ID of the uploaded document.

Responses
200

Status retrieved successfully

application/json
get
/v1/document/status

Last updated

Was this helpful?