Skip to main content
Participant
August 16, 2022
Question

Processing larger documents on extract API?

  • August 16, 2022
  • 1 reply
  • 965 views

The documentation for the Extract API states these limitations for the API: 

 

Document file size100MB
Output images per Document Transaction (export)50
Page limit (OCR)*100
Page limit(Non-Scanned - Extract API)*200
JSON file size (Document Generation and HTML to PDF)10MB

 

The nature of the project that I'm working on with the API means I'll be dealing with much larger documents (the largest I've seen so far is 1200 pages). 

The same documentation page also states:

- Paid credentials enable a greater processing quota and require entering into a separate written agreement. 

 

Does this mean that if my organization were to subscribe to the API service (via AWS), the 200-page limitation is removed? If not, what other options do I have for processing larger documents?

 

Thanks, 

Shree

This topic has been closed for replies.

1 reply

Raymond Camden
Community Manager
Community Manager
August 16, 2022

In this case, the quota refers to moving from a free trial quota, it does not change the limits on the API. You can consider using the Split operation to get your PDF into smaller chunks.

Participant
August 18, 2022

Using the Split might mean losing context that, say, the "Path" field provides in the Extract API, especially since Split just works on a per-page basis. Is there absolutely no way to extend the API limit?