The documentation for the Extract API states these limitations for the API:
|Document file size||100MB|
|Output images per Document Transaction (export)||50|
|Page limit (OCR)*||100|
|Page limit(Non-Scanned - Extract API)*||200|
|JSON file size (Document Generation and HTML to PDF)||10MB|
The nature of the project that I'm working on with the API means I'll be dealing with much larger documents (the largest I've seen so far is 1200 pages).
The same documentation page also states:
- Paid credentials enable a greater processing quota and require entering into a separate written agreement.
Does this mean that if my organization were to subscribe to the API service (via AWS), the 200-page limitation is removed? If not, what other options do I have for processing larger documents?
In this case, the quota refers to moving from a free trial quota, it does not change the limits on the API. You can consider using the Split operation to get your PDF into smaller chunks.
Using the Split might mean losing context that, say, the "Path" field provides in the Extract API, especially since Split just works on a per-page basis. Is there absolutely no way to extend the API limit?