Introduction
The standalone APIs allow you to process PDF and image files without consuming a project. We currently provide 3 operations:- Extract structured data: Extract strucutred data from the file following a JSONSchema.
- Extract markdown: Extract markdown from the file, including headers, paragraphs, lists and tables.
- Extract tables: Extract tables from the file in JSON format.
operationId
obtained in the initial call.
Sync vs Async APIs
Each of the operations listed above is available via a Sync API and an Async API. In Sync APIs, you make a single call which triggers the operation and returns the result. In Async APIs, you make two calls: the first call triggers the operation and returns anoperationId
, and the second call uses the operationId
to check the status and get the result.
Depending on the input, the call might take a long time to complete, especially if the file is large or the operation is complex. If the API is taking too long, the request might time out before the file processing is finished. For this reason, we recommend using the Asynchronous API for most use cases.
The Sync API is limited to 10 requests per minute per operation. If you need a higher rate limit, contact us.