Send an agent to extract data from a document, optionally transforming it to a specific data schema. POST a file called 'file'. Must be a filetype readable from the dashboard document task (ie pdf or image)
/api/v2/document/start
Additional hints to the AI about what you want to extract
I want a list of people
The model to use. Large is slower but may be more accurate. Small is faster but may be less accurate.
auto
, large
, small
Attempt to extract extended metadata, like URLs and images
Increasing this will cause the AI to scroll further down the page
A JSON schema to extract against
{"schema":{"name":"<the site name>"}}