Get Sources
Return the extraction result enriched with per-leaf source provenance.
Each extracted leaf value is wrapped as where source contains citation content, surrounding context, and a format-specific anchor (bbox for PDFs, cell ref for spreadsheets, text span for plain text, etc.).
Authorizations
Path Parameters
Response
Successful Response
An extraction's output annotated with the source that backs each value.
Returned when fetching the sources for an extraction. Carries the source
file and its detected document_type, the original extraction output,
and a parallel sources tree where each leaf is a {value, source} object
locating the value in the document (a page region for PDFs, a cell for
spreadsheets, a text span for plain text, and so on).
ID of the extraction
Detected document type of the source file
pdf, image, csv, xlsx, docx, txt File metadata (id, filename, mime_type)
Original extraction output
Same shape as extraction but leaves are {value, source} objects
"extraction.sources"