INVOICE
Tech Services LLC
Structured document extraction at scale. One API routes each page to the right engine and returns the fields you pinned. Batch ten documents or ten thousand.
Tech Services LLC
{
"document_type": "invoice",
"vendor": "Tech Services LLC",
"invoice_number": "TS-2026-001",
"issue_date": "2026-03-04",
"due_date": "2026-04-03",
"line_items": [
{ "description": "Laptop Repair, Dell XPS 15", "qty": 1, "total": "$100.00" },
{ "description": "Data Recovery, 500GB HDD", "qty": 1, "total": "$150.00" }
// 12 more
],
"subtotal": "$3,145.00",
"tax": "$314.50",
"total": "$3,459.50"
}
Simplified preview. API responses include a confidence score per field.
Ask an LLM to extract the same document twice and you get two answers. Ask doXtract and you get the same shape every time, because the fields are pinned by a Template or Profile.
{
"vendor_name": "Tech Services",
"amount": "3459.50",
"date": "April 3",
"tax": "maybe included"
}
{
"vendor": "Tech Services LLC",
Total": "$3,459.50",
due: "2026-04-03"
// invented field "late_fee"
}
{
"document_type": "invoice",
"vendor": "Tech Services LLC",
"invoice_number": "TS-2026-001",
"issue_date": "2026-03-04",
"due_date": "2026-04-03",
"line_items": [ ... ],
"subtotal": "$3,145.00",
"tax": "$314.50",
"total": "$3,459.50"
}
Teams evaluating Textract, Azure Document Intelligence, Reducto, or a DIY stack usually need the same thing: reliable fields across document types, without rebuilding glue code every time a vendor changes their layout.
How it works
We only invoke a vision model when a page cannot be read any other way. You get exact text, faster, without paying for a model you did not need.
Templates ship ready from the library. Profiles are yours to define. Either pins the field list before your first API call.
Built from real forms across 14 categories. Consistent fields out of the box.
Build an Extraction Profile from your own documents once. Every matching document afterward returns the same fields in the same shape.
Six pipelines teams wire up first, from invoice intake to agent-ready chunks.
Turn invoices and purchase orders into clean line-item data for your ledger.
Pull figures from bank statements, paystubs, and tax forms in seconds.
Pull merchant, date, and totals from receipt photos and scans.
Extract fields from patient intake, lab results, and insurance claim forms.
Capture parties, dates, and key terms, and redact PII before storage.
Convert any document into clean, chunked text your agents can use.
# one request, any document
curl https://api.doxtract.io/v1/extract \
-H "Authorization: Bearer dxt_live_..." \
-F "file=@invoice.pdf" \
-F "profile=ap-invoices"
# 200 OK
{
"job_id": "job_8fK2mNp9vR",
"status": "completed",
"pages": 2,
"quality_score": 0.97,
"fields": {
"vendor": { "value": "Tech Services LLC", "confidence": 0.98 },
"invoice_number": { "value": "TS-2026-001", "confidence": 0.99 },
"issue_date": { "value": "2026-03-04", "confidence": 0.97 },
"due_date": { "value": "2026-04-03", "confidence": 0.97 },
"line_items": {
"value": [
{ "description": "Laptop Repair, Dell XPS 15", "qty": 1, "total": "$100.00" },
{ "description": "Data Recovery, 500GB HDD", "qty": 1, "total": "$150.00" }
],
"confidence": 0.94
},
"subtotal": { "value": "$3,145.00", "confidence": 0.96 },
"tax": { "value": "$314.50", "confidence": 0.95 },
"total": { "value": "$3,459.50", "confidence": 0.97 }
}
}