An API for extracting structured data from PDFs and images.

You only pay for pages that are 100% accurate. If we get anything wrong, you don't pay for that page.

Extracted Data

{
  "document_type": "invoice",
  "confidence": 0.99,
  "invoice_number": "BORD_282_2025/0006806",
  "vendor": "IKEA IBÉRICA S.A.",
  "vendor_tax_id": "A24812518",
  "buyer": "Windel SL",
  "buyer_tax_id": "B-58372914",
  "issue_date": "2025-08-29",
  "receipt_ref": "SIM_282_2025/1083480",
  "line_items": 16,
  "subtotal": 501.36,
  "vat_21pct": 105.28,
  "total": 606.64,
  "currency": "EUR",
  "status": "PAGADO"
}

{
  "document_type": "payslip",
  "confidence": 0.98,
  "period": "2023-11",
  "employer": "EMPRESA",
  "employee_nif": "01312819M",
  "employee_ss_id": "8251463D",
  "salary_base": 1363.95,
  "gross_pay": 1875.81,
  "irpf_rate": 0.135,
  "irpf_amount": 253.23,
  "ss_deductions": 91.26,
  "net_pay": 1501.58,
  "currency": "EUR",
  "iban": "ES44 2100 0418 4502 0005 2346"
}

{
  "document_type": "purchase_order",
  "confidence": 0.97,
  "po_number": "PO-00-2025",
  "date": "2025-04-24",
  "status": "closed_completed",
  "vendor": "Wendel Harris",
  "vendor_terms": "Net 30 Days",
  "department": "IT Department",
  "approved_by": "Patrick Smith",
  "delivery_date": "2025-04-24",
  "line_items": 7,
  "total": 234.47,
  "currency": "USD"
}

{
  "document_type": "receipt",
  "confidence": 0.99,
  "merchant": "REAL SEAFOOD",
  "date": "2021-05-29",
  "time": "17:55",
  "table": "707",
  "party_size": 4,
  "server": "LEILANI J",
  "line_items": 10,
  "food_total": 142.55,
  "subtotal": 131.00,
  "tax": 10.86,
  "total": 191.86,
  "currency": "USD"
}

Document infrastructure
that just works.

Other solutions give you a toolbox. Invofox gives you results. The polished pipelines that would take you a year to build and a team to maintain, ready from day one.

Data extraction has no shortcuts.
You need a pipeline.

Document processing is not just a feature. It is complex infrastructure built for edge cases, one that scales and learns from feedback.

01. Upload a document (You): Send any PDF, image, or scanned file.
02. File intake & integrity (Ingestion): Handle corrupt and password-protected files.
03. Pre-processing (Parsing): Deskew, denoise, and sharpen for clean OCR.
04. Dual-pass OCR (Parsing): Two passes: one reads the text, one maps the layout.
05. Page splitting (Parsing): Separate multi-document files into subdocuments.
06. Classification (Parsing): Index and categorize each document.
07. Format conversion (Parsing): Get your documents LLM-ready.
08. Multi-step extraction (Extraction): AI models identify every relevant value.
09. Tables & line items (Extraction): Reconstruct tables, reconcile subtotals to totals.
10. Entity normalization (Extraction): Normalize dates, currencies, numbers, and tax codes.
11. Schema mapping (Extraction): Map raw fields into your exact data model.
12. Cross-field validation (Extraction): Check amounts and business rules.
13. Confidence scoring (Extraction): Build field-level and document-level confidence scores.
14. Agentic review (Delivery): Re-check and self-correct low-confidence fields.
15. Webhook delivery (Delivery): Send the final result to your system.
16. Detect edge cases (Improve): Flag docs to avoid errors and get feedback.
17. Learn from feedback (Improve): Improve results with a single API call.
18. Pipeline tuning (Improve): Continuous iteration on real docs and corrections.
19. Live upgrades (Improve): Roll out new AI models.
20. Avoid regressions (Improve): Catch accuracy drops on every change.
21. Scaling & throughput (Infra): Queues, autoscaling, and peak-traffic handling.
22. Monitoring & drift (Infra): Real-time alerts on latency, accuracy, and format drift.
23. Zero-retention (Infra): Documents deleted after delivery, never stored.
24. Receive JSON (You): Clean, validated, schema-mapped structured data.
INVOFOX: Everything between upload and JSON.
1 endpoint · 99%+ accuracy
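
To make steps 09 and 12 (reconciling subtotals to totals, checking amounts) concrete, here is a minimal Python sketch of a cross-field check. It is an illustration under stated assumptions, not Invofox's implementation: the function name `validate_invoice_totals`, the two rules, and the one-cent tolerance are hypothetical, and the field names are borrowed from the sample invoice shown earlier on this page.

```python
from decimal import Decimal


def validate_invoice_totals(doc: dict, tolerance: str = "0.01") -> list:
    """Return a list of issues; an empty list means the amounts reconcile.

    Hypothetical helper: the rules and tolerance are assumptions for illustration.
    """
    issues = []
    # Convert through str() so float inputs such as 501.36 become exact decimals.
    subtotal = Decimal(str(doc["subtotal"]))
    vat = Decimal(str(doc["vat_21pct"]))
    total = Decimal(str(doc["total"]))
    tol = Decimal(tolerance)

    # Assumed rule 1: subtotal plus VAT must match the stated total.
    if abs(subtotal + vat - total) > tol:
        issues.append(f"subtotal + vat = {subtotal + vat}, but total = {total}")

    # Assumed rule 2: the VAT amount should be 21% of the subtotal (within rounding).
    expected_vat = (subtotal * Decimal("0.21")).quantize(Decimal("0.01"))
    if abs(expected_vat - vat) > tol:
        issues.append(f"expected vat {expected_vat}, but found {vat}")

    return issues


invoice = {"subtotal": 501.36, "vat_21pct": 105.28, "total": 606.64}
print(validate_invoice_totals(invoice))  # []
```

A document that fails a check like this would be a natural candidate for the low-confidence review in step 14 rather than for delivery as is.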

Ship it in
an afternoon.

Integrate a single endpoint into your code. Receive clean, structured JSON from any document without building the extraction pipeline, training models, or handling edge cases. Ever.

bash — invofox
$ curl -X POST \
  https://api.invofox.com/v1/extract \
  -H "Authorization: Bearer $KEY" \
  -F "file=@invoice.pdf"
200 OK · 1.2s
{
  "type": "invoice",
  "vendor": "Meridian Ltd",
  "total": 6720.00,
  "confidence": 0.99
}
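
For teams calling the endpoint from application code rather than a shell, the curl call above might translate to something like the following Python sketch, using only the standard library. The URL, the bearer header, and the `file` form field come from the curl example; the helper names `build_extract_request` and `extract`, the generic part content type, and the 30-second timeout are assumptions, not documented Invofox behavior.

```python
import json
import os
import urllib.request
import uuid

API_URL = "https://api.invofox.com/v1/extract"  # endpoint from the curl example


def build_extract_request(file_bytes: bytes, filename: str, api_key: str) -> urllib.request.Request:
    """Build a multipart/form-data POST with one `file` part (hypothetical helper)."""
    boundary = uuid.uuid4().hex
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode(),
        # Assumption: a generic content type is acceptable for the uploaded part.
        b"Content-Type: application/octet-stream\r\n\r\n",
        file_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def extract(path: str, api_key: str) -> dict:
    """Upload the file and decode the JSON response. Performs a real network call."""
    with open(path, "rb") as fh:
        request = build_extract_request(fh.read(), os.path.basename(path), api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling `extract("invoice.pdf", os.environ["KEY"])` would then return a dictionary shaped like the response above, so a caller could, for example, branch on `result["confidence"]` before accepting the data.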

Proven in production worldwide,
trusted by hundreds of teams.

Invofox is in production today in the US, the EU, and LATAM, serving fintechs, marketplaces, logistics operations, accounting platforms, and large enterprises. Here is a snapshot of what it processes every day.

Live overview: production metrics

Today: 743,291 documents processed
Across all docs: 99.2% average accuracy (SLA-bound)
End-to-end: <2s average response time (p50 0.8s, p95 1.4s, p99 1.9s)
Out of the box: 200+ document formats (PDF, JPG, PNG, TIFF, HEIC, +195)

Recent extractions (streaming)
invoice_8237.pdf · PDF · Invoice · 9.2s · done
bundle_482.pdf · PDF · Multi-doc, split into 5 · 11.4s · done
payslip_2104.png · PNG · Payslip · 8.1s · done
statement_5821.pdf · PDF · Bank statement · 12.7s · done
invoice_8238.jpg · JPG · Invoice · processing
batch_201.pdf · PDF · Multi-doc, split into 3 · 10.5s · done
receipt_2298.png · PNG · Receipt · 7.8s · done

Splitter active: 2,847 multi-doc bundles split today (+12% vs yesterday)
Recent splits
bundle_482.pdf → 5 documents
batch_201.pdf → 3 documents
package_009.pdf → 4 documents

Try it now. No card or email required.

No empty promises.
We deliver results.

99%+ accuracy guaranteed

Top results are part of our SLAs.

Accuracy targets are part of our contractual obligations.

$0 if we make a mistake

Pay only for correct data.

Every document where a mistake is reported through our API is automatically credited back. You never pay for an error.
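
As a rough illustration of how pay-per-page billing with credit-backs works out, the arithmetic can be sketched in Python as below. Everything numeric here is hypothetical: the page counts and the price per page are made up for the example, the function name `billable_amount` is an assumption, and the sketch assumes credits are applied per page.

```python
from decimal import Decimal


def billable_amount(pages_processed: int, pages_reported: int, price_per_page: Decimal) -> Decimal:
    """Charge only for pages with no reported mistake (illustrative model)."""
    if not 0 <= pages_reported <= pages_processed:
        raise ValueError("pages_reported must be between 0 and pages_processed")
    # Pages with a reported mistake are credited back, so they drop out of the bill.
    return (pages_processed - pages_reported) * price_per_page


# Hypothetical month: 10,000 pages processed, 80 reported as wrong, 0.05 per page.
print(billable_amount(10_000, 80, Decimal("0.05")))  # 496.00
```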

Pay per page. No credits, no math. See pricing

SLA tier available on plans processing 1M+ documents per year.

Why we built Invofox.

A brief look at the problem we got tired of and how we set out to solve it. Straight from the founders.

Trust, built in. Verified from the outside.

Choose a certification to see its badge. Watch how a real request leaves its zero-retention trail.

Compliance
SOC 2: Active
Type II · audited annually against the AICPA Trust Services Criteria

Our systems and controls are independently audited every year against the AICPA Trust Services Criteria: security, availability, processing integrity, confidentiality, and privacy.

Zero-retention

Process. Deliver. Erase.

Documents deleted right after delivery. No copies, no backups, no logs.

Self-hosted

Run it on your servers.

Deploy Invofox inside your own infrastructure. Same API, your perimeter.

On-prem · VPC · Air-gap

Want the full report? Audits, policies, sub-processors, and the latest pen-test summary live in our trust center.
Open trust center

Frequently asked questions.

{
  "question": "How accurate is Invofox?",
  "answer": "Accuracy thresholds are guaranteed in your SLA, per document type and per field. Every extraction is validated before it counts toward your bill. The feedback loop improves accuracy over time as your team flags edge cases. Stable use cases reach up to 99%."
}

Still have questions? Talk to us