Benchmarks tell the full story.
We benchmarked Unstructured against Reducto, LlamaParse, Docling, Snowflake, Databricks & NVIDIA across 1,000+ enterprise pages. See the data.
Benchmarking the Landscape
We could tell you we're the best document parsing solution. Instead, we'll show you the data and let you decide.
How We Evaluated
We benchmarked Unstructured against leading document parsing tools—including Reducto, LlamaParse, Docling, Snowflake, Databricks, and NVIDIA—using a real-world enterprise dataset of over 1,000 pages. The documents reflect messy production reality: scanned invoices, complex layouts, nested tables, handwritten notes, and industry-specific formats. We measured performance across five key metrics: Adjusted CCT, Tokens Added, Element Alignment, Table Cell Level Content Accuracy, and Table Cell Level Spatial Accuracy.
The Tool Landscape
For Unstructured, Reducto, and Docling, we tested multiple pipeline configurations. For the other tools, we used their default configurations.
Overall Outcome
| System | Adjusted CCT | Tokens Added | Element Alignment | Table Cell Level Content Accuracy | Table Cell Level Spatial Accuracy |
|---|---|---|---|---|---|
| — | #1 0.880 | #1 0.051 | #4 0.574 | #1 0.820 | #1 0.813 |
| — | 0.809 | 0.053 | 0.417 | 0.615 | 0.623 |
| — | 0.648 | 0.070 | 0.339 | 0.559 | 0.651 |
| — | 0.792 | 0.102 | 0.608 | 0.556 | 0.583 |
| — | 0.812 | 0.124 | 0.595 | 0.708 | 0.706 |
| — | 0.835 | 0.069 | 0.277 | 0.522 | 0.578 |
| — | 0.716 | 0.135 | 0.599 | 0.657 | 0.716 |
The Full Picture
To provide complete transparency, we're sharing detailed results for every pipeline configuration we tested. The charts below also show how different Unstructured pipelines, each using a different combination of partitioning and enrichment strategies, perform across the SCORE metrics above.
What We Found
The results across pipelines and metrics reveal a few patterns worth understanding before you dig into the charts.
Our Detailed Results
Each pipeline below uses a different combination of partitioning strategy and enrichments. Use these charts to find the best fit for your documents and use case.
Adjusted CCT by Pipeline
The core measure of text accuracy. Unlike basic string matching, it accounts for formatting differences so a pipeline that outputs structured HTML and one that outputs plain text can both score well — what matters is whether the actual words were captured.
[Chart: Adjusted CCT by pipeline]
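If you want a feel for what a formatting-agnostic text score looks like, here is a minimal Python sketch. It illustrates the general idea, not the exact Adjusted CCT computation used in this benchmark; the `normalize` and `text_accuracy` helpers are hypothetical names chosen for the example.

```python
import re
from difflib import SequenceMatcher

def normalize(text: str) -> str:
    """Strip markup and collapse whitespace so structured (HTML) and
    plain-text outputs are compared on their words alone."""
    text = re.sub(r"<[^>]+>", " ", text)   # drop HTML tags
    text = re.sub(r"[|#*_`]", " ", text)   # drop common Markdown characters
    return re.sub(r"\s+", " ", text).strip().lower()

def text_accuracy(ground_truth: str, parsed: str) -> float:
    """Formatting-agnostic similarity in [0, 1]; higher is better."""
    return SequenceMatcher(None, normalize(ground_truth), normalize(parsed)).ratio()

# An HTML table output and a plain-text output score identically,
# because only the captured words matter.
html_out = "<table><tr><td>Q3 revenue</td><td>$1.2M</td></tr></table>"
text_out = "Q3 revenue $1.2M"
print(text_accuracy("Q3 revenue $1.2M", html_out))  # 1.0
print(text_accuracy("Q3 revenue $1.2M", text_out))  # 1.0
```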
Tokens Added by Pipeline
Counts words a pipeline generated that were never in the source document. In production AI applications, invented content is often more damaging than missing content — it feeds false information directly into whatever is built on top.
[Chart: Tokens Added by pipeline]
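As a rough illustration of the approach, the sketch below flags every output token that cannot be matched against the source. It is a simplified stand-in for the benchmark's Tokens Added metric; the `tokens_added` function name and the word-level tokenization are illustrative assumptions.

```python
import re
from collections import Counter

def tokens_added(source: str, parsed: str) -> float:
    """Fraction of the parser's output tokens that never appear in the
    source document. Lower is better; 0.0 means nothing was invented."""
    tokenize = lambda s: re.findall(r"\w+", s.lower())
    budget = Counter(tokenize(source))   # how often each word may legitimately appear
    out_tokens = tokenize(parsed)
    extra = 0
    for tok in out_tokens:
        if budget[tok] > 0:
            budget[tok] -= 1             # token is accounted for by the source
        else:
            extra += 1                   # token the parser made up
    return extra / max(len(out_tokens), 1)

# "approximately" never appears in the source, so it counts as added.
print(tokens_added("Total due: 1500 USD", "Total due: approximately 1500 USD"))  # 0.2
```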
Element Alignment by Pipeline
Documents are made up of different element types — headings, paragraphs, tables, figures. This measures whether a pipeline correctly identifies and consistently classifies those elements. A system that extracts all the text but mislabels what it is loses the document's structure entirely.
[Chart: Element Alignment by pipeline]
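A simplified sketch of the idea: greedily pair each ground-truth element with the most similar predicted text, then check whether the predicted category agrees. The benchmark's actual alignment procedure is more involved; the `element_alignment` helper and the (category, text) tuple representation are assumptions made for the example.

```python
from difflib import SequenceMatcher

def element_alignment(gt_elements, pred_elements):
    """Each element is a (category, text) tuple, e.g. ("Title", "Q3 Results").
    Pair every ground-truth element with its closest predicted text, then
    count how often the predicted category label agrees."""
    remaining = list(pred_elements)
    correct = 0
    for gt_category, gt_text in gt_elements:
        if not remaining:
            break
        best = max(remaining, key=lambda p: SequenceMatcher(None, gt_text, p[1]).ratio())
        remaining.remove(best)
        if best[0] == gt_category:
            correct += 1
    return correct / max(len(gt_elements), 1)

gt = [("Title", "Quarterly Report"), ("NarrativeText", "Revenue grew 12%.")]
pred = [("NarrativeText", "Quarterly Report"), ("NarrativeText", "Revenue grew 12%.")]
print(element_alignment(gt, pred))  # 0.5 -- the title was extracted but mislabeled
```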
Table Cell Level Content Accuracy by Pipeline
Tables are the hardest part of document parsing. This measures whether the text inside each individual cell was extracted correctly — wrong numbers in a financial table are worse than no table at all.
[Chart: Table Cell Level Content Accuracy by pipeline]
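One way to picture a content-only cell score, assuming tables are represented as (row, column) to text mappings: credit each ground-truth cell with its best text match anywhere in the predicted table, regardless of where it landed. This is a sketch, not the exact metric used in the benchmark.

```python
from difflib import SequenceMatcher

def cell_content_accuracy(gt_cells, pred_cells):
    """Tables as dicts mapping (row, col) -> cell text. Each ground-truth
    cell is scored by its best text match anywhere in the predicted table,
    so content is rewarded even if it landed in the wrong cell."""
    if not gt_cells:
        return 1.0
    pred_texts = list(pred_cells.values()) or [""]
    score = 0.0
    for text in gt_cells.values():
        score += max(SequenceMatcher(None, text, p).ratio() for p in pred_texts)
    return score / len(gt_cells)
```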
Table Cell Level Spatial Accuracy by Pipeline
Getting cell content right is only half the problem. This measures whether each piece of text landed in the correct row and column. Structure is what makes a table useful rather than just a list of values.
[Chart: Table Cell Level Spatial Accuracy by pipeline]
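By contrast, a position-aware check only credits a cell when its text sits at the same row and column in the predicted grid. Again a sketch under the same assumed dict-of-cells representation, not the benchmark's exact formula.

```python
def cell_spatial_accuracy(gt_cells, pred_cells):
    """Tables as dicts mapping (row, col) -> cell text. A cell only counts
    if its text sits at the same row and column in the predicted table."""
    if not gt_cells:
        return 1.0
    hits = sum(
        1 for pos, text in gt_cells.items()
        if pred_cells.get(pos, "").strip().lower() == text.strip().lower()
    )
    return hits / len(gt_cells)

gt = {(0, 0): "Revenue", (0, 1): "1500"}
swapped = {(0, 0): "1500", (0, 1): "Revenue"}   # right content, wrong columns
print(cell_spatial_accuracy(gt, swapped))        # 0.0
```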
Real-World Data, Rigorous Evaluation
The benchmarks tell the story. The methodology is open. The data speaks for itself. Try Unstructured and see why leading teams rely on it to power production AI pipelines.
Ready for a demo?
See how Unstructured simplifies data workflows, reduces engineering effort, and scales effortlessly. Get a live demo today.