Files
fil/tools/benchmark-harness/fixtures/docx_large_formatted.json
Henrik Jess Nielsen b4c07d3693
All checks were successful
Deploy fil (kreuzberg) / deploy (push) Successful in 49s
Nomad changes
2026-06-01 23:40:55 +02:00

16 lines
506 B
JSON

{
"document": "../../../test_documents/docx/test_emf_docx.docx",
"file_type": "docx",
"file_size": 426097,
"expected_frameworks": ["kreuzberg", "docling", "markitdown", "pandoc", "tika", "unstructured"],
"metadata": {
"description": "Large formatted DOCX - 416KB document with EMF graphics",
"category": "docx-complex",
"size_class": "medium"
},
"ground_truth": {
"text_file": "../../../test_documents/ground_truth/docx/docx_large_formatted.txt",
"source": "vision"
}
}