Files

17 lines
483 B
JSON
Raw Permalink Normal View History

2026-06-01 23:40:55 +02:00
{
"document": "../../../../test_documents/pdf/pdfa_029.pdf",
"file_type": "pdf",
"file_size": 5262991,
"expected_frameworks": ["kreuzberg"],
"metadata": {
"description": "PDF/A benchmark document",
"source": "pdfa",
"size_category": "large"
},
"ground_truth": {
"text_file": "../../../../test_documents/ground_truth/pdf/pdfa_029.txt",
"source": "mistral-pixtral",
"markdown_file": "../../../../test_documents/ground_truth/pdf/pdfa_029.md"
}
}