Files
fil/tools/benchmark-harness/fixtures/txt_multilingual.json

15 lines
422 B
JSON
Raw Normal View History

2026-06-01 23:40:55 +02:00
{
"document": "../../../test_documents/text/multilingual.txt",
"file_type": "txt",
"file_size": 169,
"expected_frameworks": ["kreuzberg", "pandoc", "pymupdf4llm", "tika", "unstructured"],
"metadata": {
"description": "Multilingual plain text file",
"category": "text"
},
"ground_truth": {
"text_file": "../../../test_documents/ground_truth/txt/txt_multilingual.txt",
"source": "vision"
}
}