fil/.ai-rulez/domains/ocr-integration/rules/ocr-performance.md
Henrik Jess Nielsen b4c07d3693
Nomad changes
2026-06-01 23:40:55 +02:00

priority: high
  • Cache OCR results: key = hash(image_bytes + language + config)
  • Invalidate cache when OCR config changes (backend, language, PSM mode)
  • Batch processing: process multiple images concurrently with configurable parallelism
  • Resource management: limit concurrent OCR operations to avoid memory exhaustion
  • Performance targets: < 2 s for a single page, < 10 s for a 10-page document
  • Monitor and log OCR processing times for regression detection
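
The rules above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: `OcrConfig`, `cache_key`, `ocr_batch`, the `run_ocr` callable, and the dict-based cache are all hypothetical names chosen for the example, and SHA-256 is an assumed choice of hash. Because the key covers every config field, changing the backend, language, or PSM mode produces a different key, so stale entries are never returned.

```python
import asyncio
import hashlib
import json
import logging
import time
from dataclasses import asdict, dataclass

logger = logging.getLogger("ocr")

# Budget taken from the rule above: < 2 s for a single page.
SINGLE_PAGE_BUDGET_S = 2.0


@dataclass(frozen=True)
class OcrConfig:
    """Hypothetical config holder; field names are assumptions."""
    backend: str = "tesseract"
    language: str = "eng"
    psm: int = 3


def cache_key(image_bytes: bytes, config: OcrConfig) -> str:
    """Derive a key from the image content plus every config field."""
    h = hashlib.sha256()
    # Hash the image first so the config part starts at a fixed offset.
    h.update(hashlib.sha256(image_bytes).digest())
    h.update(json.dumps(asdict(config), sort_keys=True).encode("utf-8"))
    return h.hexdigest()


async def ocr_batch(images, config, run_ocr, cache, max_concurrency=4):
    """Run OCR over many images with bounded parallelism.

    `run_ocr` is an assumed async callable (image_bytes, config) -> str.
    `cache` is any dict-like mapping from key to OCR text.
    """
    sem = asyncio.Semaphore(max_concurrency)

    async def one(image_bytes):
        key = cache_key(image_bytes, config)
        if key in cache:
            return cache[key]
        # The semaphore caps how many OCR calls run at once.
        async with sem:
            start = time.perf_counter()
            text = await run_ocr(image_bytes, config)
            elapsed = time.perf_counter() - start
        # Log every timing so regressions show up in the logs.
        logger.info("ocr key=%s seconds=%.3f", key[:12], elapsed)
        if elapsed > SINGLE_PAGE_BUDGET_S:
            logger.warning("ocr key=%s exceeded page budget", key[:12])
        cache[key] = text
        return text

    # gather() preserves input order in its result list.
    return await asyncio.gather(*(one(img) for img in images))
```

A caller would pass its own OCR coroutine, for example `asyncio.run(ocr_batch(pages, OcrConfig(language="deu"), my_ocr, {}, max_concurrency=2))`, where `my_ocr` and `pages` are placeholders. Note that two identical images submitted in the same batch can both miss the cache in this sketch; deduplicating in-flight work would need an extra step.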