Files
fil/.ai-rulez/domains/ocr-integration/rules/ocr-backend-standards.md
Henrik Jess Nielsen b4c07d3693
All checks were successful
Deploy fil (kreuzberg) / deploy (push) Successful in 49s
Nomad changes
2026-06-01 23:40:55 +02:00

568 B

priority
priority
critical
  • Pluggable backend architecture: all backends implement the OcrBackend trait
  • Backend independence: switching backends must not require API changes
  • Tesseract is the default backend (native C FFI via leptess)
  • Python backends (EasyOCR, PaddleOCR): use tokio::task::spawn_blocking, release GIL for Rust work
  • Graceful degradation: if preferred backend unavailable, fall back to next available
  • All backends must return structured results with confidence scores
  • Document installation requirements and troubleshooting for each backend