11 lines
499 B
Markdown
11 lines
499 B
Markdown
|
|
---
|
||
|
|
priority: high
|
||
|
|
---
|
||
|
|
|
||
|
|
- Track confidence scores on all OCR results — expose in API
|
||
|
|
- Image preprocessing (denoise, deskew, binarize) should improve accuracy by 10-30%
|
||
|
|
- PSM mode selection: auto-detect layout, allow user override (single block, single line, sparse text, etc.)
|
||
|
|
- Language detection: validate requested languages are available, provide install hints if not
|
||
|
|
- Multi-language support: allow multiple languages per OCR request
|
||
|
|
- Test OCR accuracy against ground-truth documents in CI
|