Files
fil/docs/snippets/csharp/config/tesseract_config.md

23 lines
525 B
Markdown
Raw Normal View History

2026-06-01 23:40:55 +02:00
```csharp title="C#"
using Kreuzberg;
var config = new ExtractionConfig
{
Ocr = new OcrConfig
{
Backend = "tesseract",
Language = "eng+deu",
TesseractConfig = new TesseractConfig
{
Psm = 6,
Oem = 3,
MinConfidence = 0.5,
Language = "eng"
}
}
};
var result = await KreuzbergLib.ExtractFile("scanned.pdf", null, config);
Console.WriteLine($"OCR text: {result.Content.Substring(0, Math.Min(100, result.Content.Length))}");
```