![]() ![]() ![]() Also performing image processing considerably slows down the conversion process. Incorrect application of image processing technique can sometimes also degrade the detection accuracy. The selection of image enhancement techniques that can help in improving the detection accuracy depend on many factors such as the quality of the original physical document which was scanned, the scanning fidelity, and the quality and resolution of the scanned image. For example for English, German, and French use "eng+deu+fra".īefore passing the images to Tesseract the document converter can also optionally enhance them to improve the text detection accuracy. Multiple languages can be specified by separating them with a plus sign. Also remember to set the list of languages in the DocumentLanguage setting as shown in the code snippet. To enable additional languages you can download the training data for additional languages from the Tesseract GitHub page and copy it to the tessdata folder, which is located in the same folder as your binaries. Tesseract can recognize many more languages. The add-on NuGet package ships with training data only for the English language. The Tesseract OCR library uses training data to recognize text. ConvertToSeperateFiles ) Ĥ.Run application to see converted output file in application bin directory. CurrentDirectory ,īaseFileName, ConversionMode. ConvertToFile ( inputFiles, outputFileFormat, Environment. TEXT // Convert to searchable PDF documentConverter. RecognizeElementTypes = RecognizeElementTypes. DocumentLanguage = "eng" documentConverter. OFF // Languages used in the scanned document documentConverter. ImageEnhancementMode = ImageEnhancementMode. Base file name for output file string baseFileName = "converted" // Convert to PDF string outputFileFormat = "pdf" // Additional parameters for OCR documentConverter. Create DocumentConverter instance DocumentConverter documentConverter = new DocumentConverter () // List of files to be converted (images and/or scanned PDF) List inputFiles = new List () ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |