OSDN > Finden Software > External Sites > SourceForge.net > Sanskrit / Hindi - Tesseract OCR > Download von Dateienliste > Download

Sanskrit / Hindi - Tesseract OCR

Download von OCRHindi_using_VietOCR_and_Tesseract.pdf (OCRHindi_using_VietOCR_and_Tesseract.pdf ( externer Link: SourceForge.net): 703,290 Bytes) wird in Kürze beginnen. Wenn nicht, klicke auf OCRHindi_using_VietOCR_and_Tesseract.pdf ( externer Link: SourceForge.net).

Datei-Informationen

Dateigröße: 703,290 Bytes
MD5: 2c7e500b2313ad422967d6a3f03a2766

Wohin willst du als nächstes gehen?

Springe zum OSDN Projektseite Anderen Releases anzeigen

Bewertung

Durchschnittlich

0.0

0 Insgesamt

5 Sterne	0
4 Sterne	0
3 Sterne	0
2 Sterne	0
1 Stern	0

Ihr Bewertung

Rezensionen verfassen

Projektbeschreibung

Tesseract OCR 3.02 provides hin.traineddata for recognizing texts in devanagari scripts. However the Hindi training texts, images and box files are not provided, so it is difficult to improve the accuracy by further improving the traineddata. It is noted that recognition is more accurate and faster if the training is done with the same /similar font as used in the text to be OCRed.

I am experimenting with different fonts and training texts and will post the traineddata files for various devanagari fonts in the hope that these can be used to OCR the various scanned books with devanagari text.

Currently traineddata file for Sanskrit2003 font and another similar font used in a book are uploaded here.

See DocumentationWiki for more details.

Sanskrit / Hindi - Tesseract OCR

Datei-Informationen

Wohin willst du als nächstes gehen?

Bewertung

Ihr Bewertung für Sanskrit / Hindi - Tesseract OCR

Projektbeschreibung