Home
Stefan Weil edited this page Oct 22, 2019 · 8 revisions
tesstrain (formerly ocrd-train) is a collection of scripts and documentation for training Tesseract's LSTM engine (supported by Tesseract 4 and newer releases).
Currently it includes a Makefile that allows training from real line images with ground truth (text transcriptions).
Training from synthetic images is supported by training scripts (Shell, Python) which are still part of the Tesseract code base.
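Training from real line images depends on every image having a matching transcription. The following is a minimal sketch, not part of tesstrain, of how one might check a ground truth directory before starting a training run. It assumes the naming convention described in the tesstrain README, where a line image `NAME.tif` or `NAME.png` is paired with a single-line text file `NAME.gt.txt`. The function name `find_training_pairs` and the constants are hypothetical and exist only for illustration.

```python
from pathlib import Path

# Assumed convention (per the tesstrain README): line images are .tif or .png,
# and each transcription shares the image's base name with a .gt.txt suffix.
IMAGE_SUFFIXES = (".tif", ".png")
GT_SUFFIX = ".gt.txt"


def find_training_pairs(directory):
    """Return (pairs, unmatched) for the line images in `directory`.

    pairs     -- list of (image_path, transcription_path) tuples
    unmatched -- list of image paths that have no transcription file
    """
    directory = Path(directory)
    pairs, unmatched = [], []
    for image in sorted(directory.iterdir()):
        if image.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip transcriptions and unrelated files
        transcription = image.with_name(image.stem + GT_SUFFIX)
        if transcription.is_file():
            pairs.append((image, transcription))
        else:
            unmatched.append(image)
    return pairs, unmatched
```

Calling `find_training_pairs("data/ground-truth")` (the directory name here is only an example) returns the usable image and transcription pairs along with any images that still lack a transcription, which can then be fixed or removed before training.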
- Training Fraktur with Austrian Newspapers
- Training Fraktur with GT4HistOCR