Maybe the hOCR can be modified on the server side on the fly when it is requested by a web client:
A program could look up the metadata in the database (date, issue, page number) and add it to the HTML response (title tag, time information). It could then add a link to the page image, and maybe also links to other visualisations (like hocrjs). The same program could also apply OCR post-correction and fix known OCR errors. That approach would keep the original OCR results unchanged on disk, always deliver the best post-correction available, and save disk space, because no corrected copies have to be stored.
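A rough sketch of what such an on-the-fly step might look like, in Python. Everything here is an assumption for illustration: the function name `enrich_hocr`, the metadata field names (`date`, `issue`, `page`), the `KNOWN_FIXES` table, and the choice of a `<link rel="alternate">` element for the image link are hypothetical and not taken from the existing server code or the hOCR specification.

```python
import html
import re

# Hypothetical table of known OCR errors and their corrections.
KNOWN_FIXES = {"Reichsauzeiger": "Reichsanzeiger"}


def enrich_hocr(hocr, meta, image_url, fixes=KNOWN_FIXES):
    """Return an enriched copy of the stored hOCR; the input is not modified."""

    # Apply corrections only to text between tags, so that markup
    # and bbox information in attributes stay untouched.
    def fix_text(match):
        text = match.group(1)
        for wrong, right in fixes.items():
            text = text.replace(wrong, right)
        return ">" + text + "<"

    out = re.sub(r">([^<]+)<", fix_text, hocr)

    # Build the title from database metadata (field names are assumptions).
    title = "{date}, issue {issue}, page {page}".format(**meta)
    new_title = "<title>" + html.escape(title) + "</title>"

    # Image link placed in the head; the rel/type values are an assumption.
    link = ('<link rel="alternate" type="image/jpeg" href="'
            + html.escape(image_url, quote=True) + '"/>')

    # Replace the first <title> element and put the link right after it.
    # If the document has no <title>, nothing is inserted.
    return re.sub(r"<title>.*?</title>", lambda m: new_title + link,
                  out, count=1, flags=re.S)
```

A web handler would read the stored hOCR file, call `enrich_hocr` with the database record for that page, and send the result. The file on disk is never rewritten, so improved correction tables take effect immediately for all pages.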
I have found something interesting at https://digi.bib.uni-mannheim.de/periodika/reichsanzeiger/ocr/film/tesseract-4.0.0-alpha.20170703/012-9419/0580.hocr and would like to see the corresponding image. How can I find it?