A repository for a digitisation effort of Ora Maritima (1902) and Pro Patria (1903).
Read online here
orig
: contains original scans from Internet Archive and OCR transcriptionstext
: contains edited and corrected text. The Latin text has been fully macronised._macron.txt
files are 'true' plain text,_tagged.txt
files have section numberings for HTML and one paragraph per line, with line break tags to mark lines of verse.docs
: contains HTML files of texts and website pages
- OCR: done
- Automatic spellchecking: done
- Manual spellchecking and correction: done
- Add macrons: done
- Create HTML file: done
- Add pictures from print books to HTML files: done
- Tokenise and lemmatise text: to do
- Create vocabulary aids: to do
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.