Closed neelsmith closed 5 years ago
But better. Given an analyzable, univocal OHCO2 Corpus,
MidOrthography
Or compare current work in ocre-texts repository, and use of its FormulaUnit class and object.
ocre-texts
FormulaUnit
Not doing in this library: see https://github.com/neelsmith/latin-corpus
But better. Given an analyzable, univocal OHCO2 Corpus,
MidOrthography
to tokenize it, and filter for lexical tokens