UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License
176 stars 23 forks source link

Support conversion to MiniOCR #135

Open kba opened 3 years ago

kba commented 3 years ago

Minimal-noise OCR format for full text indexing https://dbmdz.github.io/solr-ocrhighlighting/formats/#miniocr

nichtich commented 1 year ago

Moved to https://dbmdz.github.io/solr-ocrhighlighting/0.8.3/formats/#miniocr