🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
GNU Affero General Public License v3.0
7
stars
2
forks
source link
L'essor 2006-2015 has terrible text recognition #88
Open
simon-clematide opened 4 years ago
Something went pretty wrong on that. The https://www.e-newspaperarchives.ch/?a=d&d=LES20070601-01.2.13&e=-------fr-20--1--img-txIN--------0----- does not seem to suffer from that. Perfect text and perfect layout recognition