impresso / impresso-text-acquisition

Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
https://impresso.github.io/impresso-text-acquisition/
GNU Affero General Public License v3.0
7 stars 2 forks source link

Update Text-importer dependencies and documenation, and fix small associated bugs #121

Closed piconti closed 9 months ago

piconti commented 10 months ago

Like impresso-pycommons, impresso-text-acquisition was nut updated since the end of the Impresso I project, and is consequently in need of several updates.

In particular, several dependencies creating inconsistencies, and some documenation is outdated. In addition, some small bugs have been uncovered in the code, along with bigger ones (which are the topic of individual issues:

The ones which do not have any specific related issue will be fixed along with the main dependencies update.