robertknight / ocrs

Rust library and CLI tool for OCR (extracting text from images)
Apache License 2.0
1.1k stars 46 forks source link

Support for non-Latin characters #8

Open robertknight opened 8 months ago

robertknight commented 8 months ago

ocrs should support models that can recognize non-Latin text.

Some likely sub-tasks:

DehaiWang commented 8 months ago

will support Chinese character?

robertknight commented 8 months ago

The goal is to make this possible. There are a lot of details still to be figured out.

robertknight commented 2 weeks ago

Eliminating the hard-coded alphabet from the recognition process. IIRC this was from an EasyOCR model that I used at one point.

This was completed in https://github.com/robertknight/ocrs/pull/100.