OCR-D / ocrd_calamari

Recognize text using Calamari OCR and the OCR-D framework
Apache License 2.0
13 stars 6 forks source link

Fix model resource name in ocrd resmgr #88

Open mikegerber opened 1 year ago

mikegerber commented 1 year ago

ocrd resmgr currently uses "qurator-gt4histocr-1.0", which was probably meant to mean "qurator-gt4histocr" for Calamari 1.0?

kba commented 1 year ago

ocrd resmgr currently uses "qurator-gt4histocr-1.0", which was probably meant to mean "qurator-gt4histocr" for Calamari 1.0?

No, that was just a random version because ...

  • [ ] Also try to figure out if OCR-D resmgr has some kind of versioning

... the resource manager has no mechanism for versioning of resources. Hence, the version must be part of the resource name.

  • [ ] If not, try to convince OCR-D to please not hide model versions ("qurator-gt4histocr" is "2019-12-11T11_10+0100", or if needed "2019-12-11")

The best way to do that is to provide the date as part of the name, e.g. qurator-gt4histocr-2019-12-12.

  • [ ] Should put the resources in our own ocrd-tool-json anyway.

Ineed, that way you have full control over the naming and model version.