googlesamples / mlkit

A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS
Apache License 2.0

TextRecognizer reads character `<` as either letters S, C, E or K #354

Closed rjmangubat23 closed 3 years ago

rjmangubat23 commented 3 years ago

Hello,

TextRecognizer reads the MRZ filler character `<` as one of the letters S, C, E, or K.

Sample below: image

Our app is an MRZ scanner, so this issue gives us wrong results, especially for givenNames, which has no check digit (the digit that helps validate the data) in the scanned MRZ string.
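For context on why the missing check digit matters, here is a minimal sketch of the MRZ check-digit calculation as defined in ICAO Doc 9303 (repeating weights 7, 3, 1; digits keep their value, A-Z map to 10-35, `<` counts as 0; result is the sum modulo 10). It is written in Python only for brevity and would need porting to the app's language. The function names `mrz_char_value`, `mrz_check_digit`, and `field_is_valid` are hypothetical helpers for illustration, not part of ML Kit.

```python
# Sketch of the ICAO Doc 9303 check-digit rule; helper names are hypothetical.

def mrz_char_value(ch: str) -> int:
    """Map one MRZ character to its numeric value."""
    if "0" <= ch <= "9":
        return ord(ch) - ord("0")
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10
    if ch == "<":
        return 0  # the filler character counts as zero
    raise ValueError(f"not a valid MRZ character: {ch!r}")


def mrz_check_digit(field: str) -> int:
    """Weighted sum with repeating weights 7, 3, 1, taken modulo 10."""
    weights = (7, 3, 1)
    total = sum(mrz_char_value(ch) * weights[i % 3] for i, ch in enumerate(field))
    return total % 10


def field_is_valid(field: str, check_char: str) -> bool:
    """Compare the computed check digit with the one read from the MRZ."""
    return check_char.isdigit() and mrz_check_digit(field) == int(check_char)


# Document number from the ICAO specimen passport: "L898902C3" with check digit 6.
print(field_is_valid("L898902C3", "6"))
```

A field protected this way lets the app reject many misreads, but not all of them: K has the value 20, so every weighted contribution from K is a multiple of 10, and a `<` misread as K leaves the check digit unchanged. The name field has no check digit at all, so none of the S, C, E, or K misreads can be caught there by this method.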

We have tested with both stable repos, but it still happens.

Thank you in advance! :D

zongmins commented 3 years ago

Based on the video and offline testing with Overlay, OCR is fairly accurate on the text below the image but not the text in the image. In addition to the confusion on [<SCEK], there are recognizer errors elsewhere in the passport image. The recognition errors are most likely because the recognizer model wasn't trained with the passport font (monospace font OCR-B).

In the interim, users might try using a higher resolution image, or running recognition multiple times on a video feed to identify which results are stable.
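One possible way to identify stable results across video frames is a per-character majority vote. The sketch below assumes the app has already extracted the same MRZ line as a plain string from each frame; it does not call ML Kit itself. The function name `stable_mrz_line` and the `min_agreement` threshold are hypothetical, and the default length of 44 assumes a passport (TD3) MRZ line. It is written in Python for brevity and would need porting to the app's language.

```python
from collections import Counter
from typing import Optional, Sequence


def stable_mrz_line(
    readings: Sequence[str],
    expected_length: int = 44,   # assumption: one line of a passport (TD3) MRZ
    min_agreement: float = 0.6,  # hypothetical threshold, tune for your feed
) -> Optional[str]:
    """Return the per-position majority reading, or None if frames disagree."""
    # Ignore frames where the line was truncated or had extra characters.
    candidates = [r for r in readings if len(r) == expected_length]
    if not candidates:
        return None

    voted = []
    for column in zip(*candidates):
        char, count = Counter(column).most_common(1)[0]
        if count / len(candidates) < min_agreement:
            return None  # this position is not stable yet; keep scanning
        voted.append(char)
    return "".join(voted)


# Four frames of the same (shortened) name field, two with a misread filler.
frames = ["DOE<<JOHN<<<", "DOE<<JOHNS<<", "DOE<<JOHN<<<", "DOE<<JOHN<K<"]
print(stable_mrz_line(frames, expected_length=12))
```

Voting only helps when the misread varies from frame to frame. If the recognizer makes the same mistake at the same position in most frames, the wrong character wins the vote.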

rjmangubat23 commented 3 years ago

@zongmins @bcdj thank you for replying. I followed your advice and it did improve the results; however, the misreads still happen at times. May I know where I can ask for improved support for the passport font? Or is there a way to modify the recognizer model?

Thank you again!

bcdj commented 3 years ago

You are in the right place to request improved support for the passport font. :-) We have forwarded your request to the model training team and will continue to work on improving our model. No ETA can be provided at this moment.

rjmangubat23 commented 2 years ago

@bcdj may I ask if there are any updates on improved support for passport font in the latest version?

Thank you in advance!