Text Recognition can not detect vertical aligned characters (For container codes)

googlesamples / mlkit

A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS

Apache License 2.0

3.55k stars 2.92k forks source link

Text Recognition can not detect vertical aligned characters (For container codes) #254

Closed yenthanh132 closed 3 years ago

yenthanh132 commented 3 years ago

I am using Text recognition of ML Kit library for detecting Container Codes, there are two options for container codes which are horizontal aligned (normal) and vertical aligned. Text Recognition can scan all the characters for container codes in horizontal alignment but can not scan characters if the container codes is aligned vertically. Is there any option for tunning the Text recognition or it is because of the limitation of Text Recognition Library? I also tested with online Google Vision API but the result is still the same.

As the below image, it can track the container code in horizontal (KTNU 396009 3) but can't detect the container code in vertical (SRGU 212732 5)

cs-googler commented 3 years ago

Hi, the model should be able to detect characters. However, since the Characters in your photo is lined with a certain angle, it is hard for the model to detect all of them. When I use the vision quickstart live preview activity, I can get one or two of them recognized. So, there is space for us to improve the model. Thanks for reporting.

yenthanh132 commented 3 years ago

Thank you @cs-googler for your comments, It would be great if you can improve the model for this case. I understand that this may take time and the new model won't be improved soon, in the mean time I need to find an alternative solution to detect characters in this special case for my business. Do you have any recommendation for me to address this problem so I can also process by myself first? (I will share the result to improve the model too).

fjr619 commented 3 years ago

for text recognition can we have custom model? because cant find the documentation for custom model text recognition. and how to create that custom model to improve accuracy?

chrisito commented 3 years ago

Sadly this is currently a known limitation of the Text Recognition API that we need to address. Sadly there is no quick fix here, open sourcing the underlying models would not really help.

werohit commented 3 years ago

I am facing the same problem.

device-2021-07-01-152522

yenthanh132 commented 3 years ago

I think the best option is to train myself another OCR model for detection this special scenario, close this issue now.

sidathgueye commented 2 years ago

J'utilise la reconnaissance de texte de la bibliothèque ML Kit pour détecter les codes de conteneur, il existe deux options pour les codes de conteneur qui sont alignés horizontalement (normal) et alignés verticalement. La reconnaissance de texte peut scanner tous les caractères pour les codes de conteneurs alignés horizontalement mais ne peut pas scanner les caractères si les codes de conteneurs sont alignés verticalement. Existe-t-il une option pour régler la reconnaissance de texte ou est-ce à cause de la limitation de la bibliothèque de reconnaissance de texte ? J'ai également testé avec l'API Google Vision en ligne mais le résultat est toujours le même.

Comme l'image ci-dessous, il peut suivre le code du conteneur en horizontal (KTNU 396009 3) mais ne peut pas détecter le code conteneur en vertical (SRGU 212732 5)

Bonjour est ce qu aujourdui vous avew une solution pour ce probleme. je travaille exactement sur le meme projet les nuneros de conteneurs.

sidathgueye commented 2 years ago

Bonjour est ce qu aujourdui vous avew une solution pour ce probleme. je travaille exactement sur le meme projet les nuneros de conteneurs.

werohit commented 2 years ago

Non, je ne suis pas en mesure de résoudre ce problème avec la bibliothèque. Je travaille donc sur un autre algorithme OCR pour résoudre ce problème. Mais MLKit n'a pas encore fourni de solution pour cela. J'ai vérifié de nombreuses autres bibliothèques OCR et elles ont déjà fourni une solution pour cela.

sidathgueye commented 2 years ago

Non, je ne suis pas en mesure de résoudre ce problème avec la bibliothèque. Je travaille donc sur un autre algorithme OCR pour résoudre ce problème. Mais MLKit n'a pas encore fourni de solution pour cela. J'ai vérifié de nombreuses autres bibliothèques OCR et elles ont déjà fourni une solution pour cela.

Ah super, vous pouvez me partager un lien github sil vous plait? sinon m'indiquer les solutions que vous avez pu utilisé. Merci d'avance

RazaShehryar commented 2 months ago

is there any update on this? I am facing a similar issue.

RazaShehryar commented 2 months ago

@yenthanh132 did you find a solution for it?