I am dealing with an edge case in which a high-quality image is recognized almost perfectly, but surya skips the digit 4 every single time. I tried different bounding boxes, zoom levels, and types of noise, but none of them made any difference. The detector works fine, but the recognizer reads R$33,00 instead of R$334,00 when I specify only Portuguese. If I specify both Portuguese and English, the amount comes out correctly.
How would you reason about the effect of adding English on number recognition? Is there anything in the training data that would make it reasonable to always add English?
These are the commands I ran with Portuguese only, one per zoom level:
surya_ocr 1_z0.png --images --langs pt
surya_ocr 1_z1.png --images --langs pt
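To pin down exactly which characters go missing between the two runs, a small diff helper could be useful. This is only a minimal sketch: `dropped_chars` is a hypothetical helper name, it does not call surya at all, and the two strings are copied by hand from the outputs described above rather than read from surya's result files.

```python
import difflib


def dropped_chars(reference: str, candidate: str) -> list[tuple[int, str]]:
    """Return (index, text) for each run of characters that appears in
    `reference` but is missing from `candidate`.

    Only pure deletions are reported; substitutions (a character read as
    a different character) are ignored in this sketch.
    """
    matcher = difflib.SequenceMatcher(a=reference, b=candidate, autojunk=False)
    missing = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag == "delete":
            missing.append((i1, reference[i1:i2]))
    return missing


if __name__ == "__main__":
    # Assumption: strings typed in by hand from the two runs in this post.
    with_english = "R$334,00"     # Portuguese + English
    portuguese_only = "R$33,00"   # Portuguese only
    print(dropped_chars(with_english, portuguese_only))
```

Running the same comparison over every recognized line would show whether the dropped digit is specific to this amount or a more general pattern when only Portuguese is specified.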