OCR numbers - Githubissues

Calamari-OCR / calamari

Line based ATR Engine based on OCRopy

GNU General Public License v3.0

OCR numbers #49

Closed hsl20130659 closed 5 years ago

hsl20130659 commented 5 years ago

It seems that the model cannot recognize numbers.

ChWick commented 5 years ago

Which model did you use?

hsl20130659 commented 5 years ago

fraktur_19th_century

ChWick commented 5 years ago

This model is trained to predict both letters and digits. Could you please provide a line of your data where it fails?

hsl20130659 commented 5 years ago

It writes nothing to result.txt.

hsl20130659 commented 5 years ago

I use a picture that contains numbers and Chinese characters.

ChWick commented 5 years ago

The fraktur_19th_century model can only be used for Fraktur fonts: the characters must be similar to those shown at https://en.wikipedia.org/wiki/Fraktur. The model might detect Arabic digits embedded in Chinese text, but I cannot guarantee that. Do you input a full image or only a single horizontal text line? The latter is required.

hsl20130659 commented 5 years ago

I put a full image into the network. My goal is just to recognize the numbers, and I don't care whether there are Chinese characters or not.

ChWick commented 5 years ago

Calamari can only process single lines, not a full image. You need a separate tool to handle the line segmentation. You can use, e.g., https://github.com/tmbdev/ocropy/wiki/Page-Segmentation for the page and line segmentation.
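
To illustrate what "line segmentation" means here, below is a minimal, dependency-free sketch that splits an already binarized page into horizontal text lines by looking for rows that contain ink. It is not the ocropy algorithm and not part of Calamari; the function name `find_text_lines` and the image representation (a list of rows, 1 = ink, 0 = background) are assumptions made for this example. Real pages with skew, noise, or touching lines need a proper segmentation tool such as the one linked above.

```python
def find_text_lines(binary, min_height=1):
    """Return (top, bottom) row ranges, bottom exclusive, that contain ink.

    `binary` is a hypothetical image representation: a list of rows,
    each row a list of 0 (background) or 1 (ink).
    """
    lines = []
    start = None  # first row of the line currently being scanned
    for y, row in enumerate(binary):
        has_ink = any(row)
        if has_ink and start is None:
            start = y  # a new text line begins
        elif not has_ink and start is not None:
            if y - start >= min_height:
                lines.append((start, y))  # the line ended on the previous row
            start = None
    # Close a line that runs to the bottom edge of the image.
    if start is not None and len(binary) - start >= min_height:
        lines.append((start, len(binary)))
    return lines


# Tiny synthetic page: two "lines" of ink separated by an empty row.
page = [
    [0, 0, 0],
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 0],
    [0, 0, 1],
    [0, 0, 0],
]
line_ranges = find_text_lines(page)
line_images = [page[top:bottom] for top, bottom in line_ranges]
```

Each entry of `line_images` is then a single-line image of the kind Calamari expects as input.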

hsl20130659 commented 5 years ago

My pictures are cropped from ID cards: just the row containing the date of birth.

ChWick commented 5 years ago

Solely the row of birthdays? Can you attach a single image file? Maybe you need https://github.com/Calamari-OCR/calamari_models/tree/master/antiqua_modern as the model (modern typographic fonts).

hsl20130659 commented 5 years ago

You can test the pictures and show me the results.

ChWick commented 5 years ago

This might work, but additional steps are required:

1. Crop the bottom area to the content, leaving a margin of approximately 3 px around the text.
2. Binarize the image.

I did those steps manually and got "(. 1986t 1 t 8 fH" as the result for 001.png.
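
The two preprocessing steps above can be sketched in a few lines of plain Python. This is only an illustration, not the procedure actually used in this thread (that was done by hand): the fixed threshold of 128, the function names, and the list-of-rows image representation are all assumptions. The sketch also binarizes first and crops second, because the ink pixels have to be located before the bounding box can be computed.

```python
def binarize(gray, threshold=128):
    """Map a grayscale image (rows of 0-255 ints) to 1 = ink (dark), 0 = background.

    The fixed threshold is an assumption; real scans often need an
    adaptive method instead.
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def crop_to_content(binary, margin=3):
    """Crop to the bounding box of all ink pixels, padded by `margin` pixels.

    The padded box is clamped to the image borders.
    """
    ys = [y for y, row in enumerate(binary) if any(row)]
    if not ys:
        return binary  # no ink found, nothing to crop
    xs = [x for row in binary for x, px in enumerate(row) if px]
    height, width = len(binary), len(binary[0])
    top = max(min(ys) - margin, 0)
    bottom = min(max(ys) + margin + 1, height)
    left = max(min(xs) - margin, 0)
    right = min(max(xs) + margin + 1, width)
    return [row[left:right] for row in binary[top:bottom]]


# Synthetic 12x12 white image with two dark pixels standing in for text.
gray = [[255] * 12 for _ in range(12)]
gray[5][5] = 0
gray[6][6] = 0

line = crop_to_content(binarize(gray), margin=3)
```

With real data, the grayscale rows would come from an image library of your choice, and the cropped, binarized line would be saved back to a file before being passed to the recognizer.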

ChWick commented 5 years ago

My manually processed input line: 0001

hsl20130659 commented 5 years ago

Do you mind if I add you as a friend? My QQ is 913452977. I think I have more to ask you, hehe.

ChWick commented 5 years ago

You can e-mail me at christoph.wick@informatik.uni-wuerzburg.de.