weinman / cnn_lstm_ctc_ocr

TensorFlow-based CNN+LSTM trained with CTC loss for OCR
GNU General Public License v3.0

Multiple Words? #16

Omnipresent closed this issue 6 years ago

Omnipresent commented 6 years ago

This isn't an issue but a question. I've read the CRNN paper and have played around with a PyTorch implementation of it: https://github.com/meijieru/crnn.pytorch

I've noticed that CRNN is able to detect single words but not multiple words. Example: box_21

My question is, would this project help with detection of multiple words?

weinman commented 6 years ago

To be clear, this software only does cropped text recognition. That is, it doesn't take an arbitrary image and identify the bounding boxes of the text (or words).

However, assuming you have something like the example above, this model could in theory identify the separate words if you include a literal space among the output character labels and generate training data with multiple words (containing the space character label). This Keras example does so (with two words).
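
As a rough illustration of the idea (not code from this repository), the sketch below adds a space to a hypothetical character set, encodes a two-word ground-truth string as label indices, and shows that a greedy CTC-style decode recovers the space so the words can be split afterwards. The alphabet, the helper names `encode` and `greedy_ctc_decode`, and the choice of the last index for the CTC blank are all assumptions made for the example.

```python
import string

# Hypothetical alphabet: lowercase letters, digits, and a literal space.
ALPHABET = string.ascii_lowercase + string.digits + " "
CHAR_TO_LABEL = {ch: i for i, ch in enumerate(ALPHABET)}
# Assumed CTC blank index (last class). The blank is NOT the space character.
BLANK = len(ALPHABET)


def encode(text):
    """Map a ground-truth string (possibly several words) to label indices."""
    return [CHAR_TO_LABEL[ch] for ch in text.lower()]


def greedy_ctc_decode(frame_labels):
    """Collapse repeated per-frame labels, then drop CTC blanks."""
    chars = []
    previous = None
    for label in frame_labels:
        if label != previous and label != BLANK:
            chars.append(ALPHABET[label])
        previous = label
    return "".join(chars)


# Training target for a cropped image that contains two words.
labels = encode("hello world")

# Simulated per-frame argmax output: each label held for two frames,
# followed by a blank frame (so doubled letters such as "ll" survive).
frames = []
for label in labels:
    frames.extend([label, label, BLANK])

decoded = greedy_ctc_decode(frames)
words = decoded.split()  # the predicted space separates the words
print(decoded, words)
```

The point to note is that the space is an ordinary output class with its own label, separate from the CTC blank, so the training images have to contain visible gaps that correspond to it.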

Omnipresent commented 6 years ago

I am using CTPN to prepare the cropped images from the bounding boxes it provides. Its bounding boxes cover multiple words rather than single words. Thanks for confirming that my training data would need a literal space. I will try playing around to see if there are other ways. Thanks.