Closed felixdittrich92 closed 12 months ago
@charlesmindee i think you are the only one who can do it (data) :smiley:
@charlesmindee, what kind of data do you use to train your text recogntions models ? Do you use custom data ? and do you use special data for french documents ?
@nikokks for text recognition, we use cropped words from various types of documents, which sum up to about 11M images. The bias is indeed around the choice of characters (European mostly) and the language (some sequences of characters are more frequent than others).
However, a while back, we started a fully synthetic approach (cf. CharacterGenerator
and WordGenerator
). So far, the results are not on par, but still very decent for a fully synthetic approach :)
Training started on Linknet_Resnet50
(Tensorflow), we'll post the results soon :wink:
Detection models in Tensorflow are done! Now, we start recognition models in Tensorflow. First one is master
!
Closing because we track it in #1364
🚀 The feature
Ensure that each model has a pretrained available checkpoint
TensorFlow:
classification:
detection:
recognition:
PyTorch:
classification:
detection:
recognition:
Motivation, pitch