clovaai / deep-text-recognition-benchmark

Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Apache License 2.0
3.77k stars 1.11k forks source link

MJ_train LmDB have only 800k samples #234

Open dangvansam opened 4 years ago

dangvansam commented 4 years ago

i was downloaded data file for train model from link (https://drive.google.com/drive/folders/1BWTEMvJ6gF8Xiou2v-IJmlyi21utkMhU) but when i load file (21Gb) have only ~800k samples. but in data original page it is about 9M samples: "This dataset consists of 9 million images covering 90k English words, and includes the training, validation and test splits used in our work."