i was downloaded data file for train model from link (https://drive.google.com/drive/folders/1BWTEMvJ6gF8Xiou2v-IJmlyi21utkMhU) but when i load file (21Gb) have only ~800k samples. but in data original page it is about 9M samples: "This dataset consists of 9 million images covering 90k English words, and includes the training, validation and test splits used in our work."
i was downloaded data file for train model from link (https://drive.google.com/drive/folders/1BWTEMvJ6gF8Xiou2v-IJmlyi21utkMhU) but when i load file (21Gb) have only ~800k samples. but in data original page it is about 9M samples: "This dataset consists of 9 million images covering 90k English words, and includes the training, validation and test splits used in our work."