rois-codh / kmnist

Repository for Kuzushiji-MNIST, Kuzushiji-49, and Kuzushiji-Kanji
Creative Commons Attribution Share Alike 4.0 International
693 stars 94 forks source link

Black images #1

Closed hysts closed 4 years ago

hysts commented 5 years ago

It seems there are black images in KMNIST training/testing data.

Indices of black images in training data: 13109, 13415, 13442, 23328, 24234, 26413, 28814, 29279, 33433, 38946, 42827, 44186, 50110, 55361, 55533, 56969, 57915

Index of black image in testing data: 9677

I checked both MNIST format files and NumPy format ones.

mxbi commented 5 years ago

Thanks for finding this and bringing it to our attention :) It seems to have been an issue with preprocessing that wasn't caught when the data was manually checked.

Since the dataset has only just been released we plan to release an updated version soon with the black images fixed in all three datasets.

hysts commented 5 years ago

Great! Looking forward to it. :)

mxbi commented 4 years ago

Updated dataset released back in February fixes this. Thank you again for reporting this!