iterative / dataset-registry

Dataset registry DVC project
67 stars 39 forks source link

Add EMNIST dataset to the registry #13

Closed iesahin closed 1 year ago

iesahin commented 3 years ago

There is a much larger dataset similar to MNIST created in 2017. Along with a larger set of digits, it also contains letters split into balanced and unbalanced sets.

https://www.nist.gov/itl/products-and-services/emnist-dataset

The file format is the same as the original MNIST and looks it can be a drop-in replacement.

We can add this to the registry as well, to use as a more realistic dataset.