Open zwouter opened 3 months ago
Could this be a Windows-specific issue? I can't reproduce it locally, and I can successfully download_and_prepare the dataset. If the problem persists, you could also try to filter out missing values (example).
If you find a fix for Windows, please feel free to push a PR that fixes the issue :) Thanks!
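For anyone who wants to try the "filter missing values" route outside of tfds first, here is a minimal sketch in plain Python. It is not the tfds builder's code. It assumes the raw HIGGS file is a CSV where each row holds 1 label followed by 28 features (29 columns), and the helper name `iter_complete_rows` is made up for illustration.

```python
import csv
from typing import Iterable, Iterator, List

# Assumption: each HIGGS row is 1 label followed by 28 features.
EXPECTED_COLUMNS = 29


def iter_complete_rows(
    lines: Iterable[str], expected_columns: int = EXPECTED_COLUMNS
) -> Iterator[List[float]]:
    """Yield rows parsed as floats, skipping rows with missing or invalid fields."""
    for row in csv.reader(lines):
        # Skip blank lines and rows with too few or too many columns.
        if len(row) != expected_columns:
            continue
        # Skip rows that contain an empty field.
        if any(field.strip() == "" for field in row):
            continue
        try:
            values = [float(field) for field in row]
        except ValueError:
            # Skip rows with a field that is not a number.
            continue
        # Skip rows containing NaN (NaN is the only float not equal to itself).
        if any(value != value for value in values):
            continue
        yield values
```

To run it on the downloaded archive, something like `gzip.open(path, "rt", newline="")` can be passed as `lines`, where `path` is wherever the file ended up on your machine. Comparing the number of yielded rows with the total line count would also show whether the file really contains incomplete rows, or whether the problem lies elsewhere (e.g. in how the file is read on Windows).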
Short description The Higgs dataset cannot be used, probably because it contains unexpected missing values.
Environment information
Operating System: Windows 11
Python version: 3.11.1
tensorflow-datasets/tfds-nightly version: tensorflow-datasets 4.9.4
tensorflow/tf-nightly version: tensorflow 2.16.1
Does the issue still exist with the latest tfds-nightly package (pip install --upgrade tfds-nightly)? Yes.
Reproduction instructions
Logs
Expected behavior I expect the dataset to be downloaded and prepared such that I can quickly load it in the future.
Additional context I am new to using tfds, but other datasets (e.g. MNIST, CIFAR10) work as intended. The dataset is not supposed to have missing values, according to https://archive.ics.uci.edu/dataset/280/higgs