🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
19.24k
stars
2.69k
forks
source link
Loading just one particular split is not possible for imagenet-1k #6793
Open
PaulPSta opened 7 months ago
Describe the bug
I'd expect the following code to download just the validation split but instead I get all data on my disk (train, test and validation splits)
` from datasets import load_dataset
dataset = load_dataset("imagenet-1k", split="validation", trust_remote_code=True) `
Is it expected to work like that?
Steps to reproduce the bug
Expected behavior
Just a single (validation) split should be downloaded.
Environment info
python: 3.12.2 datasets: 2.18.0 huggingface_hub: 0.22.2