Is there a way to use the huggingface APIs to only download a portion of the dataset? I see that the parquet files are named with validation, validation_unique, test, train, etc. prefixes, but when trying to download a single split, it seems to download the entire dataset:
I'm not sure how huggingface datasets works here, i.e., is there some metadata file that huggingface can use to map a "split" to the files in that split?
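To make the question concrete, here is a minimal sketch of the kind of thing I'm after. I have not confirmed this is the recommended approach. It assumes the parquet files are named `<split>-*.parquet`, which is only a guess based on the prefixes I see in the repo. `files_for_split` and `download_split` are hypothetical helper names of my own; `list_repo_files` and `snapshot_download` (with its `allow_patterns` argument) are from `huggingface_hub`.

```python
from fnmatch import fnmatch


def files_for_split(filenames, split):
    """Return the parquet files whose base name starts with `<split>-`.

    Assumption: files look like `data/train-00000-of-00010.parquet`.
    The trailing hyphen keeps `validation` from also matching
    `validation_unique`.
    """
    pattern = f"{split}-*.parquet"
    return [f for f in filenames if fnmatch(f.rsplit("/", 1)[-1], pattern)]


def download_split(repo_id, split):
    """Download only the files for one split (needs network access)."""
    # Imported lazily so files_for_split works without huggingface_hub installed.
    from huggingface_hub import list_repo_files, snapshot_download

    wanted = files_for_split(list_repo_files(repo_id, repo_type="dataset"), split)
    if not wanted:
        raise ValueError(f"no parquet files found for split {split!r}")
    # allow_patterns restricts the snapshot to the listed files.
    return snapshot_download(
        repo_id=repo_id, repo_type="dataset", allow_patterns=wanted
    )
```

This filters on file names only, so it would break if the repo uses a different layout, which is why I'm asking whether there is a proper split-to-files mapping instead.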