Support `pipeline` argument in inspect.py functions

huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Apache License 2.0

19.28k stars 2.7k forks source link

Is your feature request related to a problem? Please describe.

The wikipedia dataset requires a pipeline argument to build the list of splits:

But this is currently not supported in get_dataset_config_info:

which is called by other functions, e.g. get_dataset_split_names.

Additional context

The dataset viewer is not working out-of-the-box on wikipedia for this reason:

huggingface / datasets