Open severo opened 2 years ago
Now: the preview (first-rows) works, but not the conversion to parquet. See https://huggingface.co/datasets/wikipedia/viewer/20220301.de/train
_split_generators() missing 1 required positional argument: 'pipeline'
Error code: UnexpectedError
Is your feature request related to a problem? Please describe.
The
wikipedia
dataset requires apipeline
argument to build the list of splits:https://huggingface.co/datasets/wikipedia/blob/main/wikipedia.py#L937
But this is currently not supported in
get_dataset_config_info
:https://github.com/huggingface/datasets/blob/main/src/datasets/inspect.py#L373-L375
which is called by other functions, e.g.
get_dataset_split_names
.Additional context
The dataset viewer is not working out-of-the-box on
wikipedia
for this reason:https://huggingface.co/datasets/wikipedia/viewer