Open marcenacp opened 1 month ago
sure. Thanks for reporting.
@severo Do I understand correctly that each service should:
Do you see a way to fix it more gradually service by service (e.g., starting by /parquet
)? How can we make sure that we don't break anybody relying on names not being serialized in the URL?
Thanks!
When playing with mlcroissant, we observed the following issue:
bigcode/commitpackft has both the configs
c
andc#
. When going to https://huggingface.co/api/datasets/bigcode/commitpackft/parquet/c#/train/0.parquet, it lists https://huggingface.co/api/datasets/bigcode/commitpackft/parquet/c/train/0.parquet (instead of https://huggingface.co/api/datasets/bigcode/commitpackft/parquet/c%23/train/0.parquet).Should dataset names / config names be escaped in the URLs?
cc @severo @lhoestq