-
When playing with mlcroissant, we observed the following issue:
[bigcode/commitpackft](https://huggingface.co/datasets/bigcode/commitpackft) has both the configs `c` and `c#`. When going to https:/…
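
The report above is cut off, so the exact failure is not visible. One plausible reading, stated here as an assumption only, is that the `#` in the config name `c#` is treated as a URL fragment delimiter, which makes the `c#` URL collapse onto the `c` one. The sketch below illustrates that effect with the standard library; the URL layout (config name appended as the last path segment) and both helper names are hypothetical, not taken from mlcroissant or the Hub.

```python
from urllib.parse import quote, urlsplit

# Dataset page from the report; appending the config as a path segment
# is an assumed layout used only for illustration.
BASE = "https://huggingface.co/datasets/bigcode/commitpackft"


def naive_config_url(config: str) -> str:
    """Hypothetical helper: append the config name without any escaping."""
    return f"{BASE}/{config}"


def encoded_config_url(config: str) -> str:
    """Hypothetical helper: percent-encode the config name first."""
    # safe="" makes quote() escape every reserved character, including '#' and '/'.
    return f"{BASE}/{quote(config, safe='')}"


# Unescaped, '#' starts the fragment, so both configs share one path.
path_c = urlsplit(naive_config_url("c")).path
path_c_sharp = urlsplit(naive_config_url("c#")).path
print(path_c == path_c_sharp)

# Percent-encoded, the two configs stay distinct.
print(urlsplit(encoded_config_url("c#")).path)
```

If the actual issue has a different cause, the same check (comparing `urlsplit(...).path` for the two configs) is still a quick way to rule the fragment explanation in or out.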
-
https://github.com/huggingface/alignment-handbook/blob/606d2e954fd17999af40e6fb4f712055ca11b2f0/src/alignment/data.py#L216-L221
The actual exception is a `ValueError`:
```
[rank5]: Traceback (most re…
-
For example, https://huggingface.co/datasets/ontocord/CulturaY.
-
Datasets format
-
https://huggingface.co/datasets/linux-cn/archive needs someone to write a scraper
-
Hi, I just noticed that the BOP datasets have moved to Hugging Face, so it would be better to update the `source_url` to https://huggingface.co/datasets/bop-benchmark/datasets/resolve/main for convenience of other…
-
Currently, the scraped GitHub discussions are in the `data` folder. Move them to an HF dataset.
- [ ] Uploading files to HF Dataset. Link: https://huggingface.co/datasets/The-OpenROAD-Project/ORQA_discu…
-
In some apps, the metrics and healthcheck are public:
- https://datasets-server.huggingface.co/admin/metrics
- https://datasets-server.huggingface.co/sse/metrics
- https://datasets-server.hugging…
-
**Is your feature request related to a problem? Please describe.**
NeMo Curator currently supports document datasets as dataframes and includes some helpers to read from JSON/Parquet files.
**Describe…
-
Given this manifest:
```
version = 1
[install]
psycopg.pkg-path = "python312Packages.psycopg"
tqdm.pkg-path = "python312Packages.tqdm"
datasets.pkg-path = "python312Packages.datasets"
ipywidgets.pkg-…