issues
search
huggingface
/
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://huggingface.co/docs/datasets
Apache License 2.0
19.29k
stars
2.7k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Solution to issue: #7080 Modified load_dataset function, so that it prompts the user to select a dataset when subdatasets or splits (train, test) are available
#7191
negativenagesh
closed
2 weeks ago
1
Datasets conflicts with fsspec 2024.9
#7190
cw-igormorgado
opened
1 month ago
1
Audio preview in dataset viewer for audio array data without a path/filename
#7189
Lauler
opened
1 month ago
0
Pin multiprocess<0.70.1 to align with dill<0.3.9
#7188
albertvillanova
closed
1 month ago
1
shard_data_sources() got an unexpected keyword argument 'worker_id'
#7187
Qinghao-Hu
opened
1 month ago
0
pinning `dill<0.3.9` without pinning `multiprocess`
#7186
shubhbapna
closed
1 month ago
0
CI benchmarks are broken
#7185
albertvillanova
closed
1 month ago
1
Pin dill<0.3.9 to fix CI
#7184
albertvillanova
closed
1 month ago
1
CI is broken for deps-latest
#7183
albertvillanova
closed
1 month ago
0
Support features in metadata configs
#7182
albertvillanova
closed
1 month ago
2
Fix datasets export to JSON
#7181
varadhbhatnagar
closed
3 weeks ago
8
Memory leak when wrapping datasets into PyTorch Dataset without explicit deletion
#7180
iamwangyabin
closed
1 month ago
1
Support Python 3.11
#7179
albertvillanova
closed
1 month ago
1
Support Python 3.11
#7178
albertvillanova
closed
1 month ago
0
Fix release instructions
#7177
albertvillanova
closed
1 month ago
1
fix grammar in fingerprint.py
#7176
jxmorris12
opened
2 months ago
0
[FSTimeoutError] load_dataset
#7175
cosmo3769
closed
1 month ago
6
Set dev version
#7174
albertvillanova
closed
2 months ago
1
Release: 3.0.1
#7173
albertvillanova
closed
2 months ago
1
Add torchdata as a regular test dependency
#7172
albertvillanova
closed
2 months ago
1
CI is broken: No solution found when resolving dependencies
#7171
albertvillanova
closed
2 months ago
0
Support JSON lines with missing columns
#7170
albertvillanova
closed
2 months ago
1
JSON lines with missing columns raise CastError
#7169
albertvillanova
closed
2 months ago
0
sd1.5 diffusers controlnet training script gives new error
#7168
Night1099
closed
1 month ago
3
Error Mapping on sd3, sdxl and upcoming flux controlnet training scripts in diffusers
#7167
Night1099
closed
1 month ago
1
fix docstring code example for distributed shuffle
#7166
lhoestq
closed
2 months ago
1
fix increase_load_count
#7165
lhoestq
closed
2 months ago
3
fsspec.exceptions.FSTimeoutError when downloading dataset
#7164
timonmerk
opened
2 months ago
5
Set explicit seed in iterable dataset ddp shuffling example
#7163
alex-hh
closed
2 months ago
1
Support JSON lines with empty struct
#7162
albertvillanova
closed
2 months ago
1
JSON lines with empty struct raise ArrowTypeError
#7161
albertvillanova
closed
2 months ago
0
Support JSON lines with missing struct fields
#7160
albertvillanova
closed
2 months ago
1
JSON lines with missing struct fields raise TypeError: Couldn't cast array
#7159
albertvillanova
closed
2 months ago
1
google colab ex
#7158
docfhsp
opened
2 months ago
0
Fix zero proba interleave datasets
#7157
lhoestq
closed
2 months ago
1
interleave_datasets resets shuffle state
#7156
jonathanasdf
opened
2 months ago
0
Dataset viewer not working! Failure due to more than 32 splits.
#7155
sleepingcat4
closed
2 months ago
1
Support ndjson data files
#7154
albertvillanova
closed
2 months ago
2
Support data files with .ndjson extension
#7153
albertvillanova
closed
2 months ago
0
Align filename prefix splitting with WebDataset library
#7151
albertvillanova
closed
2 months ago
0
WebDataset loader splits keys differently than WebDataset library
#7150
albertvillanova
closed
2 months ago
0
Datasets Unknown Keyword Argument Error - task_templates
#7149
varungupta31
closed
2 months ago
2
Bug: Error when downloading mteb/mtop_domain
#7148
ZiyiXia
closed
2 months ago
4
IterableDataset strange deadlock
#7147
jonathanasdf
closed
2 months ago
6
Set dev version
#7146
albertvillanova
closed
2 months ago
1
Release: 3.0.0
#7145
albertvillanova
closed
2 months ago
1
Fix key error in webdataset
#7144
ragavsachdeva
closed
2 months ago
7
Modify add_column() to optionally accept a FeatureType as param
#7143
varadhbhatnagar
closed
2 months ago
6
Specifying datatype when adding a column to a dataset.
#7142
varadhbhatnagar
closed
2 months ago
1
Older datasets throwing safety errors with 2.21.0
#7141
alvations
closed
2 months ago
17
Previous
Next