huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://huggingface.co/docs/datasets
Apache License 2.0
19.28k stars
2.7k forks
issues
Newest
Remove upper version limit of fsspec[http]
#7296
cyyever
opened
1 day ago
0
[BUG]: Streaming from S3 triggers `unexpected keyword argument 'requote_redirect_url'`
#7295
casper-hansen
opened
2 days ago
0
Remove `aiohttp` from direct dependencies
#7294
akx
opened
3 days ago
0
Updated inconsistent output in documentation examples for `ClassLabel`
#7293
sergiopaniego
opened
5 days ago
3
DataFilesNotFoundError for datasets `OpenMol/PubChemSFT`
#7292
xnuohz
closed
3 days ago
3
Why return_tensors='pt' doesn't work?
#7291
bw-wang19
opened
6 days ago
2
`Dataset.save_to_disk` hangs when using num_proc > 1
#7290
JohannesAck
opened
1 week ago
0
Dataset viewer displays wrong statists
#7289
speedcell4
closed
1 week ago
1
Release v3.1.1
#7288
alex-hh
closed
1 week ago
0
Support for identifier-based automated split construction
#7287
alex-hh
opened
1 week ago
3
Concurrent loading in `load_from_disk` - `num_proc` as a param
#7286
unography
closed
1 week ago
0
Release v3.1.0
#7285
alex-hh
closed
1 week ago
0
support for custom feature encoding/decoding
#7284
alex-hh
closed
12 hours ago
2
Allow for variation in metadata file names as per issue #7123
#7283
egrace479
opened
2 weeks ago
0
Faulty datasets.exceptions.ExpectedMoreSplitsError
#7282
meg-huggingface
opened
2 weeks ago
0
File not found error
#7281
MichielBontenbal
opened
2 weeks ago
1
Add filename in error message when ReadError or similar occur
#7280
elisa-aleman
opened
2 weeks ago
5
Feature proposal: Stacking, potentially heterogeneous, datasets
#7279
TimCares
opened
2 weeks ago
0
Let soundfile directly read local audio files
#7278
fawazahmed0
opened
2 weeks ago
0
Add link to video dataset
#7277
NielsRogge
closed
2 weeks ago
1
Accessing audio dataset value throws Format not recognised error
#7276
fawazahmed0
opened
2 weeks ago
3
load_dataset
#7275
santiagobp99
opened
2 weeks ago
0
[MINOR:TYPO] Fix typo in exception text
#7274
cakiki
opened
2 weeks ago
0
Raise error for incorrect JSON serialization
#7273
varadhbhatnagar
closed
3 days ago
2
fix conda release worlflow
#7272
lhoestq
closed
3 weeks ago
1
Set dev version
#7271
lhoestq
closed
3 weeks ago
1
Release: 3.1.0
#7270
lhoestq
closed
3 weeks ago
1
Memory leak when streaming
#7269
Jourdelune
opened
3 weeks ago
2
load_from_disk
#7268
ghaith-mq
opened
3 weeks ago
1
Source installation fails on Macintosh with python 3.10
#7267
mayankagarwals
opened
3 weeks ago
1
The dataset viewer should be available soon. Please retry later.
#7266
viiika
closed
3 weeks ago
1
Disallow video push_to_hub
#7265
lhoestq
closed
3 weeks ago
1
fix docs relative links
#7264
lhoestq
closed
3 weeks ago
1
Small addition to video docs
#7263
lhoestq
closed
3 weeks ago
1
Allow video with disabeld decoding without decord
#7262
lhoestq
closed
3 weeks ago
1
Cannot load the cache when mapping the dataset
#7261
zhangn77
opened
3 weeks ago
0
cache can't cleaned or disabled
#7260
charliedream1
opened
3 weeks ago
0
Don't embed videos
#7259
lhoestq
closed
3 weeks ago
1
Always set non-null writer batch size
#7258
lhoestq
closed
3 weeks ago
1
fix ci for pyarrow 18
#7257
lhoestq
closed
3 weeks ago
1
Retry all requests timeouts
#7256
lhoestq
closed
3 weeks ago
1
fix decord import
#7255
lhoestq
closed
3 weeks ago
1
mismatch for datatypes when providing `Features` with `Array2D` and user specified `dtype` and using with_format("numpy")
#7254
Akhil-CM
opened
3 weeks ago
1
Unable to upload a large dataset zip either from command line or UI
#7253
vakyansh
opened
3 weeks ago
0
Add IterableDataset.shard()
#7252
lhoestq
closed
3 weeks ago
1
Missing video docs
#7251
lhoestq
closed
4 weeks ago
1
Basic XML support (mostly copy pasted from text)
#7250
lhoestq
closed
4 weeks ago
1
How to debugging
#7249
ShDdu
opened
4 weeks ago
0
ModuleNotFoundError: No module named 'datasets.tasks'
#7248
shoowadoo
opened
4 weeks ago
2
Adding column with dict struction when mapping lead to wrong order
#7247
chchch0109
opened
1 month ago
0