issues
search
rom1504
/
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
MIT License
3.74k
stars
341
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
All Images Have Failed to Resize... (center_crop)
#441
Shiran-Yuan
opened
1 week ago
1
Failed to resize 20%
#440
JosselinSomervilleRoberts
closed
1 week ago
1
Download Laion-coco lots of failed
#439
LIRENDA621
opened
3 weeks ago
1
Q: docs on deduplication?
#438
ppbrown
opened
3 weeks ago
1
Some URL's cause img2dataset to hang indefinitely
#437
thecodingwizard
opened
1 month ago
2
Use a Reasonable User Agent
#436
solonovamax
opened
1 month ago
1
How to download image files from Laion-5b?
#435
laolongboy
closed
2 months ago
1
why still add ray.remote to download when use ray as distributer?
#434
caohan25
opened
3 months ago
0
--resize_mode "center_crop" fails due to change in albumentations
#433
jonasricker
opened
4 months ago
0
albumentations/check_version.py", line 29, in fetch_version_info urllib.error.URLError: <urlopen error [Errno 101] Network is unreachable> ERROR:albumentations.check_version:Error fetching version info
#432
lucasjinreal
opened
4 months ago
0
pyarrow.lib.ArrowInvalid: CSV parse error: Expected 1 columns, got 2: https://rmrbcmsonline.peopleapp.com/upload/ueditor/image/20200914/a_490014455258673152.jpg?x-oss ...
#431
lucasjinreal
opened
4 months ago
0
I want to know what this means?
#430
jiamingzhang94
opened
5 months ago
1
make wandb job name configurable
#429
gabrielilharco
opened
5 months ago
0
Support read image from local path
#428
thesby
opened
6 months ago
2
[DRAFT] feat: allow fixed resize
#427
borisdayma
opened
6 months ago
0
pyarrow.lib.ArrowInvalid: No match for FieldRef.Name(URL) in sample_id: double
#426
LIUYUANWEI98
opened
6 months ago
1
Bump pytest from 8.0.0 to 8.2.0
#425
dependabot[bot]
opened
7 months ago
0
Bump black from 24.1.1 to 24.4.2
#424
dependabot[bot]
opened
7 months ago
0
Bump black from 24.1.1 to 24.4.1
#423
dependabot[bot]
closed
7 months ago
1
Bump mypy from 1.8.0 to 1.10.0
#422
dependabot[bot]
opened
7 months ago
0
Decompressing the downloaded tar file is very slow
#421
Nastu-Ho
opened
7 months ago
2
Bump black from 24.1.1 to 24.4.0
#420
dependabot[bot]
closed
7 months ago
1
s3 paths in url_list are not supported
#419
BennySemyonovAB
opened
7 months ago
1
Question about LAION-400M
#418
BIGBALLON
opened
8 months ago
0
The success rate when downloading the sbu data set is extremely low at 0
#417
Luochangjiang10
opened
8 months ago
1
Fix usage example in README
#416
johnbradley
opened
8 months ago
0
placekitten.com example in README fails to download images
#415
johnbradley
opened
8 months ago
1
Bump black from 24.1.1 to 24.3.0
#414
dependabot[bot]
closed
7 months ago
1
Is the field 'similarity' in Parquet file referring to the cosine similarity of the feature representations of image-text pairs? How is this metric computed?
#413
gobigrassland
opened
8 months ago
0
Update fire requirement from <0.6.0,>=0.4.0 to >=0.4.0,<0.7.0
#412
dependabot[bot]
opened
8 months ago
0
Bump pytest from 8.0.0 to 8.1.1
#411
dependabot[bot]
closed
7 months ago
1
Bump mypy from 1.8.0 to 1.9.0
#410
dependabot[bot]
closed
7 months ago
1
Bump pytest from 8.0.0 to 8.0.2
#409
dependabot[bot]
closed
8 months ago
1
Bump pylint from 3.0.3 to 3.1.0
#408
dependabot[bot]
opened
9 months ago
0
Why I can't download laion400M dataset?
#407
SomnusQue
opened
9 months ago
3
Fix glob pattern when gcs url path has a trailing slash
#406
kafonek
closed
9 months ago
1
GCS url_path either not recognized as directory or mangled glob
#405
kafonek
closed
9 months ago
1
Bump pytest from 8.0.0 to 8.0.1
#404
dependabot[bot]
closed
9 months ago
1
Bump black from 24.1.1 to 24.2.0
#403
dependabot[bot]
closed
8 months ago
1
Download hangs at End
#402
zanussbaum
opened
9 months ago
0
laion-coco is not available
#401
vanpersie32
opened
9 months ago
0
Low success rate on donwloading laion400m
#400
tchaton
opened
9 months ago
26
Bump black from 24.1.0 to 24.1.1
#399
dependabot[bot]
closed
10 months ago
0
Bump pytest from 7.4.4 to 8.0.0
#398
dependabot[bot]
closed
10 months ago
0
Option to ignore SSL certificate
#397
theophilegervet
opened
10 months ago
1
Option to ignore SSL certificate
#396
theophilegervet
closed
10 months ago
0
Implement mode to retry failed urls of all shards
#395
rom1504
opened
10 months ago
1
Bump black from 23.12.1 to 24.1.0
#394
dependabot[bot]
closed
10 months ago
0
pyarrow.lib.ArrowInvalid: Empty CSV file
#393
R-Sheldon
opened
10 months ago
0
Update pyarrow requirement from <15,>=6.0.1 to >=6.0.1,<16
#392
dependabot[bot]
closed
10 months ago
0
Next