rom1504 / img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20 hours on one machine.
MIT License
3.71k stars 338 forks

use a pipeline concept to refactor downloader.py #265

Open rom1504 opened 1 year ago

rom1504 commented 1 year ago

https://github.com/webdataset/webdataset/blob/main/webdataset/pipeline.py
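For discussion, here is a rough sketch of what a pipeline concept could look like for the downloader. It is not taken from webdataset's `pipeline.py` or from the current `downloader.py`: the `Pipeline` class and the stage names (`fake_download`, `drop_empty`, `to_record`) are hypothetical, and the download stage is a stand-in that does no network I/O. The only idea it assumes is that each stage is a generator function that takes an iterator of samples and yields transformed samples, so stages can be chained lazily.

```python
from typing import Any, Callable, Iterable, Iterator

# A stage consumes an iterator of samples and returns a new iterator.
Stage = Callable[[Iterator[Any]], Iterator[Any]]


class Pipeline:
    """Hypothetical minimal pipeline: a source iterable followed by stages."""

    def __init__(self, source: Iterable[Any], *stages: Stage) -> None:
        self.source = source
        self.stages = list(stages)

    def __iter__(self) -> Iterator[Any]:
        # Chain the stages lazily; nothing runs until the result is consumed.
        stream: Iterator[Any] = iter(self.source)
        for stage in self.stages:
            stream = stage(stream)
        return stream


def fake_download(urls: Iterator[str]) -> Iterator[tuple]:
    # Placeholder for a real fetch: pretends the URL bytes are the image bytes.
    for url in urls:
        yield url, url.encode("utf-8")


def drop_empty(samples: Iterator[tuple]) -> Iterator[tuple]:
    # Filter out samples whose (pretend) download produced no data.
    for url, data in samples:
        if data:
            yield url, data


def to_record(samples: Iterator[tuple]) -> Iterator[dict]:
    # Attach a zero-padded key to each surviving sample.
    for index, (url, data) in enumerate(samples):
        yield {"key": f"{index:09d}", "url": url, "size": len(data)}


urls = ["http://example.com/a.jpg", "", "http://example.com/b.jpg"]
pipeline = Pipeline(urls, fake_download, drop_empty, to_record)
records = list(pipeline)
```

With this shape, resizing, filtering, and writing could each become one more stage, and each stage could be tested on its own with a plain list as input.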