microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
https://aka.ms/GeneralAI
MIT License
19.08k stars 2.43k forks source link

[textdiffuser2]the unzip cannot download #1565

Open shuyueW1991 opened 1 month ago

shuyueW1991 commented 1 month ago

I downloaded data as well as file folder, then I reach out to Mario-laion-unzip.py to execute so that a properly structured dataset can be set up. I also found there is a global env variable ,but I find that this variable does not appear again in the code. So, my question is how can I make a a dataset that is based on the and download the images according to the urls in folder. Is there a code or code snippet for that?