Closed t1307109256 closed 10 months ago
Hi! Thanks for your interest in our work. You can follow these steps to download CC3M:
It takes a few days to download the entire dataset.
Closing this for now. Feel free to reopen or comment if you have further questions :)
Thanks, but I have another question. How do I download the validation set?
Pre-training requires a validation set:
python -m src.main --name exp1 \
    --train_data <path to (poisoned) train csv file> \
    --validation_data
This is just the ImageNet validation set; does that answer the question?
However, the labels.csv file of the ImageNet validation set has no caption column, so an error is raised when --caption_key is set to caption.
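For anyone hitting the same error, a quick way to confirm the mismatch before launching training is to check the CSV header for the column name passed as --caption_key. This is only a minimal sketch using the standard library; the helper name `csv_has_column` is hypothetical and not part of the repository.

```python
import csv


def csv_has_column(csv_path, column):
    """Return True if the header (first row) of the CSV contains `column`."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        # An empty file has no header row, so fall back to an empty list.
        header = next(csv.reader(f), [])
    return column in header


if __name__ == "__main__":
    import sys

    # Usage: python check_column.py <path to csv file> <column name>
    path, key = sys.argv[1], sys.argv[2]
    print(f"{key!r} present in {path}: {csv_has_column(path, key)}")
```

If this prints False for labels.csv with the key caption, the file cannot be used with --caption_key caption as-is.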
Hey! I checked some stuff and it looks like you may not need to specify validation data. Can you try to remove it from the command and run again?
Hello author, I am a novice and would like to ask how to use the utils/download.py script to download the images from their URLs for CC3M and/or CC12M. Can you give me an example?
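The options of utils/download.py are not shown in this thread, so the following is not that script. It is a generic sketch of the same idea, assuming a tab-separated file with one caption and one image URL per line and no header row. All function names, the `.jpg` extension, and the `image` column name are assumptions made for illustration.

```python
import csv
import os
import urllib.request


def read_pairs(tsv_path):
    """Yield (caption, url) pairs from a tab-separated file (assumed layout)."""
    with open(tsv_path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\n").split("\t")
            if len(parts) >= 2:
                yield parts[0], parts[1]


def fetch_bytes(url, timeout=10):
    """Download one URL and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def download_images(pairs, out_dir, fetch=fetch_bytes):
    """Save each image to out_dir; return (filename, caption) for successes."""
    os.makedirs(out_dir, exist_ok=True)
    rows = []
    for index, (caption, url) in enumerate(pairs):
        try:
            data = fetch(url)
        except Exception:
            # Dead links are expected in web-scraped datasets; skip them.
            continue
        # Assumption: every file is stored with a .jpg extension.
        name = f"{index:08d}.jpg"
        with open(os.path.join(out_dir, name), "wb") as f:
            f.write(data)
        rows.append((name, caption))
    return rows


def write_index(rows, csv_path):
    """Write a CSV with hypothetical 'image' and 'caption' columns."""
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["image", "caption"])
        writer.writerows(rows)
```

A typical call would be `write_index(download_images(read_pairs("pairs.tsv"), "images"), "train.csv")`, where the three paths are placeholders. The download is sequential, which is consistent with the earlier remark that fetching the entire dataset takes days.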