Open vishaal27 opened 1 year ago
Hi, we will publish it soon, working on data release. We will update. Thank you :)
Hi, it's updated now in the project website, sorry for the delay, please ask if anything is unclear CSV: https://seetrue.s3.amazonaws.com/wysiwyr_train.csv Images: https://drive.google.com/file/d/1M1CKmYkIdpFYjCOc9JwXHP5Z7E91CJl3/view?usp=drive_link
Hey, thanks for the great work!
I had a question about the training dataset for the end-to-end VNLI model. In the paper you mention:
Specifically, we finetune BLIP2 and PaLI-17B using a dataset comprising 110K text-image pairs labeled with alignment annotations. This includes 44K examples from COCO-Con, 3.5K from PickaPic-Con, 20K from COCO t2i and 40K from the training split of the SNLI-VE dataset.
However, I was unable to find the training split on AWS/Huggingface. Are there plans to release it, or if it has already been released, could you please point me to where I can find it?