Question about training dataset

vishaal27 commented 1 year ago

Hey, thanks for the great work!

I had a question about the training dataset for the end-to-end VNLI model. In the paper you mention:

"Specifically, we finetune BLIP2 and PaLI-17B using a dataset comprising 110K text-image pairs labeled with alignment annotations. This includes 44K examples from COCO-Con, 3.5K from PickaPic-Con, 20K from COCO t2i and 40K from the training split of the SNLI-VE dataset."

However, I was unable to find the training split on AWS or Hugging Face. Are there plans to release it? If it has already been released, could you please point me to where I can find it?

yonatanbitton commented 1 year ago

Hi, we will publish it soon; we are working on the data release and will post an update. Thank you :)

yonatanbitton commented 11 months ago

Hi, the training data is now available on the project website. Sorry for the delay, and please ask if anything is unclear.

CSV: https://seetrue.s3.amazonaws.com/wysiwyr_train.csv
Images: https://drive.google.com/file/d/1M1CKmYkIdpFYjCOc9JwXHP5Z7E91CJl3/view?usp=drive_link
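For anyone who wants to check the CSV before downloading the images, here is a minimal sketch using only the Python standard library. The URL is the one posted above; the helper names `open_remote_text` and `inspect_csv` are hypothetical, the UTF-8 encoding is an assumption, and the code makes no assumption about the column names, since the thread does not describe the schema.

```python
import csv
import io
import itertools
import urllib.request

# URL copied from the comment above; everything else here is illustrative.
CSV_URL = "https://seetrue.s3.amazonaws.com/wysiwyr_train.csv"


def open_remote_text(url, encoding="utf-8"):
    """Open a remote file as a text stream (hypothetical helper; encoding is assumed)."""
    response = urllib.request.urlopen(url)
    return io.TextIOWrapper(response, encoding=encoding, newline="")


def inspect_csv(stream, max_rows=None):
    """Return (column_names, rows_read) without assuming any particular schema."""
    reader = csv.DictReader(stream)
    # islice with stop=None consumes every row; otherwise it stops after max_rows.
    rows_read = sum(1 for _ in itertools.islice(reader, max_rows))
    return list(reader.fieldnames or []), rows_read


# Possible usage (requires network access):
#     with open_remote_text(CSV_URL) as stream:
#         columns, count = inspect_csv(stream, max_rows=1000)
#         print(columns, count)
```

Printing the column names first makes it easier to see which fields hold the text, the image reference, and the alignment label before writing any loading code that depends on them.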

yonatanbitton / wysiwyr

Question about training dataset #7