microsoft / TAP

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)
MIT License
70 stars 11 forks source link

What is the val set of pre-train #8

Closed JayZhu0104 closed 2 years ago

JayZhu0104 commented 2 years ago

Hi! After downloading OCR-CC features, I found that there were only feature files of training set. But I noticed that the IMDB file contains information about the val set. And the 'tap_base_pretrain.yml' file needs to fill in the val set and test set. What should be filled in this part? Thanks a lot! image

zyang-ur commented 2 years ago

Hi @JayZhu0104 ,

We don't use the CC-val set and the related files are included just for completeness.

You could either skip the validation or place the textvqa's val set in the boxed line. Since we would use the final iteration's ckpt in pretraining, it should not matter.

JayZhu0104 commented 2 years ago

Thank you for your reply!