fawazsammani / nlxgpt

NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)
44 stars 10 forks source link

How do I get the results of the VCR dataset stated in the appendix by running the source code #9

Closed Gary-code closed 1 year ago

Gary-code commented 1 year ago

How do I get the results of the VCR dataset stated in the appendix by running the source code. Directly run vcr.py and fine tune with the pre-trained model on the Caption dataset?