cvlab-columbia / viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
Other
1.63k stars 117 forks source link

API for VideoQA #38

Closed Ziyang412 closed 6 months ago

Ziyang412 commented 8 months ago

Hi, thanks for the great work! I really love this paper and really happy to try some VideoQA examples on this model.

In this process, I faced some questions. It will be really nice of you to share any useful information with me!

  1. The API in the paper and in the api.prompt file are slightly different, may I ask which is the final version?

  2. Also, I noticed https://github.com/cvlab-columbia/viper/blob/bde4c6343825e6a131547cdfdeed8a62c9ac4b11/prompts/api.prompt#L443, I am wondering whether you also add some in-context examples to the prompt, if so, may I know the exact number and the exact examples? Knowing this prompt will largely help me reproduce the results of the VideoQA task.

Any help would be highly appreciated. Thanks in advance!

surisdi commented 6 months ago

Hi, we updated the code with the benchmark-specific prompts (including examples).