bfshi / scaling_on_scales

When do we not need larger vision models?
MIT License
321 stars 9 forks source link

About Vstar Evaluation #14

Open jungle-gym-ac opened 3 months ago

jungle-gym-ac commented 3 months ago

Hello, may I ask about how you evaluate your models on Vstar? Did you just directly used multiple_choices_inference function provided from Vstar to calculate the log-likelihood of the model on each option, and select the option with the highest likelihood

bfshi commented 3 months ago

Yes that's right.