PKU-YuanGroup / Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
https://arxiv.org/pdf/2311.10122.pdf
Apache License 2.0
3.02k stars 220 forks source link

Can the confidence coefficient of an answer be obtained? #162

Closed IsabelJimenez99 closed 6 months ago

IsabelJimenez99 commented 6 months ago

I am testing with the model ‘LanguageBind/Video-LLaVA-7B-hf’, and every time I run it on an image, I get a different answer. I would like to know how much confidence the model has in each response, could I know?