Closed msarkeshi closed 9 months ago
Thanks, this is great work. I am seeing hallucinations with some of the videos I try. For example, the model sees people who don't exist or objects (such as a cell phone) that are not in the video. Would upgrading to a bigger LLM solve this? Is there any parameter (other than temperature) that I can adjust to minimize hallucinations?
Hi @mehdisarkeshi,
One of the quickest solutions could be to explicitly instruct the model to be brief. However, a better solution could be to use a bigger model and cleaner data.
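As a rough illustration of the "instruct the model to be brief" suggestion, here is a minimal sketch that prepends an instruction to the question before it is sent to the model. The instruction wording and the names `BREVITY_INSTRUCTION` and `build_prompt` are hypothetical and not part of this repository; adapt them to however your inference script assembles the prompt.

```python
# Hypothetical instruction text; tune the wording for your own videos.
BREVITY_INSTRUCTION = (
    "Answer briefly. Describe only what is clearly visible in the video, "
    "and do not mention people or objects you are not sure about."
)


def build_prompt(question: str, instruction: str = BREVITY_INSTRUCTION) -> str:
    """Prepend the instruction to the user's question (illustrative helper)."""
    return f"{instruction}\n\n{question.strip()}"


prompt = build_prompt("What is happening in this video?")
print(prompt)
```

Shorter answers leave the model less room to add unsupported details, though this is only a mitigation and does not guarantee that hallucinations disappear.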
Hi @msarkeshi,
The Vicuna 13B and 33B models have different hidden dimensions from the Vicuna 7B. As a result, our linear projection layer, which was trained specifically for the 7B model, is not directly compatible with these larger models.
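To make the mismatch concrete, here is a small sketch that compares the weight shape a projection layer would need for each model. The hidden sizes are those of the published LLaMA/Vicuna configurations; the visual feature width of 1024 and the names `VISUAL_DIM`, `projection_shape`, and `is_compatible` are assumptions for illustration, not values taken from this repository.

```python
# Hidden sizes of the LLaMA/Vicuna family, per their published configs.
HIDDEN_SIZE = {"vicuna-7b": 4096, "vicuna-13b": 5120, "vicuna-33b": 6656}

# Assumption: width of the visual features fed into the projection layer.
VISUAL_DIM = 1024


def projection_shape(llm_name: str, visual_dim: int = VISUAL_DIM) -> tuple:
    """Shape (in_features, out_features) a linear projection needs for this LLM."""
    return (visual_dim, HIDDEN_SIZE[llm_name])


def is_compatible(trained_for: str, target: str) -> bool:
    """A projection trained for one LLM fits another only if the shapes match."""
    return projection_shape(trained_for) == projection_shape(target)


for name in HIDDEN_SIZE:
    print(name, projection_shape(name), is_compatible("vicuna-7b", name))
```

Under these assumptions, a projection trained for the 7B model has a different output width from what the 13B or 33B model expects, so it would have to be retrained for the larger model rather than reused.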
Thank you.