Hi authors, I want to use Video-LLaMA to run inference on my own dataset. I found that the current framework supports a maximum of 32 input frames. If I set the number of frames in the config to more than 32, an error is raised. How can I increase the number of input frames beyond 32?
thanks!!!