Open Morgott-The-Omen-King opened 1 month ago
Hello, authors,
When will the audio-visual fine-tuned Video-LLaMA2 be released? Or can we fine-tune it ourselves, starting from the already visually fine-tuned Video-LLaMA2?
Thanks in advance.
I have the same question. I would like to use the Video-LLaMA2 training code to evaluate some datasets, and having the audio part would be very interesting. Thank you!