DAMO-NLP-SG / VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Apache License 2.0
871 stars 60 forks source link

Could you please advise when the checkpoint for the audio branch will be made public? #87

Open ymxyll opened 2 months ago

ymxyll commented 2 months ago

I have a project planned that relies on it recently. Thank you for your response.

lixin4ever commented 2 months ago

We are working on this, please stay tuned.

DwanZhang-AI commented 2 months ago

Any update?

XuecWu commented 2 months ago

The same request. Looking forward to hearing from you. Thanks a lot.

qixueweigitbub commented 4 weeks ago

Same request here.

xinyifei99 commented 3 weeks ago

Thanks for your attention! You can switch to the audio_visual branch (https://github.com/DAMO-NLP-SG/VideoLLaMA2/tree/audio_visual) and clone the repository to run inference for audio related tasks.

LiangMeng89 commented 2 days ago

The same request. Looking forward to hearing from you. Thanks a lot.

Hello,I'm a phD student from ZJU, I also use videollama2 to do my own research,we create a WeChat group to discuss some issues of videollama2 and help each other,could you join us? Please contact me: WeChat number == LiangMeng19357260600, phone number == +86 19357260600,e-mail == liangmeng89@zju.edu.cn.