DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
BSD 3-Clause "New" or "Revised" License
2.83k stars 263 forks source link

Dear author, How much time does it cost to train this model? With what type of GPU cards? #136

Open zhangyuereal opened 11 months ago

zhangyuereal commented 11 months ago

Dear author, How much time does it cost to train this model? With what type of GPU cards?