rese1f / MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
https://rese1f.github.io/MovieChat/
BSD 3-Clause "New" or "Revised" License

How much time does it cost to train this model? #35

Closed: zhangyuereal closed this issue 11 months ago

zhangyuereal commented 11 months ago

Dear author, how long does it take to train this model, and on what type of GPU?

Espere-1119-Song commented 11 months ago

Thank you for your interest. Please see the closed issue https://github.com/rese1f/MovieChat/issues/2: "Our method is training-free, you can implement this mechanism in any model." For inference, we use a 4090, and MovieChat takes around 20GB of GPU memory.