penghao-wu / vstar

PyTorch implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
https://vstar-seal.github.io/
MIT License

Training cost #14

Closed by lxysl 3 months ago

lxysl commented 4 months ago

May I ask how many GPUs you used to train the model? Thank you!

penghao-wu commented 4 months ago

We use 2 or 4 A100-80G GPUs for training.