llava-rlhf / LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF
https://llava-rlhf.github.io/
GNU General Public License v3.0

Question about the optimization time #33

Closed: JulioZhao97 closed this issue 2 months ago

JulioZhao97 commented 2 months ago

Could you please tell me how long training takes and how many GPUs are needed?

Edward-Sun commented 2 months ago

Hi @JulioZhao97, the PPO training takes 1-2 days on 8 x A100-80G GPUs.
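
For anyone planning a similar run, below is a minimal pre-flight sketch (not part of the LLaVA-RLHF codebase; the GPU count and 80 GB figure come from the reply above, everything else is illustrative) that checks whether the machine matches the 8 x A100-80G setup before launching a 1-2 day PPO job:

```python
# Hypothetical pre-flight check, not from the LLaVA-RLHF repo:
# verifies the 8 x A100-80G setup mentioned in this thread.
import torch

REQUIRED_GPUS = 8       # from Edward-Sun's reply above
REQUIRED_MEM_GIB = 80   # A100-80G per-device memory

def check_gpu_setup() -> None:
    if not torch.cuda.is_available():
        raise RuntimeError("CUDA is not available on this machine.")
    n = torch.cuda.device_count()
    if n < REQUIRED_GPUS:
        raise RuntimeError(f"Found {n} GPUs, need {REQUIRED_GPUS}.")
    for i in range(REQUIRED_GPUS):
        # total_memory is reported in bytes; convert to GiB
        mem_gib = torch.cuda.get_device_properties(i).total_memory / 2**30
        if mem_gib < REQUIRED_MEM_GIB * 0.95:  # small tolerance for reserved memory
            raise RuntimeError(
                f"GPU {i} has {mem_gib:.0f} GiB, expected ~{REQUIRED_MEM_GIB} GiB."
            )
    print(f"OK: {REQUIRED_GPUS} GPUs with ~{REQUIRED_MEM_GIB} GiB or more each.")

if __name__ == "__main__":
    check_gpu_setup()
```

Running this before submitting the training job catches mismatched hardware early, rather than partway into a multi-day PPO run.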