Open Utkarsh4430 opened 4 months ago
Hey! Great work, guys! I just wanted to know how much compute did it take to pretrain the model. The number and size of GPUs and the number of hours the model was trained. Thanks!
Based on our experiment settings, pretraing stage and finetuning stage take around 5\~6 hours and 16\~17 hours on 32 A100, respectively.
did you use 40GB or 80GB A100?
Hey! Great work, guys! I just wanted to know how much compute did it take to pretrain the model. The number and size of GPUs and the number of hours the model was trained. Thanks!