Open vikigenius opened 5 months ago
The paper states that you trained for 5 epochs on 32 A100 GPUs. Do you have an estimate of how long that took? Also, were they 40 GB or 80 GB A100s?
The 80 GB A100s were used. Training took around one week.
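For anyone budgeting a reproduction, here is a rough back-of-the-envelope conversion of those figures into GPU-hours. It is only a sketch: it assumes "around one week" means 7 full days of wall-clock time with all 32 GPUs busy throughout, which the thread does not confirm. The function name `gpu_hours` is made up for illustration.

```python
# Rough compute estimate from the figures quoted in this thread.
# Assumption: "around one week" = 7 days of continuous use of all GPUs.
NUM_GPUS = 32   # from the paper, as quoted in the question
DAYS = 7        # approximate, from the reply
EPOCHS = 5      # from the paper, as quoted in the question


def gpu_hours(num_gpus: int, days: float) -> float:
    """Total GPU-hours for num_gpus running continuously for `days` days."""
    return num_gpus * days * 24


total = gpu_hours(NUM_GPUS, DAYS)
per_epoch = total / EPOCHS

print(f"Total: ~{total:.0f} A100 GPU-hours")
print(f"Per epoch: ~{per_epoch:.0f} A100 GPU-hours")
```

Under those assumptions this comes to roughly 5,400 A100 (80 GB) GPU-hours in total, or a bit over 1,000 per epoch. Treat it as an order-of-magnitude figure, since the actual wall-clock time was only given approximately.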