forever208 closed this issue 7 months ago
The default iteration count is 1300k, which is going to take 5 days on a single A100... Does the training really need 5 days?
@gnobitab Hi, it would be appreciated if you could share your training time on an A100.
To the best of my knowledge, it takes ~3 days on a 3090 GPU
@yuanzhi-zhu thanks!
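For anyone comparing their own run against the numbers in this thread, here is a rough sketch of the arithmetic. It only converts between iteration count, wall-clock days, and throughput; the function names are made up for illustration, and the actual speed will depend on batch size, model config, and data loading.

```python
SECONDS_PER_DAY = 86_400

def implied_throughput(total_iters: int, days: float) -> float:
    """Iterations per second implied by finishing total_iters in `days` days."""
    return total_iters / (days * SECONDS_PER_DAY)

def days_needed(total_iters: int, iters_per_sec: float) -> float:
    """Wall-clock days to run total_iters at a measured iterations-per-second rate."""
    return total_iters / iters_per_sec / SECONDS_PER_DAY

TOTAL_ITERS = 1_300_000  # the default of 1300k iterations mentioned above

# Throughput implied by the two estimates reported in this thread
print(f"5 days -> {implied_throughput(TOTAL_ITERS, 5):.2f} it/s")  # 5 days -> 3.01 it/s
print(f"3 days -> {implied_throughput(TOTAL_ITERS, 3):.2f} it/s")  # 3 days -> 5.02 it/s
```

To estimate your own run, time a few hundred iterations, compute it/s, and pass that to `days_needed`.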