Closed Preet538-neitzen closed 3 years ago
We trained each model for 200k steps with a batch size of 4. Training takes around 10 hours per model.
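For a rough sense of what those numbers imply, here is a back-of-the-envelope sketch. It assumes a single device with no gradient accumulation (neither is stated in the thread), and since the 10 hours is approximate, the rates are approximate too. The variable names are only for illustration.

```python
# Figures quoted in the comment above.
steps = 200_000
batch_size = 4
hours_per_model = 10  # "around 10 hours", so results are approximate

# Assumption: one optimizer step consumes exactly one batch of 4 samples.
samples_seen = steps * batch_size
seconds = hours_per_model * 3600

steps_per_second = steps / seconds
samples_per_second = samples_seen / seconds

print(f"samples seen: {samples_seen}")
print(f"steps/s: {steps_per_second:.2f}")
print(f"samples/s: {samples_per_second:.2f}")
```

So each model sees about 800k samples in total, at roughly 5.6 steps per second.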