Closed David19970306 closed 5 months ago
The code used for training may take a while to become publicly available, but you can refer to the data and training methods in the paper for training.
May I ask how much resources (GPU time) it took to train the model? The paper didn't seem to mention this, and I'm just curious!
May I ask how much resources (GPU time) it took to train the model? The paper didn't seem to mention this, and I'm just curious!
We trained it with 32 A100s for roughly 5 days.
Is it time to update the training code?
in our lab, we have the urgent issue to train this model by ourself, when can you published the paper and the codes for trainning?