I noticed that the results in Figure 4(c) are for 3 epochs. Could you share the training script for this experiment, or the key hyperparameters, such as learning rate, rank, and batch size?
The hyperparameters for the 3-epoch experiment are exactly the same as those provided in the project script. You just need to use all the training data and change the number of epochs to 3.