Closed: bobo123yhb closed this issue 3 years ago
Do you mean that the batch size needs to be set smaller for a 2080ti GPU? My suggestion is to use a smaller learning rate and train longer, until convergence. If the batch size is not much smaller than the default batch size provided, you can still use the same learning rate. If you have enough capacity to do a hyperparameter search, I strongly suggest you do so, since the default learning rate in my experiments is based on 4x 16GB V100 GPUs.
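As a rough illustration of the advice above, here is a minimal sketch of one common heuristic (linear scaling of the learning rate with batch size) plus a small grid of candidates for a hyperparameter search. This is not necessarily the rule used in this repository. The function names `scaled_lr` and `lr_candidates`, the search factors, and the example values (base learning rate 0.1, batch sizes 64 and 16) are all assumptions made for illustration; substitute the defaults from the actual config.

```python
def scaled_lr(base_lr, base_batch_size, new_batch_size):
    """Linear scaling heuristic: keep lr / batch_size constant.

    This is a common rule of thumb, not a guarantee; treat the result
    as a starting point for a search.
    """
    if base_lr <= 0 or base_batch_size <= 0 or new_batch_size <= 0:
        raise ValueError("learning rate and batch sizes must be positive")
    return base_lr * new_batch_size / base_batch_size


def lr_candidates(center_lr, factors=(0.25, 0.5, 1.0, 2.0)):
    """Return learning rates around center_lr for a small grid search."""
    return [center_lr * f for f in factors]


if __name__ == "__main__":
    # Hypothetical values: the default config is assumed to use
    # lr=0.1 with a total batch size of 64 across 4 GPUs, and the
    # single 2080ti is assumed to fit a batch size of 16.
    start = scaled_lr(base_lr=0.1, base_batch_size=64, new_batch_size=16)
    for lr in lr_candidates(start):
        print(f"try lr={lr:.5f}")
```

With a smaller batch size, each epoch contains more optimizer steps, so it may still be worth training for more epochs and checking the validation curve before deciding the run has converged.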
I am very interested in your work, but I have some questions about how to set the learning rate. I currently only have a single 2080ti graphics card. How should the learning rate be adjusted?