Closed zyzhang1130 closed 4 years ago
Hi, May i check with you why there are 20000 step before training by default? What is the use of it? Thank you.
This allows a sufficient amount of data to be gathered before training, to prevent overfitting to a small dataset.
noted with thanks
Hi, May i check with you why there are 20000 step before training by default? What is the use of it? Thank you.