Closed brucefan1983 closed 3 months ago
This PR aims to improve the training results when using mini-batches, based on the following tricks:
This PR aims to improve the training results when using mini-batches, based on the following tricks: