Closed boxbox2 closed 8 months ago
Please see #29 for more details.
When the batch size is small, it may lead to an entire batch consisting solely of abnormal samples, thereby affecting the calculation of the paper's loss formula (Formula 9). We have fixed this. We have fixed this bug. Please try again.
train_loss increases by two points from 2.03 to 4.23, sometimes becoming nan when it increases to 10, and sometimes becoming nan when it reaches 20