Open jx1100370217 opened 4 years ago
Why does the valid_ppl become larger as the training progresses?
Why does the valid_ppl become larger as the training progresses?