Open RyanHuangNLP opened 5 years ago
I found that training on the 500-line sample data takes 40 minutes per epoch on a single P40. If I train on a large corpus (e.g. 2 GB), is it really necessary to train for 50 epochs?