Hi, thanks for sharing this great work! I am trying to train a model myself with the hybrid filled setting for MNLI, and one epoch takes more than 5 hours. Is training expected to be this slow, or might something be wrong with what I am doing? I am using essentially the same hyperparameters as those provided in the analysis folder.