Closed mmarius closed 5 years ago
In the Deep contextualized word representations paper you report an average forward and backward perplexity of 39.7. Is this on the validation data or on the training data?
Validation
Thanks!
In the Deep contextualized word representations paper you report an average forward and backward perplexity of 39.7. Is this on the validation data or on the training data?