artidoro opened this issue 4 years ago (status: Open)
This should be useful for the non-attention models: https://github.com/pytorch/examples/tree/master/word_language_model