cpllab / lm-zoo

Easy black-box access to state-of-the-art language models
https://cpllab.github.io/lm-zoo/
MIT License
14 stars 6 forks source link

add TinyLSTM model #47

Closed hans closed 4 years ago

hans commented 4 years ago

Model

"Tiny LSTM" model used in recent ACL SyntaxGym papers. Based on PyTorch sample LSTM implementation. Relatively shallow stacked LSTM with dropout.

Training

Complexity:

Performance: 57.09 perplexity on held-out BLLIP-LG from Hu et al 2020.

Other notes

NB tests will probably fail, since this model supports the mount_checkpoint feature in advance of that feature being available on develop.

Licensing

MIT