ai-forever / ru-gpts

Russian GPT3 models.
Apache License 2.0
2.08k stars, 444 forks

RuGPT-2 Large model colab notebook #37

Closed fen0s closed 3 years ago

fen0s commented 3 years ago

https://colab.research.google.com/drive/1lSx9P4C60umxpoROgz3b0OZfy7EcDgrg Perhaps it could be added to the README? I got it running with the help of gradient checkpointing and by switching the optimizer from AdamW to SM3. It uses a fork of the aitextgen library.
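For context on why the optimizer swap helps with memory, here is a rough back-of-the-envelope sketch, not code from the notebook. It assumes AdamW keeps two state tensors (first and second moment) per parameter, and that SM3 without momentum keeps one accumulator vector per tensor axis. The layer shape below is hypothetical and chosen only for illustration; the function names are made up for this example.

```python
from math import prod


def adamw_state_elems(shape):
    """Optimizer state elements for one tensor under AdamW:
    a first-moment and a second-moment buffer, each the size of the tensor."""
    return 2 * prod(shape)


def sm3_state_elems(shape):
    """Optimizer state elements for one tensor under SM3 (assuming no momentum):
    one accumulator vector per axis, so the total is the sum of the dimensions."""
    return sum(shape)


# Hypothetical weight matrix shape, for illustration only.
shape = (1280, 5120)

adamw = adamw_state_elems(shape)
sm3 = sm3_state_elems(shape)
print(f"AdamW state elements: {adamw}")
print(f"SM3 state elements:   {sm3}")
```

Gradient checkpointing addresses a different part of the memory budget: it discards intermediate activations in the forward pass and recomputes them during the backward pass, trading extra compute for lower peak memory.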

king-menin commented 3 years ago

Please make a pull request.