Update train.py - Githubissues

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

MIT License

7.67k stars 668 forks source link

Closed KnightBits closed 1 year ago

KnightBits commented 1 year ago

Added autosave of model state every 5 thousand iterations