flowersteam / lamorel

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
MIT License
176 stars, 15 forks

PPO upgrade #6

Closed by ClementRomac 1 year ago