flowersteam / lamorel

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
MIT License
176 stars 14 forks

#40: Stability issues with gradient accumulation #41

Closed: ClementRomac closed this 3 months ago