huggingface / trl

Train transformer language models with reinforcement learning.
http://hf.co/docs/trl
Apache License 2.0
8.61k stars 1.06k forks source link

better trl parser with yaml config #1739

Closed mnoukhov closed 1 week ago

mnoukhov commented 2 weeks ago

Fixes #1733

@younesbelkada let me know if you have comments or suggestions!

HuggingFaceDocBuilderDev commented 2 weeks ago

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.