huggingface / autotrain-advanced

🤗 AutoTrain Advanced
https://huggingface.co/autotrain
Apache License 2.0
3.65k stars 442 forks source link

[Feature Request] Adding the ppo trainer #607

Closed Esmail-ibraheem closed 4 weeks ago

Esmail-ibraheem commented 2 months ago

Feature Request

Adding the proximal policy optimization (ppo) trainer

Motivation

Applying the ppo trainer, so we can compare between the two trainers: ppo and dpo

Additional Context

No response

github-actions[bot] commented 1 month ago

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] commented 4 weeks ago

This issue was closed because it has been inactive for 20 days since being marked as stale.