coax-dev / coax

Modular framework for Reinforcement Learning in python
https://coax.readthedocs.io
MIT License
168 stars 17 forks source link

Example of using this lib for RLHF? #41

Open asmith26 opened 1 year ago

asmith26 commented 1 year ago

Just wondering if there are any example of using this lib for implement RLHF (Reinforcement Learning from Human Feedback)?

Inspired by: https://openai.com/blog/chatgpt image

Many thanks for any help! :)