Example of using this lib for RLHF?

coax-dev / coax

Modular framework for Reinforcement Learning in Python

https://coax.readthedocs.io

MIT License

168 stars 17 forks

Open asmith26 opened 1 year ago

asmith26 commented 1 year ago

Just wondering if there are any examples of using this lib to implement RLHF (Reinforcement Learning from Human Feedback)?

Many thanks for any help! :)