allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences
https://rl4lms.apps.allenai.org/
Apache License 2.0
2.13k stars 191 forks source link

how to stop env parallel multi-process to debug env.step()? #68

Open invoker-LL opened 6 months ago