chufanchen / read-paper-and-code

0 stars 0 forks source link

ICLR 2024 | CPPO: Continual Learning for Reinforcement Learning with Human Feedback #87

Open chufanchen opened 7 months ago

chufanchen commented 7 months ago

https://openreview.net/forum?id=86zAUE80pP