AlibabaResearch / DAMO-ConvAI

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
MIT License
1.15k stars 185 forks source link

Inquiry About Code Release for "Fine-Tuning Language Models with Reward Learning on Policy #154

Open Ramyyang opened 1 month ago

Ramyyang commented 1 month ago

I am very interested in your paper "Fine-Tuning Language Models with Reward Learning on Policy." Could you please let me know when you plan to release the code for this work?

Thank you very much!