cszmli / Rethink-RL-Sup

Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems