Closed NoB0 closed 10 months ago
This PR includes:
Trainer
TrainerDQN
Note that some methods related to user simulation will be part of another PR.
This PR includes:
Trainer
classTrainerDQN
to train a dialogue policy using DQN algorithmNote that some methods related to user simulation will be part of another PR.