Training on multiple dialog corpuses

facebookresearch / ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

https://parl.ai

MIT License

Training on multiple dialog corpuses #241

Closed. Henry-E closed this issue 7 years ago

Henry-E commented 7 years ago

I managed to get a couple of the chit-chat tasks training at the same time using the seq2seq agent and MultiWorld. Has this been done much, or does the different tone of dialogue in each corpus confuse the RNN?

alexholdenmiller commented 7 years ago

So it definitely works to run on, e.g., all of the bAbI tasks at once (and it gets better performance on some tasks). I haven't tried running it on multiple completely different tasks yet; performance probably varies a lot depending on which ones you combine!
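
For illustration only, here is a minimal sketch of the general idea of mixing several corpora into one training stream. It is plain Python and does not use or reproduce ParlAI's MultiWorld; the function `interleave_tasks`, the task names, and the toy data are all hypothetical, and uniform random task selection is just one assumed mixing strategy.

```python
import random
from typing import Dict, Iterator, List, Tuple

# An example is an (input text, target reply) pair. Hypothetical toy format.
Example = Tuple[str, str]


def interleave_tasks(
    tasks: Dict[str, List[Example]], steps: int, seed: int = 0
) -> Iterator[Tuple[str, Example]]:
    """Yield `steps` examples, picking a task uniformly at random each step.

    Each task keeps its own cursor and wraps around when exhausted, so a
    small corpus is revisited more often than a large one.
    """
    rng = random.Random(seed)  # fixed seed keeps the stream reproducible
    names = sorted(tasks)
    cursors = {name: 0 for name in names}
    for _ in range(steps):
        name = rng.choice(names)
        data = tasks[name]
        example = data[cursors[name] % len(data)]
        cursors[name] += 1
        yield name, example


# Hypothetical toy corpora standing in for two chit-chat datasets.
toy_tasks = {
    "casual_chat": [("hi", "hello there"), ("how are you?", "fine, thanks")],
    "movie_chat": [("seen any good films?", "yes, one last week")],
}

stream = list(interleave_tasks(toy_tasks, steps=6))
for task_name, (text, reply) in stream:
    print(f"[{task_name}] {text} -> {reply}")
```

A single model trained on such a stream sees examples from every corpus, which is where the question about differing tone comes in: whether it helps or hurts likely depends on how the tasks are weighted and how similar they are.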

Henry-E commented 7 years ago

OK cool, I was just curious whether there are many existing papers that train dialogue or other agents on multiple chit-chat datasets. It seems like an interesting idea, but this is probably more of a reddit-ml type of question, I guess. Thanks!