uber-archive / plato-research-dialogue-system

This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.
Apache License 2.0

Integrate model with new dataset #15

Closed: HarshitaSahai closed this issue 3 years ago

HarshitaSahai commented 3 years ago

I would like to use the model with my own dataset. So far I have completed the following steps:

  1. Created a domain (ontology and database).
  2. Ran "plato run --config " successfully.
  3. Extracted DSTC2 and changed the ontology referenced inside it to match my data.

After this, I'm unable to understand how to move forward. Could you please guide me?

Thank you

al3xpapangelis commented 3 years ago

Are you using DSTC2 data or another dataset? (The DSTC2 parsers are specific to DSTC2 data)

HarshitaSahai commented 3 years ago

I'm using DSTC2, but I replaced the ontology in the section where it is specified with my own. Will that work, or is the model also tied to CamRest internally? Also, when I do the slot filling manually, I'm not sure how to run it and check the result. Could you guide me with that?

Thanks

al3xpapangelis commented 3 years ago

Well, if you train models on DSTC2 and then use a different ontology, it won't work. If you use a new ontology (and database), it should work, but you need to either parse your own data that corresponds to that ontology or create simulated data using Plato's simulators.
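To make the ontology mismatch concrete, here is a minimal sketch of a coverage check: it lists the slot-value pairs in a dataset that a given ontology cannot represent. The ontology layout (an "informable" map plus a "requestable" list), the slot names, and the helper `find_uncovered_slots` are all assumptions made for illustration. None of them are taken from this thread or from Plato's code.

```python
import json

# Hypothetical ontology for a new domain. The informable/requestable layout
# and every slot name and value here are illustrative assumptions.
ONTOLOGY_JSON = """
{
  "informable": {"cuisine": ["thai", "greek"], "city": ["athens", "london"]},
  "requestable": ["cuisine", "city", "phone"]
}
"""


def find_uncovered_slots(ontology, dialogue_slots):
    """Return the (slot, value) pairs that the ontology cannot represent."""
    informable = ontology.get("informable", {})
    uncovered = []
    for slot, value in dialogue_slots:
        # A pair is covered only if the slot exists and lists this value.
        if value not in informable.get(slot, []):
            uncovered.append((slot, value))
    return uncovered


ontology = json.loads(ONTOLOGY_JSON)

# Slot-value pairs as they might appear in annotations from another dataset.
data_slots = [("cuisine", "thai"), ("pricerange", "cheap"), ("city", "paris")]

missing = find_uncovered_slots(ontology, data_slots)
print(missing)
```

Any pair reported here is something a model trained on the original data would have learned to predict but the new ontology has no place for, which is why the data and the ontology need to change together.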

If you want to train models (using your own or simulated data), I would suggest starting by writing a parser (look at the DSTC2 or MetalWOZ parsers). Then you can modify the Ludwig configs to reflect your dataset's attributes and write interfaces similar to camrest_nlu, etc.
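As a rough illustration of what such a parser might do, the sketch below reads dialogues from a made-up JSON format and flattens the user turns into CSV rows, which is a tabular format Ludwig can train from. The input schema, the column names (`transcript`, `intent`, `slots`), and both helper functions are hypothetical. Plato's DSTC2 parser defines its own columns, so check it before settling on a layout.

```python
import csv
import io
import json

# Hypothetical raw data: one dialogue with annotated turns.
# This schema is an assumption for illustration, not a Plato or DSTC2 format.
RAW_DIALOGUES = """
[
  {"turns": [
    {"speaker": "user", "text": "I want thai food",
     "intent": "inform", "slots": {"cuisine": "thai"}},
    {"speaker": "system", "text": "Which city?",
     "intent": "request", "slots": {"city": ""}},
    {"speaker": "user", "text": "in athens",
     "intent": "inform", "slots": {"city": "athens"}}
  ]}
]
"""


def parse_dialogues(raw_json, speaker="user"):
    """Flatten the turns of one speaker into one training row per turn."""
    rows = []
    for dialogue in json.loads(raw_json):
        for turn in dialogue["turns"]:
            if turn["speaker"] != speaker:
                continue
            # Encode slots as space-separated "name=value" tokens.
            slots = " ".join(
                f"{name}={value}" for name, value in sorted(turn["slots"].items())
            )
            rows.append(
                {"transcript": turn["text"], "intent": turn["intent"], "slots": slots}
            )
    return rows


def rows_to_csv(rows):
    """Serialise the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["transcript", "intent", "slots"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = parse_dialogues(RAW_DIALOGUES)
csv_text = rows_to_csv(rows)
print(csv_text)
```

The column names chosen here would then have to match the input and output feature names in the Ludwig config used for training.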

I hope this helps!