budzianowski / multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
MIT License
861 stars, 200 forks

Restaurant name error in the dataset "MUL1382.json" #46

Closed: Victorwz closed this issue 4 years ago

Victorwz commented 4 years ago

In the Multi-WOZ 2.2 "data.json", line 7419586, dialog_id : MUL1382.json:
"text": "We've narrowed it down to 3. kihinoor, the gandhi, and mahal of cambridge. Would you like me to make a reservation for you?"
"text": "Yes please make a reservation for 3 people at 16:00 on Saturday at any of those choices."
"text": "I was able to book at Kohinoor for 16:00 on Saturday for 3 people. Your reference number is NTJ52ASI. The table will be held for 15 minutes."

In the database for the restaurant domain, there is no restaurant named "kihinoor", but there is one named "kohinoor". Based on the next two utterances, I believe the first restaurant name in the first utterance should be "kohinoor".
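For anyone who wants to look for similar cases, here is a minimal sketch of the check described above: compare the names mentioned in an utterance against the database names and suggest the closest match for any that are missing. The name list is a small hand-typed sample rather than the real restaurant database, and `find_unknown_names` is a hypothetical helper, not something from this repo.

```python
import difflib

# Hypothetical sample of restaurant names. In practice these would be
# loaded from the restaurant-domain database shipped with the dataset.
DB_NAMES = ["kohinoor", "the gandhi", "mahal of cambridge"]


def find_unknown_names(candidates, db_names, cutoff=0.8):
    """Map each candidate that is not in db_names to its closest database name.

    The value is None when no database name is similar enough (per cutoff).
    """
    known = sorted({name.strip().lower() for name in db_names})
    suggestions = {}
    for cand in candidates:
        key = cand.strip().lower()
        if key in known:
            continue  # exact match, nothing to report
        # get_close_matches ranks by difflib.SequenceMatcher similarity ratio.
        matches = difflib.get_close_matches(key, known, n=1, cutoff=cutoff)
        suggestions[key] = matches[0] if matches else None
    return suggestions


# Names as they appear in the first utterance of MUL1382.json.
mentioned = ["kihinoor", "the gandhi", "mahal of cambridge"]
result = find_unknown_names(mentioned, DB_NAMES)
print(result)
```

Extracting the candidate names from free-form utterances is left out here. The `cutoff` of 0.8 is an arbitrary choice and may need tuning on the full name list.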

XiaoxueZang commented 4 years ago

Thanks for your interest in the dataset. It's a typo made by the crowd worker. We'll correct it in the next update. Please note that MultiWOZ 2.2 did not fix the original utterances, in order to preserve their naturalness, since typos are common in everyday conversation. The dialogue states were fixed in MultiWOZ 2.2 to account for this. I checked the dialogue state annotations for this dialogue, and they look correct to me. Please feel free to follow up if you have any questions.