What is the "self.key_name" in the benchmark code? Is the test/dev data that you did not give?

thu-coai / KdConv

KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation

Apache License 2.0

459 stars 62 forks source link

Closed Mrwhite132613 closed 4 years ago

Mrwhite132613 commented 4 years ago

for key in self.key_name: origin_data[key] = {'posts': [], 'prev_posts': [], 'responses': [], 'kg': [], 'kg_index': []}

Mrwhite132613 commented 4 years ago

thanks @chujiezheng

chujiezheng commented 4 years ago

It is a list of dataset split names, such as ['train', 'dev', 'test']. They are also the data file names.

Mrwhite132613 commented 4 years ago

@chujiezheng 可以加一个微信吗？这个benchmark感觉有好多地方跑不通。这个数据集您只给了train.json，是dev和test要从train.json里面自己切分吗？数据集超级nice,也是刚刚做对话，想交个朋友，十分感谢

chujiezheng commented 4 years ago

可以的，您把您的联系方式发到我的邮箱：zcj16@tsinghua.org.cn，我添加您

由于我们现在在办比赛，所以dev和test会推迟些公开