gaojingsheng / LiveChat

Code and Dataset for the paper "LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming" ACL 2023
MIT License

Could you provide the complete dataset? #2

Closed by 27182812 1 year ago

27182812 commented 1 year ago

Hello, excellent work! I would like to conduct relevant research on this dataset. Could you provide the complete dataset?

gaojingsheng commented 1 year ago

Sorry for the delay in releasing LiveChat. The full processed dataset is available here: https://github.com/gaojingsheng/LiveChat/blob/master/Dataset/README.md

27182812 commented 1 year ago

Thank you for your reply! I'm also curious about the character information in basic_file.json. It is currently anonymized, but the character information on its own should not be enough to identify anyone. Would it be possible to provide the complete list of character information? It would be a great help to me, and I will acknowledge your work in my research. You can also send it to me via email.

gaojingsheng commented 1 year ago

Sure. Note, though, that in our preliminary experiments we found that the different dimensions of features annotated in LiveChat had no significant impact on the effectiveness of responses, so we didn't present this part of the results. We released only anonymized results, partly to protect privacy and partly because the results were not significant. You can contact me via email, and I will send you the mapping table. My email address is gaojingsheng@sjtu.edu.cn.