Open thomaspzollo opened 2 years ago
The full Chinese section of the WikiConv corpus is not yet available in ConvoKit.
We have however released a small sample; see section 1.2 of this example notebook: https://github.com/CornellNLP/Cornell-Conversational-Analysis-Toolkit/blob/master/examples/politeness-strategies/Politeness_Strategies_in_MT-mediated_Communication.ipynb
If you need the full corpus and want to add it yourself, that would be of course appreciated; see data contribution guidelines here: https://github.com/CornellNLP/Cornell-Conversational-Analysis-Toolkit/blob/master/CONTRIBUTING.md
I see the original WikiConv paper says there were conversations in Chinese collected, are these available through ConvoKit?