CornellNLP / ConvoKit

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
https://convokit.cornell.edu/documentation/
MIT License
542 stars 121 forks source link

New corpus utilities #172

Closed oscarso2000 closed 2 years ago

oscarso2000 commented 2 years ago

Allow users to now create an empty corpus should they choose to do so by simply using Corpus().

Also implements further utilities such as adding individual utterances and adding individual speakers, without having to create a new corpus and use the merge functionality.

Screen Shot 2022-07-15 at 9 15 47 AM

This image provides an example of what we aim to do. Credits to Jonathan Chang.