CornellNLP / ConvoKit

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
https://convokit.cornell.edu/documentation/
MIT License
556 stars 129 forks source link

Corpus-level updating / adding metadata attributes with pandas dataframes #78

Open calebchiam opened 3 years ago

calebchiam commented 3 years ago

For example, corpus.update_utterance_metadata(df) would add / update the metadata fields of all the Utterances in the Corpus according to the index-value pairs in the input dataframe.

The motivation is to make things easier for those who work primarily with pandas dataframes and may generate additional labels for objects via dataframes first before trying to add them to corpus components. (This is similar to the motivation for #69>)