CornellNLP / ConvoKit

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
https://convokit.cornell.edu/documentation/
MIT License

[Feature Request] Selection filter for random speaker/conversation/utterance corpus functions #214

Open vianxnguyen opened 6 months ago

vianxnguyen commented 6 months ago

It would be nice for corpus functions such as random_speaker(), random_conversation(), etc. to accept a selection filter, for example to get a random speaker that satisfies a condition. This would be useful in data exploration, in cases where the user wants a speaker with a certain role, a conversation of at least a certain length, and so on.
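To make the request concrete, here is a minimal sketch of the behavior I have in mind. It does not use ConvoKit itself: `random_matching`, its `selector` and `rng` parameters, and the stand-in `Speaker` class are hypothetical names made up for illustration, not part of the current API. It picks one matching item uniformly at random in a single pass (reservoir sampling), so it would work over any iterator of speakers, conversations, or utterances.

```python
import random
from dataclasses import dataclass, field
from typing import Callable, Iterable, Optional, TypeVar

T = TypeVar("T")


def random_matching(
    items: Iterable[T],
    selector: Callable[[T], bool] = lambda _: True,
    rng: Optional[random.Random] = None,
) -> T:
    """Return one item satisfying `selector`, chosen uniformly at random.

    Hypothetical helper: a single pass over `items` using reservoir
    sampling of size 1, so the matches never need to be stored in a list.
    """
    rng = rng or random.Random()
    chosen = None
    seen = 0  # number of matching items encountered so far
    for item in items:
        if not selector(item):
            continue
        seen += 1
        # Keep the new item with probability 1/seen.
        if rng.randrange(seen) == 0:
            chosen = item
    if seen == 0:
        raise ValueError("no item satisfies the selector")
    return chosen


# Stand-in for a corpus speaker (assumption: only an id and a metadata dict).
@dataclass
class Speaker:
    id: str
    meta: dict = field(default_factory=dict)


speakers = [
    Speaker("alice", {"role": "moderator"}),
    Speaker("bob", {"role": "member"}),
    Speaker("carol", {"role": "moderator"}),
]

# Pick a random speaker whose role is "moderator".
picked = random_matching(speakers, lambda s: s.meta.get("role") == "moderator")
print(picked.id)
```

Ideally the same kind of `selector` argument could be passed directly to random_speaker(), random_conversation(), and random_utterance(), with the default accepting everything so that existing calls behave as they do now.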