about details of dataset

A-Rain commented 4 years ago

Hi, I'm confused about the dataset since there is no readme file. I found the column names are the following:

does 'context' means the sentiment of each utterence ? And what do 'prompt', 'selfeval' and 'tags' mean ?

thanks,

EricMichaelSmith commented 4 years ago

Hi! Sorry for the delay in responding. "context" is the sentiment of the conversation overall, and "prompt" is the prompt with which the Speaker wrote the first sentence (i.e. it's what the conversation should be about). "selfeval" was human ratings of these conversation utterances (see other Issues in this repo for what they mean), and "tags" represents whether the sentences are rated as political, among other things. (I didn't collect "selfeval" or "tags" and they're not used in the paper, so I'm not aware of all of the subtleties of them.)

A-Rain commented 4 years ago

I get it. Thank you for the reply

facebookresearch / EmpatheticDialogues

about details of dataset #35