How to compute lexical overlap (F1 measured between two bags of words)?

princeton-nlp / SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

MIT License

3.37k stars 511 forks source link

How to compute lexical overlap (F1 measured between two bags of words)? #102

Closed YJiangcm closed 2 years ago

YJiangcm commented 2 years ago

Thanks for your excellent job. In section 4 of the paper, you compute the lexical overlap (F1 measured between two bags of words) for the entailment pairs (SNLI + MNLI) is 39%, while they are 60% and 55% for QQP and ParaNMT. I wonder if two bag of words means using window slice of 2? And how to define true posotive, true negative, false positive and false negative to compute the F1 score?

gaotianyu1350 commented 2 years ago

Hi,

Here we use the same way of calculating F1 as in question answering tasks. It's simply unigram, not bigram. For each sentence, you have a bag of words, and true positive would be the words that are at the intersection of both sets.

YJiangcm commented 2 years ago

Got it, thx!