RowitZou / topic-dialog-summ

AAAI-2021 paper: Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling.
MIT License
77 stars, 9 forks

Why are the values of the BoWs TF-IDF? #31

Closed: muguruzawang closed this issue 2 years ago

muguruzawang commented 2 years ago

I noticed that the values of the BoWs are calculated as term frequency / total frequency. Could you explain the reason?

RowitZou commented 2 years ago

TF-IDF is a statistical measure of how relevant a word is to a document within a collection of documents. It down-weights words that appear in most documents, so it generally works better than a regular BoW representation based on raw counts.
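To make the weighting concrete, here is a minimal sketch of TF-IDF-valued BoWs. It assumes the textbook definition (term count divided by document length, multiplied by an unsmoothed log inverse document frequency). The function name `tfidf_bows` and the toy dialogues are hypothetical, and the repository's actual preprocessing may compute the values differently.

```python
import math
from collections import Counter


def tfidf_bows(docs):
    """Return one {term: tf-idf weight} dict per tokenized document.

    Hypothetical helper for illustration only, not the repository's code.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    bows = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)  # total number of tokens in this document
        bows.append({
            # tf = count / total tokens; idf = log(N / df), no smoothing.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return bows


# Toy customer-service dialogues, already tokenized (made-up data).
docs = [
    ["hello", "refund", "order"],
    ["hello", "delivery", "late"],
    ["hello", "refund", "refund", "card"],
]
bows = tfidf_bows(docs)
```

In this toy example "hello" occurs in every dialogue, so its weight is zero, whereas a raw-count BoW would treat it like any other word. Terms concentrated in few dialogues, such as "card", keep a positive weight.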