Semantics-of-Sustainability / tempo-embeddings

Tools for analysing contextual (temporal) word embeddings
Apache License 2.0

Topic labeling #34

Closed carschno closed 1 month ago

carschno commented 10 months ago

Currently, the score of each word per cluster is computed like this:

score_{cluster}(word) = \sum_{doc \in cluster} TF-IDF(word) \cdot distance(centroid, doc)
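For illustration, the formula above could be sketched in plain Python as follows. This is not the code from the repository. It assumes that TF-IDF is computed per document with a smoothed IDF over the whole corpus, that the distance is Euclidean, and that the centroid is the mean of the cluster's document embeddings. The function names (`tf_idf`, `centroid_of`, `cluster_word_scores`) and the toy data are hypothetical.

```python
import math
from collections import Counter


def tf_idf(word, doc_tokens, corpus):
    """TF-IDF of `word` in one document (assumed variant: relative TF, smoothed IDF)."""
    tf = Counter(doc_tokens)[word] / len(doc_tokens)
    df = sum(1 for tokens in corpus if word in tokens)
    idf = math.log((1 + len(corpus)) / (1 + df)) + 1
    return tf * idf


def centroid_of(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(component) / len(vectors) for component in zip(*vectors)]


def cluster_word_scores(cluster_docs, corpus):
    """Sum TF-IDF(word, doc) * distance(centroid, doc) over the cluster's documents.

    cluster_docs: list of (tokens, embedding) pairs belonging to one cluster.
    corpus: list of token lists for all documents, used for the IDF term.
    """
    centroid = centroid_of([embedding for _, embedding in cluster_docs])
    scores = Counter()
    for tokens, embedding in cluster_docs:
        if not tokens:
            continue  # an empty document contributes nothing
        distance = math.dist(centroid, embedding)  # Euclidean distance (assumption)
        for word in set(tokens):
            scores[word] += tf_idf(word, tokens, corpus) * distance
    return scores


# Hypothetical toy data: two documents in the cluster, one outside it.
cluster = [
    (["solar", "energy"], [0.0, 0.0]),
    (["solar", "power"], [2.0, 0.0]),
]
corpus = [tokens for tokens, _ in cluster] + [["coal", "energy"]]

scores = cluster_word_scores(cluster, corpus)
top_words = [word for word, _ in scores.most_common()]
```

With a true distance as the weight, documents far from the centroid contribute more to a word's score than central ones. If the intent is to favour central documents, a similarity would have to be used instead.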

Where

Alternative options to investigate:

Design idea:

carschno commented 1 month ago