Open clarkenj opened 5 months ago
@SophiePellerin what do you think of these? Including both 1 and 2 might be redundant.
I agree that 1 and 2 essentially get to the same thing, maybe keeping just 1 would be enough. 3 is fine to keep because it gets to something different (semantic similarity regardless of whether the words are used in the same contexts or not), it's very possible or even likely that semantically similar words are used in similar contexts but it's also likely not always the case.
Semantic diversity (semD)
Contextual diversity
Word Movers Distance
Note: these are still lexico-semantic features.