silky / ideas

:bulb: various ideas
https://github.com/silky/ideas/issues
MIT License
20 stars 2 forks source link

relate a words "information content" to it's usage count #686

Open silky opened 2 years ago

silky commented 2 years ago

sometimes people get upset when a word has many meanings

but, maybe that's because it's used a lot

you could calculate a terrible entropy for words by, say, computing the distance between the vectors of their definitions.

then, you could correlate that to how often that word is used

maybe that would be fun