I have a question about the interpretation of the weighted log odds themselves. Can you say things like, "In the 1960s, the name Lisa was ~300 times more likely compared to all other names in the corpus of names"?
Unfortunately not. 😔 I talk about this in the documentation here:
"The weighted log odds computed by this function are also z-scores for the log odds; this quantity is useful for comparing frequencies across sets but its relationship to an odds ratio is not straightforward after the weighting."
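To make that concrete, here is a rough sketch in Python of why a z-score for a log odds cannot be read as "N times more likely". It is not tidylo's implementation: the function name, the simple pseudo-count `prior`, and all the counts are made up for illustration. It computes a smoothed log odds ratio for one feature in one set versus everything else, then divides by an approximate standard error to get a z-score.

```python
import math


def log_odds_and_z(count_in_set, total_in_set, count_elsewhere, total_elsewhere, prior=1.0):
    """Return (log odds ratio, z-score) for one feature in one set vs. the rest.

    `prior` is a hypothetical smoothing pseudo-count, not tidylo's default prior.
    """
    a = count_in_set + prior
    b = count_elsewhere + prior
    # Smoothed odds of seeing the feature inside the set and outside it.
    odds_in = a / (total_in_set - count_in_set + prior)
    odds_out = b / (total_elsewhere - count_elsewhere + prior)
    log_odds = math.log(odds_in) - math.log(odds_out)
    # Approximate standard error of the log odds ratio.
    std_error = math.sqrt(1.0 / a + 1.0 / b)
    return log_odds, log_odds / std_error


# Made-up counts: the feature is about twice as common inside the set in both cases,
# but the second case has 100 times more data.
lo_small, z_small = log_odds_and_z(10, 1_000, 5, 1_000)
lo_big, z_big = log_odds_and_z(1_000, 100_000, 500, 100_000)

print(f"small sample: log odds = {lo_small:.2f}, z = {z_small:.2f}")
print(f"large sample: log odds = {lo_big:.2f}, z = {z_big:.2f}")
```

In this sketch the two log odds come out close to each other, so `exp(log_odds)` gives a similar odds ratio in both cases, while the z-score is much larger for the bigger sample. The z-score mixes the size of the difference with how much evidence there is for it, which is why it is useful for ranking features across sets but does not translate back into a "times more likely" statement.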
Introducing tidylo | Julia Silge
Today I am so pleased to introduce a new package for calculating weighted log odds ratios, tidylo. Often in data analysis, we want to measure how the usage or frequency of some feature, such as words, differs across some group or set, such as documents.
https://juliasilge.com/blog/introducing-tidylo/