Closed AnnaDai1001 closed 3 years ago
The RankDifference class is used. This part of the package is admittedly quite shambolic.
See https://nbviewer.jupyter.org/github/JasonKessler/PuPPyTalk/blob/master/notebooks/Class-Association-Scores.ipynb for an explanation of this metric.
Hello, I have been trying to figure out what are the default "scores" used when creating HTML file by scattertext.produce_scattertext_explorer function and read through the source code for hours but cant figure it out. Could anyone help me with this? Really appreciate it. I have created the html with the following piece of code:
This means that I didn't specify the parameter
scores
so the default will bescores=None
based on the source code. Plus, I didn't specify "term_scorer" either so the default will beterm_scorer=None
. From the source code of functionproduce_scattertext_explorer
we have below. So I thinkscores
will still beNone
. But in the HTML file, the terms are ranked by some scores. I am wondering what are these scores then? I have tried to calculate different metrics in each category, e.g. the frequency, f scaled score, pos precision etc. but none of them matched the HTML file.