Closed benel closed 4 years ago
Thank you for opening this. I normally show IDF in a Data Table as seen below. But you have a point. Having a hidden token attribute is a bit confusing for users, and showing this in a Word Cloud could have a nice educational value.
IDF in action, even though in a slightly confusing sparse format:
It is fixed in #486. The Word Cloud now shows the bag-of-words weights.
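For anyone who wants to check the expected effect of the weights outside the widgets, here is a minimal sketch in plain Python. It assumes the textbook formula idf = log(N / df) and a made-up four-document tales corpus; the formula Orange actually uses may differ (for example by smoothing), and the names `docs`, `idf` and `weights` are only illustrative.

```python
import math
from collections import Counter

# Toy "tales" corpus (made-up data, for illustration only).
docs = [
    "the queen met the king",
    "the queen saw the dragon",
    "the queen left the castle",
    "the fox fooled the crow",
]
tokenized = [d.split() for d in docs]
n_docs = len(tokenized)

# Document frequency: in how many documents each token occurs.
df = Counter(tok for doc in tokenized for tok in set(doc))


def idf(term):
    # Textbook IDF (assumption): log(N / df), no smoothing.
    return math.log(n_docs / df[term])


def weights(doc, use_idf):
    # Raw term counts, optionally multiplied by the IDF of each term.
    tf = Counter(doc)
    return {t: c * (idf(t) if use_idf else 1.0) for t, c in tf.items()}


plain = weights(tokenized[0], use_idf=False)
weighted = weights(tokenized[0], use_idf=True)

for term in sorted(plain):
    print(f"{term:6s} count={plain[term]:.0f} tf-idf={weighted[term]:.3f}")
```

With plain counts, "the" is the heaviest token of the first document. With IDF it drops to 0 because it occurs in every document, and "queen" (three documents out of four) ends up well below "king" (one document).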
Text version
0.5.2
Orange version
3.16
Expected behavior
My aim would be to show TF.IDF in action to my students. When connecting a corpus to a bag of words and the bag of words to a data table (or even better to a word cloud), I would expect that changing the document frequency parameter in the bag of words from none to IDF would change the result (hiding common words in the language like "the", similarly to a stop words preprocessing, but also hiding words common to the corpus like "queen" for a tales corpus).
Actual behavior
Changing the parameter doesn't seem to change anything in the result. @ajdapretnar explained the following in a related ticket (biolab/orange3#3426):