JasonKessler / scattertext

Beautiful visualizations of how language differs among document types.
Apache License 2.0

Lack of words in one category #111

Closed yshi2016 closed 2 years ago

yshi2016 commented 2 years ago

Hi! I ran scattertext on two categories, reddit and quora, with max_docs_per_category=30000. The run completed successfully, but the caption only reports 30000 docs from reddit and shows nothing for quora. In which cases can this happen? Is it because the quora corpus is much smaller than the reddit one? Thank you!

[image]

JasonKessler commented 2 years ago

If you upload the code, source data, and produced visualization, someone may be able to look into this. Otherwise, there's not enough information to investigate.
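
One quick check worth including with such a report is how many usable documents each category actually has before the corpus is built. The snippet below is only a sketch and does not use scattertext itself. It assumes the documents are available as a list of dicts with hypothetical `category` and `text` keys, and the helper name `count_docs_per_category` is made up for illustration. It does not diagnose this issue; it only surfaces mismatched labels or empty texts if they exist.

```python
from collections import Counter


def count_docs_per_category(rows, category_key="category", text_key="text"):
    """Count documents with non-empty text for each category label.

    `rows`, `category_key` and `text_key` are assumptions about how the
    data is stored; adapt them to the real data source.
    """
    counts = Counter()
    for row in rows:
        text = row.get(text_key)
        # Skip missing or whitespace-only texts.
        if isinstance(text, str) and text.strip():
            counts[row.get(category_key)] += 1
    return counts


# Toy data, invented for illustration only.
rows = [
    {"category": "reddit", "text": "first post"},
    {"category": "reddit", "text": "second post"},
    {"category": "Quora", "text": "a question"},  # label differs in case from "quora"
    {"category": "quora", "text": "   "},         # whitespace-only text
]

counts = count_docs_per_category(rows)
print(counts)

# Flag any expected category that ends up with no usable documents.
for expected in ("reddit", "quora"):
    if counts.get(expected, 0) == 0:
        print(f"no usable documents found for category {expected!r}")
```

Posting the resulting counts together with the code and data makes it easier for someone to tell whether the second category ever reached the visualization.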