heerme / twitter-topics

Python code for detecting topics/events from a Twitter stream
GNU General Public License v3.0
101 stars 46 forks source link

CountVectorizer error #3

Open Mustyy opened 7 years ago

Mustyy commented 7 years ago

raise ValueError("empty vocabulary; perhaps the documents only" ValueError: empty vocabulary; perhaps the documents only contain stop words

Why am i getting this error, min_df = 2 and everything else seems to be right

bozzmob commented 7 years ago

@Mustyy Please can you tell me how you are working on this? Please can you share me the details? I need to know what's the input file structure.

Mustyy commented 7 years ago

@bozzmob Hey, thanks for getting back to me I extracted json to txt then im running twitter-topics-from-json-text-stream.py

bozzmob commented 7 years ago

Hi @Mustyy Thanks for the quick reply. That is actually my question.

How do you get extract-json-to-text-stream.py to work?

What format is your input file? Please can you share its format?