britth / GameChanger

A simple command line program that processes Twitter data to discover popular moments in multi-game sporting events. Never settle for a boring game!
MIT License

Calculating TF-IDF #7

Open sunhwap opened 10 years ago

sunhwap commented 10 years ago

This issue deals with letting the user interact with school data via the ESPN API.

[edit: changed title to reflect actual issue; ESPN API use/school data was covered in issue #5. -@britth]

sunhwap commented 10 years ago

We can either use an existing library that calculates TF-IDF to pull out the most important terms, or use scikit-learn to turn a bag-of-words representation into TF-IDF weights.
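
For reference, here is a rough library-free sketch of what either option would be computing. It assumes the common tf = count / document length and idf = log(N / df) variant; scikit-learn and other libraries use slightly different smoothing, so the exact numbers will differ. The names `tokenize`, `tf_idf`, `top_terms` and the sample tweets are made up for illustration and are not from this repo.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer (assumption); real tweets would need
    # handling for hashtags, mentions, URLs, and punctuation.
    return text.lower().split()


def tf_idf(docs):
    """Return one {term: score} dict per document in docs."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def top_terms(score, k=3):
    # Highest score first; ties are broken alphabetically.
    return sorted(score, key=lambda t: (-score[t], t))[:k]


# Hypothetical sample data, one string per tweet.
tweets = [
    "goal goal what a game",
    "overtime in the game tonight",
    "buzzer beater wins the game",
]
scores = tf_idf(tweets)
for tweet, score in zip(tweets, scores):
    print(top_terms(score), "<-", tweet)
```

With this variant, a word that shows up in every tweet (like "game" above) scores zero, which is the behavior we want for filtering out terms that are common across all games.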

sunhwap commented 10 years ago

This link seems helpful for extracting terms using TF-IDF: http://stevenloria.com/finding-important-words-in-a-document-using-tf-idf/

britth commented 10 years ago

@sunhwap thanks for the link! definitely looks like some code we could modify for this