Datafable / epu-index

EPU index
http://www.applieddatamining.com/cms/?q=content/economic-policy-uncertainty-index
1 stars 0 forks source link

How to compare words #56

Closed bartaelterman closed 9 years ago

bartaelterman commented 9 years ago

There are 3 cases where we need to compare words:

  1. Scoring and article by applying a weight to every word in the text.
  2. Counting the number of unique words and determining their term frequency to build a word cloud.
  3. Removing stop words from a text before determining the word frequencies.

How exactly do we compare words? I would propose:

Drawbacks:

bartaelterman commented 9 years ago

Text will be cleaned first to remove punctuation (see #55). All words are then set to lowercase and compared.