Closed aronwc closed 10 years ago
I have started to try expanding the words by finding the most frequent co-ocurrences for each word from the original list. However, the results are not convincing as a lot of words are amongst the most frequent ones for each poms word.
Ideas to try:
Categorized POMS words: http://www2.ul.ie/pdf/526781607.pdf
I used WordNet to find synonyms to the POMS words and I am currently trying to develop an algorithm to find words that have the most similar context when compared to the POMS word contexts.
Latest results look pretty good -- top 50 words for anxiety/depression seem reasonable.
Next steps: labeling some tweets (~1000) that contain at least one of the keywords. Binary label: does the tweet express something about the emotional state of the user or not? (e.g., filter out movie quotes).
We discussed some options: