clulab / twitter4food

Repository for the health informatics analytics on twitter project
Apache License 2.0

Weight more important tweets more #12

Open herongrove opened 7 years ago

herongrove commented 7 years ago

Weighting by recency is a relatively simple tactic based on our observation that more recent tweets are more relevant, but there are other kinds of relevance. For example, we have an RT-or-not feature prefix, and multiple ways of looking at custom dictionaries. We should examine more sophisticated methods of establishing relevance.

NB: the neural net architecture developed so far lives in t4fnet and relies on data that is stored on the lab servers rather than in the repository. The NN code can be brought into this repo if that makes things easier.
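
To make the discussion concrete, here is a minimal sketch of how a recency weight could be combined with a second relevance signal such as the RT-or-not distinction. It is not the weighting currently used in this repo or in t4fnet. The exponential half-life decay, the function names (`recency_weight`, `tweet_weight`, `normalize`), and the parameters `half_life_days` and `retweet_factor` are all assumptions chosen for illustration.

```python
from datetime import datetime, timedelta, timezone


def recency_weight(tweet_time, newest_time, half_life_days=30.0):
    """Exponential decay: a tweet half_life_days older than the newest gets weight 0.5.

    The decay form and the default half-life are illustrative assumptions.
    """
    age_days = (newest_time - tweet_time).total_seconds() / 86400.0
    # Clamp negative ages (tweets newer than the reference time) to zero.
    return 0.5 ** (max(age_days, 0.0) / half_life_days)


def tweet_weight(tweet_time, newest_time, is_retweet,
                 retweet_factor=0.5, half_life_days=30.0):
    """Combine recency with a hypothetical down-weighting of retweets."""
    weight = recency_weight(tweet_time, newest_time, half_life_days)
    if is_retweet:
        weight *= retweet_factor
    return weight


def normalize(weights):
    """Scale weights so they sum to 1 (all zeros if the total is zero)."""
    total = sum(weights)
    if total == 0:
        return [0.0 for _ in weights]
    return [w / total for w in weights]


# Usage example on made-up timestamps (no real account data).
newest = datetime(2017, 6, 1, tzinfo=timezone.utc)
tweets = [
    (newest, False),                       # newest tweet, original
    (newest - timedelta(days=30), False),  # one half-life old, original
    (newest - timedelta(days=30), True),   # one half-life old, retweet
]
raw_weights = [tweet_weight(t, newest, rt) for t, rt in tweets]
normalized_weights = normalize(raw_weights)
```

Other relevance signals, such as matches against the custom dictionaries, could be folded in the same way as additional multiplicative factors, though whether multiplication is the right way to combine them is an open question for this issue.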