davidkiss / storm-twitter-word-count

This is a sample project demonstrating real-time computation storm framework integration with twitter.
45 stars 55 forks source link

storm-twitter-word-count

Sample project based on https://github.com/abh1nav/dvto1 demonstrating real-time computation Storm framework (https://github.com/nathanmarz/storm).

See my blog post for mor details on this project: http://kaviddiss.com/2013/05/17/how-to-get-started-with-storm-framework-in-5-minutes/.

The code subscribes to Twitter's Sample feed, keeps stats on words occuring in tweets and logs top list with of words with most count in every 10 seconds.

This project contains a simple storm topology that connects to the sample stream of the Twitter Streaming API and keeps stats on words occuring in tweets and prints top list of words with highest count in every 10 seconds.

To get started:

You'll need to have valid Twitter OAuth credentials to get the sample working. For the exact steps on how to do that, visit https://dev.twitter.com/discussions/631.

Donation

If you found this code helpful, I'd be grateful for your donation.

paypal