ueshin / flume-twitter

This is a flume plugin for twitter streaming.
http://ueshin.github.com/flume-twitter/
6 stars 1 forks source link

data from flume #4

Closed raghavi222 closed 11 years ago

raghavi222 commented 11 years ago

hello sir, i've installd flume and i could get the tweets directly into my hdfs. i want to analyse these tweets which are in my hdfs system with the help of R tool.. the tweets are in json format so how can i extract tweets from that so that i can analyse them with the help of R . thanks

ueshin commented 11 years ago

Hi,

I don't know much about R, but I think you can parse json to something in Java.

Look at the start point of my code piece from here ( https://github.com/ueshin/flume-twitter/blob/master/src/main/java/st/happy_camper/flume/twitter/TwitterStreamingHBaseSink.java#L96 ). I do NOT believe it works well because the json schema might have been modified by Twitter or some reason, so please modify as you need.

The code piece uses jackson ( http://jackson.codehaus.org/ ). Please refer to the documents for more detailed usage.

Thanks.