gwu-libraries / social-feed-manager

"Old SFM" -- manage rules and streams from social data sources, starting with twitter.
MIT License

Export of stream tweets to csv. #286

Open justinlittman opened 9 years ago

justinlittman commented 9 years ago

Stream tweets are currently stored in time-segmented, line-oriented JSON files. Support for exporting these tweets to CSV is required. This support should include:

justinlittman commented 9 years ago

Here's info from @kerchner on how he has done this manually in the past:

@Justin, on second thought it looks like I did write a script to do just that. Step 1 was to run "gunzip -r" on the tree of twitterfilter output files. Step 2 was "grep -h <twitterfilterdirectory> -r > singleoutputfile.json". Step 3 was the Python script, which apparently I didn't place in a good location, that reads each line's JSON, extracts the relevant columns, and writes a line of CSV. I'll see if I can find it, but it wasn't too complicated.
Ah, here it is:  https://gist.github.com/kerchner/e471a5f010b6bcafb3f0
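
For reference, the manual steps described above could be folded into a single script along these lines. This is only a rough sketch, not the script from the gist: the function names (`iter_tweets`, `tweet_to_row`, `export_csv`) and the choice of columns are assumptions made for illustration, and it assumes the files hold one tweet-shaped JSON object per line, either gzipped (`.gz`) or plain.

```python
import csv
import gzip
import json
import os

# Hypothetical column choice; the columns the real export needs are not
# specified in this thread.
COLUMNS = ["id_str", "created_at", "screen_name", "text"]


def iter_tweets(root_dir):
    """Yield each JSON object found, one per line, in every file under root_dir."""
    for dirpath, dirnames, filenames in os.walk(root_dir):
        dirnames.sort()  # walk subdirectories in a stable order
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            # Read gzipped files directly instead of running gunzip first.
            opener = gzip.open if name.endswith(".gz") else open
            with opener(path, "rt", encoding="utf-8") as handle:
                for line in handle:
                    line = line.strip()
                    if not line:
                        continue
                    try:
                        record = json.loads(line)
                    except ValueError:
                        continue  # skip lines that are not valid JSON
                    if isinstance(record, dict):
                        yield record


def tweet_to_row(tweet):
    """Pull the assumed columns out of one tweet object."""
    user = tweet.get("user") or {}
    return [
        tweet.get("id_str", ""),
        tweet.get("created_at", ""),
        user.get("screen_name", ""),
        tweet.get("text", ""),
    ]


def export_csv(root_dir, out_path):
    """Write one CSV row per tweet under root_dir; return the number of rows."""
    count = 0
    with open(out_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out)
        writer.writerow(COLUMNS)
        for tweet in iter_tweets(root_dir):
            if "text" not in tweet:
                continue  # skip stream messages that are not tweets
            writer.writerow(tweet_to_row(tweet))
            count += 1
    return count
```

Calling `export_csv("<twitterfilterdirectory>", "tweets.csv")` would then replace the gunzip, grep, and convert steps with one pass over the tree, without unpacking the files on disk.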