openeventdata / phoenix_pipeline

Turning news into events since 2014.
MIT License
50 stars 33 forks source link

changes to mongo_formatter #6

Closed myi100 closed 10 years ago

myi100 commented 10 years ago

Output Tabari formatted 'eventrecords...txt' from 'scraper_results..txt' where key for each story is the url for the story backwords: e.g., for story http://stream.wsj.com/story/latest-headlines/SS-2-63399/SS-2-444966/

140203 /669444-2-SS/99336-2-SS/senildaeh-tsetal/yrots/moc.jsw.maerts//:ptth Gov. Chris Christie, in a live radio appearance on Monday, denied any knowledge of or planning role in the lane closures at the George Washington Bridge last year, his first public remarks in nearly a month on a matter that threatens his political ambitions.

johnb30 commented 10 years ago

Can you rebase this on top of the current master so it can be merged in? There are currently some conflicts. Also, I think the .txt file with the output should be deleted from the repo.