scramblingbalam / Alta_Real

The python back end of a data product that aims to get at the truthiness of Trump's tweets. It uses PyStruct.EdgeFeatureGraphCRF to leverage the tree structure of twitter replies to gage veracity by the amount of support of denial that his tweets elicit.
MIT License
1 stars 0 forks source link

SWOT approach: Save order>1 replies to a new collection for each thread 'db[trump_tweet_id]' #53

Closed scramblingbalam closed 7 years ago

scramblingbalam commented 7 years ago

Analysis of #50

scramblingbalam commented 7 years ago

Strengths: 1) Would make processing by t-CRF training scripts easier

Weaknesses : 1) Would have to query a bunch of Mongo collections separately and then aggragate, not sure I can even do this

Opportunities: None

Threats: 1) Complex Mongo queries could mean that stream times out, documented here https://dev.twitter.com/streaming/overview/processing

scramblingbalam commented 7 years ago

Comparing SWOT for suggests I shouldn't go with #50