scramblingbalam / Alta_Real

The python back end of a data product that aims to get at the truthiness of Trump's tweets. It uses PyStruct.EdgeFeatureGraphCRF to leverage the tree structure of twitter replies to gage veracity by the amount of support of denial that his tweets elicit.
MIT License
1 stars 0 forks source link

SWOT approach: Save order>1 replies to a new collection for level of reply 'db[reply_level]' #54

Closed scramblingbalam closed 7 years ago

scramblingbalam commented 7 years ago

Analysis of #55

scramblingbalam commented 7 years ago

Strengths: 1) Would make processing by t-CRF training scripts easier than flat

Weaknesses : 1) Would have to query a bunch of Mongo collections separately and then aggragate, if I want to listen for all levels simultaneously 2) Since threads could be arbitrarily long the process to decide which Twitter auth to use where would be non trivial

Opportunities: None

Threats: 1) Complex Mongo queries could mean that stream times out, documented here https://dev.twitter.com/streaming/overview/processing

scramblingbalam commented 7 years ago

SWOT analysis in suggest method in issue #55 isn't best choice