Suggestion: use the RSS feeds rather than HTML scraping

Hey @omgmog , Thanks for the help man At the inception of the project i was fully confident of using the RSS Feeds for scraping , although below are some crucial points that made me alter my choice :

tribble is not a scraper but it is content organizer as well as a mini content management system, as a result i needed data of sorts like tags, keywords etc in order to filter the data that which i scrape.
Secondly, i will be including an analysis engine soon which has redundancy removal as an aspect as a result i need meta data like point of origin of the article.
Finally It is sometimes difficult to discern the origin of an RSS feed item. When an item is syndicated, the source is not always indicated. The metrics available are not always reflective of the traffic received.

NOTE :

Please understand the broader aspect of the application and let me elaborate it. If you had observed the application, as soon as the app starts both scraping as well as running an instance of a web application would take place, So there would be small frames displaying information on the front-end. For instance if i have an author publishing content on engadget, r/technology and other listed sites concurrently then on the basis of time stamp i would only display the data once instead of including it twice. So in order to achieve such tasks i guess RSS might give me a tough time.

Finally there is one more thing, to be noted I choose the websites not just on a random basis, each of them is good in a specific sector for a reason, so i wanted tribble to be not a simple scraper but a collective representation of information.

akhilpandey95 / tribble

Suggestion: use the RSS feeds rather than HTML scraping #13