rivernews / media-literacy

Exploring possibilities of technology empowering media literacy
1 stars 0 forks source link

Explore news site #10

Open rivernews opened 3 years ago

rivernews commented 3 years ago

It's important to make above modularize, so it's better for testing, reusing, parallel execution, etc.

Module: scraper (Golang)

Module: parser & formatter

Module: word cloud reducer

Module: time series scraper

Module: publisher


Scraping - Golang runtime environment


Let's split into two parts - 1) fetch html 2) rest of the processing

rivernews commented 3 years ago

POC: Parallel Batch Scraper with Golang

rivernews commented 3 years ago

Still work remaining