-
For some sites I could retrieve metadata but obviously for others I could not.
Here's my code:
constructor(){
new Scraper({
host: 'https://en.wikipedia.org/wiki/Main_…
-
The code we run for date guessing was mostly written a couple of years ago and has not been updated much since then. When we wrote it, we validated it as about 87% accurate, but we have anecodata tha…
-
# Problem
Lots of our scrapers have to deal with tables, particularly from Wikipedia. In the simple case, this is easy. However, if the table contains rowspans or colspans, the logic gets very compli…
-
Hello,
I'm trying the wikinews example project in the getting started guide https://django-dynamic-scraper.readthedocs.io/en/latest/getting_started.html by cloning the repo and installing DDS in a …
-
Inspiration:
[http://www.forbes.com/sites/kevinmurnane/2016/03/08/brilliant-data-visualization-brings-the-history-of-hip-hop-to-life/#7f48ca5a2213](http://www.forbes.com/sites/kevinmurnane/2016/03/08…
-
Since CouchDB is a rather big dependency that requires installation as a system-wide service, would it be possible to add support for an SQLite database as well? I would imagine that SQLite would be a…
-
The current version of `lfc_managers.Rmd` uses a CSV of managers created from the [Wikipedia's LFC manager's page](https://en.wikipedia.org/wiki/List_of_Liverpool_F.C._managers). I want to update the …
-
* [ ] 2016-10-23 :-1: **Lithuania** - scraper is failing, site is down.
* [x] 2016-10-23 :+1: **Somalia** - scraper seems to be running daily, site is [a parliament site](http://www.parliament.somali…
-
# Problem with Incoming Data
## Legislature
Sweden (Riksdag)
## Problem
A member is losing Wikidata and it seems like there are duplicate memberships with differing start and end dates:
`…
-
# Problem with Incoming Data
## Legislature
Alderney (States)
## Problem
New names coming in but they need to be added to new term.
## Steps
- [x] Add test framework to scraper
- [x] Refa…