CodeforLeipzig / kidsle

Kitas, Kindergärten, Spielplätze und Schulen in Leipzig
GNU General Public License v2.0
9 stars 3 forks source link

Collect Data from Leipzig #33

Open paesku opened 9 years ago

paesku commented 9 years ago

Currently our Dataset is very small and not up to date. As an example lot of Schools are missing. So I startet a Scraper over at morph.io written in JS as a starting point. So have a look at the GitHub repository.

DebVortex commented 9 years ago

I understand the point of using morph.io but in this point, I think we should not use it. I would rather vote for integrating the scraper directly into the kidsle project as manage.py commands. This way, we can setup a cron job and check for new stuff like once a week. What do you think about this?

paesku commented 9 years ago

Hm, do you really think in Leipzig a new School is build every week? I think it's regular enough to do it once a half or even a year. For our Playground dataset it looks something different. To me it's just a good starting point to get familiar with scraping anyway.