AvocadoMoon / CTEventScraper

Web scrapers that gather events from sites which publish them, and then upload them to a mobilizon instance. Currently used by CTGrassRoots
https://ctgrassroots.org/
2 stars 0 forks source link

Epic: Sites to Scrape From #1

Open AvocadoMoon opened 2 months ago

AvocadoMoon commented 2 months ago

Problem

There are sites which publish local events on them, but we want them to centralized in one location for easy search and pooled into a DB of other events people publish.

Solution

Create a web scraper that is able to gather all the information desired in a digestible form, and a driver which then publishes these events onto a website.

Driver Tasks

Scraper Tasks

AvocadoMoon commented 1 month ago

Scraper for Microformats following schema.org. Most sites that host events seem to not follow this, but every site seems to utilize one instance of it which is the "local business" type.

AvocadoMoon commented 1 month ago

Looks like some of the sites use word press which has some API that may be applicable for tapping into. Also there seems to be a library within python that is used for calendars, and my be able to pull online calendars.