blr-today / ingest

Ingestion pipeline for blr.today
https://blr.today
GNU General Public License v3.0
8 stars 1 forks source link
bangalore blr-today events git-scraping

blr.today ingestion

This is the source code for the BLR.Today ingestion pipeline. This code does the following:

  1. Fetches events from dozens of sources.
  2. Enriches events with metadata, and cleanup. For example - location information is added where possible, and events are classified using schema.org/Event sub-types.
  3. Events are exported in the schema.org/Event schema to a SQLite database.

Status

The following sources are automatically ingested, filtered to just Bangalore events wherever necessary.

A lot more are in-progress, please see TODO.md for a more updated list.

Dependencies:

Running

The code automatically runs using GitHub Actions once every four hours. You can run it using the following command:

make clean && make. Once complete, the events.db file will be updated with the latest events.

License

This repository is licensed under the GNU-GPLv3 license. This means that you can use, modify, and distribute this code as long as you also release your code under the same license. Please see LICENSE.txt for more details.

The files stored in out/ and fixtures/ directories is copyright of the original authors and is not covered by this license. Instead of using this, please use the published dataset available at blr.today/dataset, which is published under the Open Database License, which means you must attribute the data, and share it under the same license without any technical restrictions.