Beit-Hatfutsot / mojp-dbs-pipelines

Pipelines for data sync of Jewish data sources to the DB of The Museum of The Jewish People
MIT License

Datapackage pipelines for The Museum of The Jewish People

Uses the datapackage pipelines framework

Overview

This project provides pipelines that sync data from multiple external sources to the MoJP Elasticsearch DB.

Running

Install some dependencies (the following should work on recent versions of Ubuntu / Debian)

sudo apt-get install -y python3.6 python3-pip python3.6-dev libleveldb-dev libleveldb1v5
sudo pip3 install pipenv

Install the app dependencies

pipenv install

Activate the virtualenv

pipenv shell

Install the datapackage_pipelines_mojp package for development

pip install -e .

Get the list of available pipelines

dpp

Run a pipeline

dpp run <PIPELINE_ID>
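
To run several pipelines in sequence, the `dpp run` command above can be wrapped in a small script. The following is a minimal sketch, not part of this project: the helper names `build_dpp_command` and `run_pipelines` are hypothetical, and it assumes `dpp` is on the PATH of the active virtualenv.

```python
import subprocess
from typing import List, Sequence


def build_dpp_command(pipeline_id: str) -> List[str]:
    """Return the argument list for `dpp run <PIPELINE_ID>` (hypothetical helper)."""
    if not pipeline_id.strip():
        raise ValueError("pipeline_id must not be empty")
    return ["dpp", "run", pipeline_id]


def run_pipelines(pipeline_ids: Sequence[str]) -> None:
    """Run each pipeline in order, stopping at the first failure."""
    for pipeline_id in pipeline_ids:
        # check=True raises CalledProcessError if dpp exits with a non-zero code
        subprocess.run(build_dpp_command(pipeline_id), check=True)
```

Pass `run_pipelines` the pipeline IDs exactly as they appear in the output of `dpp`.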

Running the full pipelines environment using docker

This will provide:

After every change in the code you should run bin/docker/build.sh && bin/docker/start.sh

Additional features:

Running the tests using docker

Running the pipelines locally

Make sure you have Python 3.6 in a virtualenv
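
A quick way to confirm both conditions is to ask the interpreter itself. This is a minimal sketch with hypothetical function names. It assumes that "Python 3.6" means a major.minor version of exactly 3.6, and it detects a virtualenv by comparing `sys.prefix` with `sys.base_prefix` (or by the `sys.real_prefix` attribute set by older versions of virtualenv).

```python
import sys


def python_matches(version_info=None, required=(3, 6)) -> bool:
    """Return True if the major.minor version equals `required` (assumed: exactly 3.6)."""
    version_info = sys.version_info if version_info is None else version_info
    return tuple(version_info[:2]) == tuple(required)


def in_virtualenv() -> bool:
    """Return True if the interpreter appears to be running inside a virtualenv."""
    # Older virtualenv releases set sys.real_prefix; venv makes sys.base_prefix differ from sys.prefix
    return hasattr(sys, "real_prefix") or getattr(sys, "base_prefix", sys.prefix) != sys.prefix


if __name__ == "__main__":
    print("Python 3.6:", python_matches())
    print("Inside virtualenv:", in_virtualenv())
```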

Available Data Sources

Clearmash

Clearmash is a CMS used by MoJP to manage its own data

Clearmash exposes an API to get the data

Relevant links and documentation (the Clearmash support site requires login)