VirtualFlyBrain / vfb-pipeline-config

Pipeline 2.0 Configuration
Apache License 2.0

Pipeline 2 Documentation

VFB Pipeline 2 comprises five servers/services and six data pipelines, each of which is listed under its sub-pipeline in the overview below.

Servers and data pipelines are combined into six general sub-pipelines, which are configured as Jenkins jobs (currently located here). This documentation describes all six sub-pipelines in detail, including the role each server and data pipeline plays. All high-level documentation, including images, can be found in the vfb-pipeline-config repo. Note: a pipeline server named vfb-integration-api once existed but has since been discarded in favour of vfb-dumps.
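As a quick cross-check of the counts above, the composition of the sub-pipelines can be written down as a small data structure. This is only an illustrative sketch transcribed from the section headings in this document, not configuration used by the pipeline itself. The service name `vfb-solr` is an assumption: the SOLr sub-pipeline is marked as a service, but its service name is not spelled out here.

```python
# Sub-pipeline composition, transcribed from the headings in this document.
# Illustrative only: this is not read by Jenkins or by any pipeline component.
SUB_PIPELINES = {
    "pip_vfb-kb": {
        "services": ["vfb-kb"],
        "data_pipelines": ["vfb-kb2kb", "vfb-validate"],
    },
    "pip_vfb-triplestore": {
        "services": ["vfb-triplestore"],
        "data_pipelines": ["vfb-collect-data", "vfb-update-triplestore"],
    },
    "pip_vfb-pipeline-dumps": {
        "services": [],
        "data_pipelines": ["vfb-dumps"],
    },
    "pip_vfb-owlery": {
        "services": ["vfb-owlery"],
        "data_pipelines": [],
    },
    "pip_vfb-prod": {
        "services": ["vfb-prod"],
        "data_pipelines": ["vfb-update-prod"],
    },
    "pip_vfb-solr": {
        "services": ["vfb-solr"],  # ASSUMPTION: service name not given in the text
        "data_pipelines": [],
    },
}


def count(kind):
    """Count all entries of one kind ('services' or 'data_pipelines')."""
    return sum(len(parts[kind]) for parts in SUB_PIPELINES.values())


if __name__ == "__main__":
    print("sub-pipelines: ", len(SUB_PIPELINES))
    print("services:      ", count("services"))
    print("data pipelines:", count("data_pipelines"))
```

Under the stated assumption, the totals come out as six sub-pipelines, five services and six data pipelines, which matches the description above.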

Pipeline Overview

Sub-pipeline: Deploy KB (pip_vfb-kb)

Service: vfb-kb

Detailed notes on vfb-kb

Data pipeline: vfb-kb2kb [provisional]

Detailed notes on vfb-kb2kb

Data pipeline: vfb-validate

Detailed notes on vfb-validate

Sub-pipeline: Deploy triplestore (pip_vfb-triplestore)

Service: vfb-triplestore

Detailed notes on vfb-triplestore

Data pipeline: vfb-collect-data

neo4j2owl:exportOWL()

Detailed notes on vfb-collect-data

Data pipeline: vfb-update-triplestore

Detailed notes on vfb-update-triplestore

Sub-pipeline: Data transformation and dumps for production instances (pip_vfb-pipeline-dumps)

Data pipeline: vfb-dumps

Detailed notes on vfb-dumps

Sub-pipeline: Deploy Owlery (pip_vfb-owlery, Service)

Service: vfb-owlery

Sub-pipeline: VFB prod (pip_vfb-prod)

Service: vfb-prod

Data pipeline: vfb-update-prod

Detailed notes on vfb-update-prod

Sub-pipeline: VFB SOLr (pip_vfb-solr, Service)

Deployment during development phase:

  1. The pipeline is currently deployed as a series of connected Jenkins jobs.
  2. Every sub-pipeline has a Jenkins job that can be restarted manually. Every sub-pipeline triggers all of its dependents: for example, if the pip_vfb-dumps pipeline is started, it automatically triggers the pip_vfb-prod and pip_vfb-owlery pipelines to redeploy as well.
  3. The whole pipeline can be restarted by triggering a re-run of the pip_vfb-kb pipeline, which in turn triggers all downstream sub-pipelines.
  4. The whole pipeline is re-run every night at 4am.
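
The trigger behaviour described in the list above can be sketched as a walk over a dependency graph. The following is a minimal illustration, not the actual Jenkins configuration. Job names follow the section headings of this document. Only the edges from the dumps job to the prod and owlery jobs are stated in the text; the other edges are assumptions made for the example, and pip_vfb-solr is left out because its position in the chain is not stated here.

```python
from collections import deque

# Hypothetical "job -> jobs it triggers" map.
# Stated in the text: dumps triggers prod and owlery.
# ASSUMED for illustration: kb -> triplestore -> dumps.
# pip_vfb-solr is omitted because its upstream job is not given in this section.
TRIGGERS = {
    "pip_vfb-kb": ["pip_vfb-triplestore"],              # assumed edge
    "pip_vfb-triplestore": ["pip_vfb-pipeline-dumps"],  # assumed edge
    "pip_vfb-pipeline-dumps": ["pip_vfb-prod", "pip_vfb-owlery"],
    "pip_vfb-prod": [],
    "pip_vfb-owlery": [],
}


def downstream(job, triggers=TRIGGERS):
    """Return every job triggered (directly or transitively) by `job`,
    in breadth-first order and without duplicates."""
    ordered = []
    queue = deque(triggers.get(job, []))
    while queue:
        current = queue.popleft()
        if current in ordered:
            continue  # already scheduled via another path
        ordered.append(current)
        queue.extend(triggers.get(current, []))
    return ordered


if __name__ == "__main__":
    # Restarting the dumps job redeploys prod and owlery.
    print(downstream("pip_vfb-pipeline-dumps"))
    # Restarting the KB job re-runs everything downstream of it.
    print(downstream("pip_vfb-kb"))
```

With these assumed edges, restarting the dumps job reaches only the prod and owlery jobs, while restarting the KB job reaches every other job in the map, mirroring points 2 and 3.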