genome-nexus / genome-nexus-importer

Import data into MongoDB for use by https://github.com/genome-nexus/genome-nexus/
MIT License
4 stars 16 forks source link

Import data in builder container #38

Closed nr23730 closed 3 years ago

nr23730 commented 3 years ago

Right now with every docker-compose up all the data will be imported when the container is started. This causes heavy load on each startup even when the data files are located in a separate volume.

My proposal would be to import the data prior in a builder container and copy them to a fresh production container. This also reduces size as the image does not contain all the raw data anymore. Also I made sure that volumes binding to /bitnami/mongodb/ will not cause anything to fail. In that case data will be copied to the volume before the container starts.

This can also be a foundation to preload additional data sources like MutationAssessor.