monarch-initiative / dipper

Data Ingestion Pipeline for Monarch
https://dipper.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License

Start DipperCache and Go, Wormbase & Reactome #875

TomConlin closed 4 years ago

TomConlin commented 4 years ago

DipperCache is not officially deployed yet but we can start using the preview version sitting on archive.mi/DipperCache.

Away at CCDH meeting (at which KS said the above was okay to merge)


Additional item

justaddcoffee commented 4 years ago

Both worm and reactome ingests with DipperCache work great on my end. Haven't tested GO yet as it requires a big download (idmapping_selected.tab.gz) that I don't have locally ATM.

One question I have is, does the code to actually populate DipperCache with the files live in here somewhere? Can't find it in this PR, although maybe I'm missing it. If it's not here, maybe that should go in this repo somewhere?

TomConlin commented 4 years ago

The code populating the cache is in a repo that is temporarily still owned by me. The goal is to come up with a deployment process and then transfer it to Monarch. The deploy is in a bit of limbo while the Monarch VMs have network issues.

I am keen on hearing thoughts on how to robustly deploy a purely static site in this day and age.

The repo you are wondering about is currently https://github.com/TomConlin/DipperCache

justaddcoffee commented 4 years ago

> the repo you are wondering about is currently https://github.com/TomConlin/DipperCache

Could that code possibly move into Dipper? It seems like it belongs here, since it is going to become part of the 'E' in our ETL pipeline. (Otherwise, we'll need to add/update URLs for file downloads in both the Dipper repo and this other repo.) Just a thought.
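To illustrate the duplication concern, here is a minimal sketch of keeping download URLs in one place and deriving the cache location from them. Everything in it is an assumption made for illustration: `CACHE_BASE`, the `SOURCES` registry, the example file names and hosts, and the `cache_url`/`upstream_url` helpers are hypothetical and are not taken from Dipper or DipperCache.

```python
from urllib.parse import urljoin

# Hypothetical cache root; the real DipperCache location is not assumed here.
CACHE_BASE = "https://cache.example.org/DipperCache/"

# Hypothetical single registry: source name -> {file name: upstream URL}.
# Both the ingest code and the cache-populating code would read this mapping,
# so a URL only has to be added or updated once.
SOURCES = {
    "wormbase": {
        "gene_ids.txt.gz": "https://upstream.example.org/wormbase/gene_ids.txt.gz",
    },
    "reactome": {
        "pathways.txt": "https://upstream.example.org/reactome/pathways.txt",
    },
}


def upstream_url(source: str, filename: str) -> str:
    """Return the original download URL recorded in the registry."""
    return SOURCES[source][filename]


def cache_url(source: str, filename: str) -> str:
    """Derive the cached copy's URL; raise KeyError for unregistered files."""
    if filename not in SOURCES[source]:
        raise KeyError(f"{filename!r} is not registered for {source!r}")
    return urljoin(CACHE_BASE, f"{source}/{filename}")
```

With a layout like this, an ingest would call `cache_url("wormbase", "gene_ids.txt.gz")`, while whatever populates the cache would call `upstream_url` with the same arguments.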

TomConlin commented 4 years ago

@justaddcoffee entirely possible. Won't be determined till we see how it is actually deployed.