Closed TomConlin closed 4 years ago
Both worm and reactome ingests with DipperCache work great on my end. Haven't tested GO yet as it requires a big download (idmapping_selected.tab.gz
) that I don't have locally ATM.
One question I have is, does the code to actually populate DipperCache with the files live in here somewhere? Can't find it in this PR, although maybe I'm missing it. If it's not here, maybe that should go in this repo somewhere?
The code populating the cache is in a repo temporary still owned by me. Goal is to come up with a deployment process then transfer it to Monarch The deploy is in a bit of limbo while Monarch VMs have network issues.
I am keen on hearing thoughts on how to robustly deploy a purely static site in this day and age.
the repo you are wondering about is currently https://github.com/TomConlin/DipperCache
the repo you are wondering about is currently https://github.com/TomConlin/DipperCache
Possibly could move that code into Dipper? Seems like it belongs here, since it is going to become part of the 'E' in our ETL pipeline. (Otherwise, we'll need to add/update URLs for file downloads in both the Dipper repo and this other repo.) Just a thought
@justaddcoffee entirely possible. Won't be determined till we see how it is actually deployed.
DipperCache is not officially deployed yet but we can start using the preview version sitting on archive.mi/DipperCache.
Source.py
self.files[src_key]['url']
Go & Reactome got minimal refactors to pull a a preprocessed gaf-eco mapping file from Dipper cache.
Wormbase needed a more extensive minimal refactor
test and translation tables are adjusted as needed
Away at CCDH meeting (at which KS said the above was okay to merge)
Additional item
Mouse ingests have the greatest variety of labs/urls and need most fixes
Wormbase ingest had an extra stutter in
species/c_elegans/PRJNA13758/c_elegans.PRJNA13758.WS273.annotations.gff3.gz