Open slzhao opened 4 years ago
@slzhao This is definitely a reasonable thing to do. Please feel free to issue a PR to do this.
That said, I have to give you a warning. Some of the data sources rely on sqlite3 and can therefore have issues on some distributed filesystems (see this post on Lustre/NFS errors). There are a few posts in the GATK forums about this as well.
So you may want to do some testing before using one centralized copy of the data sources.
As a heads-up - I do not have access to a Lustre filesystem so I am unable to do any debugging with it.
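For the testing mentioned above, one rough starting point is a SQLite smoke test run against the shared directory before pointing the workflow at it. The sketch below is only an assumption about what such a check might look like: the function name `sqlite_smoke_test` and the scratch file name are hypothetical, and it only exercises basic locking, writing and reading through Python's standard `sqlite3` module. Passing it does not guarantee that Funcotator's own database access will work on that filesystem.

```python
import os
import sqlite3
import sys
import tempfile


def sqlite_smoke_test(directory):
    """Return (True, "") if SQLite can lock, write and read a scratch
    database inside `directory`, otherwise (False, error message)."""
    # Hypothetical scratch file name; it is removed again at the end.
    db_path = os.path.join(directory, "sqlite_smoke_test.db")
    try:
        conn = sqlite3.connect(db_path, timeout=5)
        try:
            # Autocommit mode so the transaction is managed explicitly.
            conn.isolation_level = None
            # BEGIN IMMEDIATE requests the write lock up front, which is
            # where file-locking problems on network filesystems tend to show.
            conn.execute("BEGIN IMMEDIATE")
            conn.execute("CREATE TABLE IF NOT EXISTS t (x INTEGER)")
            conn.execute("INSERT INTO t (x) VALUES (1)")
            conn.execute("COMMIT")
            count = conn.execute("SELECT COUNT(*) FROM t").fetchone()[0]
        finally:
            conn.close()
        return count >= 1, ""
    except sqlite3.Error as exc:
        return False, str(exc)
    finally:
        if os.path.exists(db_path):
            os.remove(db_path)


if __name__ == "__main__":
    # Pass the shared data sources directory as the first argument;
    # falls back to the local temp directory if none is given.
    target = sys.argv[1] if len(sys.argv) > 1 else tempfile.gettempdir()
    ok, err = sqlite_smoke_test(target)
    print("OK" if ok else "FAILED: " + err)
```

Running it from several compute nodes at once against the same directory would be closer to the real workload than a single run from a login node.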
Feature request
Tool(s) or class(es) involved
MuTect2 wdl (mutect2.wdl), task Funcotate
Description
Would it be a good idea to add an option to skip the "Extract our data sources" step in mutect2.wdl? I am running mutect2.wdl on an HPC system where all the data sources and gnomAD for Funcotate are already unzipped and ready to use, so there is no need to extract the data sources on every run. Skipping that step would save time and resources. I can make the change and open a pull request if it sounds like a good idea.
The code in mutect2.wdl that I would make optional is listed below: