snakemake-workflows / dna-seq-varlociraptor

A Snakemake workflow for calling small and structural variants under any kind of scenario (tumor/normal, tumor/normal/relapse, germline, pedigree, populations) via the unified statistical model of Varlociraptor.
MIT License
82 stars 39 forks source link

fix: update .snakemake-workflow-catalog.yml #204

Closed charmoniumQ closed 1 year ago

charmoniumQ commented 1 year ago

Add extra flags that enable this workflow to run.

See snakemake-workflow-catalog and click on "standards compliant" at the top for the documentation of this YAML file.

johanneskoester commented 1 year ago

The mandatory flags are actually not meant to provide a hint for how to run on the test data. The test data isn't even present when you run through the deployment steps presented by the workflow catalog. Also, this would be misleading as people could think that you always need to use --directory to run the workflow.

johanneskoester commented 1 year ago

Thinking about it, you nevertheless have a case here. It does not hurt to have a mechanism in the catalog config to display how to run an example run with test data. I am happy to take a proposal for that within the workflow catalog repository, and then we could fill this out here. Maybe:

example: |
  wget <url-to-test-data>
  unzip ...
  cd ...
  snakemake ...
charmoniumQ commented 1 year ago

That's a good point. We need clarity on a standardized place to put a command that should work by default for reproducibility's sake. I'm not sure where that would be right now.

dlaehnemann commented 1 year ago

Maybe it would also make sense to automate the creation of such a test scenario from an existing test in the .test/ directory. For example if there is one called .test/minimal/. But maybe this is also making it too complicated to get it off the ground...

charmoniumQ commented 1 year ago

My motivation for this was because I am working on automatically running snakemake workflows (no manual intervention). Currently I am going over repositories in snakemake workflow catalog, applying options from the .snakemake-workflow-catalog.yml, and from github CI ./github/workflows/*.yaml. Here is my code and some out-of-date results. Despite a lot of trying stuff, I can only get ~5% of the Snakemake workflows to run.

If you know anyone who would be interested in using or contributing to a standard for specifying how to run Snakemake workflows, or even just using the data, let me know mailto:grayson5@illinois.edu.