snakemake-workflows / dna-seq-varlociraptor

A Snakemake workflow for calling small and structural variants under any kind of scenario (tumor/normal, tumor/normal/relapse, germline, pedigree, populations) via the unified statistical model of Varlociraptor.
MIT License
82 stars 38 forks source link

feat: improve and run sra download ci test case #276

Closed dlaehnemann closed 10 months ago

dlaehnemann commented 10 months ago

This improved test case, with documentation on the data used in the .test/config-sra/README.md file, should be configured to run through in a reasonable amount of time. It is set up to also test the case where one sample is used in multiple groups.

dlaehnemann commented 10 months ago

One further idea for this test: We could run this in parallel to all the other tests, to save overall runtime of the tests (and maybe speed up being able to merge stuff). To avoid running too much stuff while there are still blatant errors on a pull request, I would split the current Testing into three different sections:

  1. A Dryruns section for a quick number of --dryrun tests (without doing the Free Disk Space (Ubuntu) step), to catch the most blatant errors.
  2. A Short Tests section, that needs: Dryruns to have completed successfully.
  3. A Long Test section, that also needs: Dryruns to have completed succesfully.

And I would add a mechanism to stop all other tests if any of them fails. The snakemake test suite seems to have this, but I have not configured something like this in the past.

One downside is that both 2. and 3. will have to probably do the `Free Disk Space (Ubuntu) step (although we could try without them) and they will both have to separately set up all the conda environments...