Closed CarlosClassen closed 1 month ago
Despite efforts to use the conda packaged version of AnnotSV with the parameter -annotationsDir
, it's not able to find the refGene.txt list.
Maybe it's better to use a containerized version and mount it with singularity arguments during snakemake invocation.
Seems like just the refGene.txt list was not enough to make AnnotSV work. Figured it out now; to download the full annotation files - as given in the Makefile in the AnnotSV repository - I used this link.
Still, to avoid downloading this still large reference files, I'd suggest to just include a manual in the README.md
for the users to download them by themselves and give a path option in the configfile.
With this now clear, we can also stick to the conda version of the tool, avoiding the user to be able to run singularity containers.
Description
While running the AnnotSV step in our Snakemake workflow, the process fails with an error indicating that it cannot find the reference file
refGene.txt.gz
. Below is the specific error message:Snakemake Rule
Here is the Snakemake rule that triggers the error:
Steps to Reproduce
annotsv
rule.refGene.txt.gz
is not found.Expected Behavior
The
wget
command should download therefGene.txt.gz
file to the specified location, and AnnotSV should be able to access it without any issues.Actual Behavior
The
wget
command either fails to download the file or places it in an incorrect directory, causing AnnotSV to fail due to the missing reference file.Possible Solutions
config['params']['annotsv']['refGene']
is correct and accessible./tmp/Annotations_Human/Genes/GRCh38/
exists before running thewget
command.wget
from downloading the file.To make this pipeline executable behind a firewall in production environments, we should remove the downloading of the reference file and instead do a local copy of the file to the destination AnnotSV expects it to be or figure out how to give the correct path as a parameter