maxplanck-ie / snakepipes

Customizable workflows based on snakemake and python for the analysis of NGS data
http://snakepipes.readthedocs.io
378 stars 85 forks source link

Check for valid fastq files before workflow execution #881

Closed WardDeb closed 1 year ago

WardDeb commented 1 year ago

'empty' or corrupt fastq files only trigger a pipeline failure very late in mRNA-seq stage (~bamCoverage rules). Perhaps an initial check (no unexpected EOF, at least x reads present) could be done before pipeline execution ?

katsikora commented 1 year ago

Ideally I would put the responsibility on the user, but if it's a fast check, why not having it. Is this by any chance an issue with SRA-derived files?

WardDeb commented 1 year ago

Done in #899