Closed dkeitley closed 3 years ago
Hi Dan,
I appreciate your feedback and thank you for pointing out this bug.
The issue you are facing is likely due to the removal of temporary files (temp.txt and temp2.txt) from parallel sample processes. I have updated the rule "getCellsList" in Snakefile to create temporary files for each sample in your data instead of creating temp files that may get removed by parallel processes.
I hope that helps resolve your issue!
Best, Michael
Thanks Michael! I changed the pushed fix slightly (see below) in my local version just so that it's renamed more cleanly. I think {input}.temp.txt was coming out as "{path}/{sample}_gene_dge.summary.txt.temp.txt".
rule getCellsList:
input: '{path}/{sample}_gene_dge.summary.txt'
output: '{path}/{sample}_cellList.txt'
params:
sample='{sample}'
shell:
"""
sed '/^#/ d' < {input} > {params.sample}_temp.txt
tail -n +3 {params.sample}_temp.txt > {params.sample}_temp2.txt
cut -f1 {params.sample}_temp2.txt > {output}
rm {params.sample}_temp.txt
rm {params.sample}_temp2.txt
"""
Thanks again,
Dan
Hi Michael,
Sorry to open another issue.
I just tried running the pipeline on my own samples and am getting an error relating to the temp2.txt file that's produced in the getCellsList rule.
I tried leaving out this sample and running on the rest but I'm still getting the same error. It works fine running on a single sample, so I'm not sure if it's an issue with certain fastq files or if it's due to something else...?
Any suggestions would be much appreciated.
Many thanks,
Dan