sdparekh / zUMIs

zUMIs: A fast and flexible pipeline to process RNA sequencing data with UMIs
GNU General Public License v3.0
271 stars 67 forks source link

BUG When using fastq with no .gz suffixed, cell barcode can not be extracted. #249

Closed crotoc closed 3 years ago

crotoc commented 3 years ago

Thanks for fix my last bug and I encounter a new one.

Describe the bug A clear and concise description of what the bug is. In zUMIs.sh at line 257, pref=$(basename ${f} ) should be pref=$(basename ${f} .gz)

image

Because the former step is splitfq.sh, which will generate splitted gz files. If the .gz is not removed from basename of ${f}, it will result the line 96 of qfilter_v2.ql to add an extra .gz at the end of the splited gz files. Then no bam file can be produced and no cell barcode can be identified.

To Reproduce Provide the YAML file and full standard output.

Screenshots If applicable, add screenshots to help explain your problem.

Desktop (please complete the following information):

Additional context Add any other context about the problem here.

cziegenhain commented 3 years ago

I would recommend to start with gzipped fastq files instead. We will probably retire the option for uncompressed input because it is not really practical.

crotoc commented 3 years ago

Okay. Thanks!