Sydney-Informatics-Hub / Fastq-to-VCF

High throughput illumina mammalian whole genome sequence analysis and joint genotyping
0 stars 0 forks source link

Running fastq split multiple times on same inputs can cause duplication of some reads #3

Closed calizilla closed 1 day ago

calizilla commented 1 day ago

If fastq split is run >1 on same input without cleaning Fastq_split output directory, some reads can end up duplicated because fastp does not always produce the same filepaths for outputs between runs. A cleanup is necessary between re-runs of fastq split.

calizilla commented 1 day ago

Fixed https://github.com/Sydney-Informatics-Hub/Fastq-to-VCF/pull/4