fls-bioinformatics-core / auto_process_ngs

Scripts and utilities for automatic processing & management of Illumina NGS sequencing data.
Other
9 stars 6 forks source link

'make_fastqs': add sequence length statistics to processing QC #974

Closed pjbriggs closed 1 week ago

pjbriggs commented 1 week ago

Updates the Fastq generation pipeline used by make_fastqs, to generate and report statistics on the sequence lengths of the generated Fastq files, as part of the processing QC.

This is useful to check any trimming of reads immediately after Fastq generation, without needing to do the full QC (as is the case at present).

Additionally as the updates re-use the existing code from the QC pipeline, the composition of reads (i.e. percentages of masked and padded reads in each file) is now also reported.