fls-bioinformatics-core / auto_process_ngs

Scripts and utilities for automatic processing & management of Illumina NGS sequencing data.
Other
9 stars 6 forks source link

Update Fastq generation and QC pipelines for Cellranger 8.0.0 #956

Closed pjbriggs closed 2 months ago

pjbriggs commented 3 months ago

Updates to the Fastq generation and QC pipelines and supporting functions, for compatibility with Cellranger 8.0.0.

Significant changes in Cellranger 8.0.0 which need to be accommodated in the QC pipeline are:

Creating BAM files will produce larger overall output sizes and can also result in longer compute times, but may not be particularly advantageous in all cases, specifically for Flex datasets it is recommended to disable BAM file creation (see https://kb.10xgenomics.com/hc/en-us/articles/24292856143885-Understanding-the-create-bam-Parameter-in-Cell-Ranger-v8-0).

The multi config template generation code has therefore been updated to take the target Cellranger version as optional input; the setup_analysis_dirs command supplies the version from the processing pipeline when generating templates (which should be consistent in most cases).

There is also a minor change to the per-sample metrics.csv files produced by 8.0.0 compared to earlier versions, in that 8.0.0 no longer outputs a median_reads_per_cell value (which is currently reported as part of the QC reporting); the mean_reads_per_cell value will be reported instead for version 8.0.0.

Finally, although this PR implements compatibility with Cellranger 8.0.0, the default version for testing remains at 7.1.0.