populationgenomics / production-pipelines

Genomics workflows for CPG using Hail Batch
MIT License
2 stars 0 forks source link

Simplify Alignment Job and Resolve SAM Flag Issue in CRAM to FASTQ Conversion #802

Open michael-harper opened 1 week ago

michael-harper commented 1 week ago

Context and Motivation

The primary motivation for these changes is to address an underlying bug in bazam, the tool we previously used for converting CRAM files to FASTQ format for re-alignment. The bug causes the SAM flag relating to read pair orientation to be overwritten for correctly oriented read pairs on the reverse strand. This is critical because it impacts the accurate representation of read pair orientations in re-aligned CRAM files. Notably, this issue does not affect the individual read strand flag, which remains correctly preserved.

Changes Introduced