Closed kautto closed 4 years ago
Thanks @kautto for reaching out. Just to check the obvious. These reads have been already collapsed with the pacbio ccs program?
Other than that I would need some examples to be able to see if I can help you.. Thanks Fritz
Hi Fritz,
yeah, they're actual HiFi/CCS reads, run through SMRTLink 8.
As a bit more detail, the reads are a subset of reads extracted from an already aligned BAM, i.e.
Step 2 isn't actually changing anything - I checked to make sure the FASTQ records stayed the same.
I just noticed that step 1 was done with ngmlr 0.2.7 and htslib/samtools 1.9, and step 3 (where it fails) is with htslib/samtools 1.10.
If I take the failing reads and attempt realignment in a conda environment with htslib 1.9, it works again - but as soon as I switch to 1.10 it fails. So it looks like some kind of incompatiblity with the newest version of htslib?
It only affects a small subset of reads, 36 out of about 280k, all which are supplementary (flag 2048 or 2064). I'll email you a FASTQ with some example reads so you can try to reproduce it.
Best, Esko
In case others run into this issue, it seems to be related to the new samtools/htslib version (1.10). Reverting back to 1.9 works fine, so that's my workaround for now.
I'm getting an odd error with some HiFi data I was trying to align. When I pipe the output straight from ngmlr to samtools sort, it crashes with something along the lines of:
If I write out the SAM output first and try to samtools view it, it's instead
Looking at the actual offending read in question, it's clear something went awfully wrong, since there's a negative mapping quality:
It looks like an integer overflow issue, but I have no idea why it's suddenly happening to some reads. I'm running ngmlr 0.2.7 and samtools 1.10
There's nothing particularly weird about the data, it's human data aligned to hs38DH and I used --bam-fix (although that doesn't seem to apply in this case).
Without having to write some kind of intermediate parser to strip out bad alignments, is there an easy fix to this?