Closed lckarssen closed 4 years ago
secondary+supplementary+unmapped are discarded.
idea: use comm
to find the missing reads ?
Thanks for the explanation about the missing reads. I had a look at the code and I think unmapped reads are included by default as well (see the if-block starting at line 620).
As yes, sorry, I wrote this yearsss ago :-)
Subject of the issue
When using
cmpbams
it looks like the output misses some reads. For example, when comparing a CRAM file to itself, I see the read phase lists:for both files. This is consistent with the output of
samtools idxstats
. However, if I count number of output lines I get fromcmpbams
I get 102380853.Your environment
9c88a80
${JAVA_HOME}
: emptySteps to reproduce
compbams
was run like this:This happened both for a comparison of two full CRAM files and for a comparison of part of a chromosome.
Expected behaviour
The total number of reads reported by
samtools idxstats
and/or the read phase ofcmpbams
should be identical.Actual behaviour
The two numbers of reads differ (the number of lines in the
cmpbams
output is lower).