Closed mpaya closed 4 years ago
Thanks for reporting :)
I am guessing this is happening because I do some filtering by default on BAM files. The --filter-flag option
is set to 1540 by default. https://broadinstitute.github.io/picard/explain-flags.html explains this value as:
read unmapped (0x4)
read fails platform/vendor quality checks (0x200)
read is PCR or optical duplicate (0x400)
So your reads are most likely being dropped by this default filter, since it removes anything marked as a duplicate. Could you try with --filter-flag
set to 0 and see if this solves your problem?
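For anyone who wants to check the 1540 decoding without the Picard page, here is a minimal sketch in plain Python. The bit names come from the SAM specification; the `explain_flag` helper is a hypothetical name used only for illustration and is not part of the tool discussed here.

```python
# Standard SAM flag bits, as defined in the SAM specification.
SAM_FLAGS = {
    0x1: "read paired",
    0x2: "read mapped in proper pair",
    0x4: "read unmapped",
    0x8: "mate unmapped",
    0x10: "read reverse strand",
    0x20: "mate reverse strand",
    0x40: "first in pair",
    0x80: "second in pair",
    0x100: "not primary alignment",
    0x200: "read fails platform/vendor quality checks",
    0x400: "read is PCR or optical duplicate",
    0x800: "supplementary alignment",
}


def explain_flag(value):
    """Return the names of all SAM flag bits set in `value` (hypothetical helper)."""
    return [name for bit, name in sorted(SAM_FLAGS.items()) if value & bit]


# 1540 = 0x4 + 0x200 + 0x400
for name in explain_flag(1540):
    print(name)
```

This should list the same three conditions as above, and `explain_flag(0)` returns an empty list, which is why a filter flag of 0 excludes nothing.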
Anyways, I should really print what the --filter-flag value
was when using BAM files :) Otherwise, I expect this will confuse many people in the future too :)
@mpaya Congrats on the paper btw!
Thanks! ^^ Your comments explain what happened: if --filter-flag removes duplicates by default, --keep-duplicates may have none left to keep. I'll test this option when I have a moment. Best.
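To make that reasoning concrete, here is a rough sketch of how an exclusion mask interacts with duplicate-marked reads. It assumes the filter works like `samtools view -F` (a read is dropped if it has any bit of the mask set) and that it runs before any duplicate handling. Neither detail is confirmed in this thread, and the read flags and function names below are made up for illustration.

```python
DUPLICATE = 0x400  # SAM flag bit: read is PCR or optical duplicate

# Hypothetical read flags: two plain reads and two marked duplicates.
read_flags = [0, 16, 0 | DUPLICATE, 16 | DUPLICATE]


def passes_filter(read_flag, filter_flag):
    """Assumed samtools -F semantics: drop the read if any masked bit is set."""
    return (read_flag & filter_flag) == 0


def surviving_duplicates(filter_flag):
    """Count marked duplicates that are still present after filtering."""
    return sum(
        1
        for flag in read_flags
        if passes_filter(flag, filter_flag) and flag & DUPLICATE
    )


print(surviving_duplicates(1540))  # default mask: duplicates are already gone
print(surviving_duplicates(0))     # mask 0: duplicates remain available
```

Under these assumptions, the default mask leaves no marked duplicates for a later keep-duplicates step to act on, while a mask of 0 leaves them all in place.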
Closing the issue: using --keep-duplicates with --filter-flag set to 0 gives the expected result. Thanks.
Hi. While using the option
--keep-duplicates
I noticed that it is not working as expected. The process used in the analysis is:
Usually it's fine to feed marked or filtered files to other software, which gives distinct outputs for each, but here the behavior is weird when using the filtered or marked files. I'll indicate some relevant outputs below:
I'd expect:
I get that the process used to remove duplicates may differ, but reads shouldn't be dropped unless that is requested, even if they are marked as duplicates. Whenever you have time, please take a look at why this is happening. Thanks.