Open dheerajbobbili1988 opened 4 years ago
I think it is fine to try samblaster here with the --ignoreUnmated option. I see two issues:
1) Since it appears you have pulled reads only from chrom20, there are bound to be reads in the input that have their mate aligned to a different chromosome in the original reference. This is what is probably causing your unmated reads. I guess the fact that the first unmated read shown appears to be unaligned is due to the change in reference. Also, I hope the samtools fastq command will work on such input with unmated pairs. In the samblaster output stats, you should see the number of unmated pairs as a percent of all pairs and you can then judge if you think this chrom20 selection is indeed the issue. I suggest redirecting stderr in the samblaster command to capture these stats.
2) You don't need to sort into name order before using samblaster to mark duplicates. The samtools collate command will already make the input "read-id grouped" which is all samblaster requires, and BWA will not change the order of the reads in the output.
Hi,
I am trying to realign a bam file to a new reference and in the process I would like to use samblaster. When I use the command below, I am running into this error.
Here is my command
Now, my question is in this scenario can I safely add "--ignoreUnmated" flag or is samblaster is not suited for this purpose. Please let me know.