Closed mortunco closed 8 years ago
Tunc; You're exactly right in your diagnosis -- there is something problematic about the ICGC BAM files that GATK does not like. My suggestion would be to align with bwa through bcbio so you get clean inputs:
aligner: bwa
If you don't want to do that you could also try cleaning them as you suggest by adding:
bam_clean: picard
This may take some manual work on your side to clean up the quality scores in the input files if neither of these work cleanly and the original file is a mix of multiple quality encoding types, but fingers crossed one of these will work. Hope this helps.
Dear Brad;
If you may look at my configuration files, I have already settled bam_clean: picard
. Did you mean leaving bam_clean
only there and remove `bam_sort`` or trying that option. Because I have already included them in my configuration file and got this option.
Thank you,
Tunc
Tunc;
My mistake, sorry. I guess the cleanup we have is not handling this input file. I don't believe bam_sort
will help with this problem since it looks like it's due to misencoded quality scores. Hopefully realigning with bwa will resolve the issue.
Hi,
I have two bam files(normal-tumor). I would like to call variants across these samples. Since I dont require alignment I preferred bam clean and bam sort as it is suggested in the bcbio documentation. As I looked for the error, I dont know in which point I piped the wrong input. These bam files are the bam files from ICGC consortium so I dont know what I can do? Pass bam sort and clean ?
During the run I came across with the following error;
Also I would like to share my config file, maybe i did something wrong