heathsc / gemBS

gemBS is a bioinformatics pipeline designed for high throughput analysis of DNA methylation from Whole Genome Bisulfite Sequencing data (WGBS).
GNU General Public License v3.0
32 stars 21 forks source link

mapped bam files: SM key without value #100

Open NikolausMandlburgerCCRI opened 1 year ago

NikolausMandlburgerCCRI commented 1 year ago

Dear developers! The headers of the mapped bam files produced by the latest version of gemBS (gemBS 3.5.5) contain a SM key without value. This causes problems in various tools, for example picard MarkDuplicates (https://broadinstitute.github.io/picard/). This can easily be circumvented by adding a placeholder value (SM:placeholder), nevertheless it would probably be nice to not have to do this kind of extra reheadering, hence I wanted to draw your attention to this issue. All the best, Nikolaus gemBS_issue

To generate the error (in the directory where picard.jar is located): java -jar picard.jar MarkDuplicates I=test.bam O=test_deduped.bam M=marked_dup_metrics.txt REMOVE_DUPLICATES=true