kcleal / dysgu

Toolkit for calling structural variants using short or long reads
MIT License
88 stars 10 forks source link

input bam files refinements #58

Closed linsson closed 1 year ago

linsson commented 1 year ago

Hi. In some SNP discovery protocols it is suggested to perform pcr duplicate labeling and base quality score recalibration if possible. Would you also recommend this before running dysgu?

As for pcr duplicates, in case, is tagging enough or is removal better?

kcleal commented 1 year ago

Hi @linsson, I recommend duplicate marking - this is standard procedure for most pipelines. Just tagging duplicates is enough - dysgu will ignore reads flagged as duplicate.

linsson commented 1 year ago

Thanks a lot @kcleal !