LCR-BCCRC / lcr-modules

Collection of standard analytical pipelines for genomic and transcriptomic data
https://lcr-modules.rtfd.io
MIT License
15 stars 7 forks source link

Bug fix in QC module #290

Closed Kdreval closed 8 months ago

Kdreval commented 8 months ago

The qc module can take a custom bed file to restrict to loci where the qc metrics will be collected. While using it for external cohorts, I found that sometimes bed files can have uneven number of columns and consist of rows with standard chr\tstart\tend structure + plus other rows with uneven number of annotation columns. This was creating bedtools confusion in the rule _qc_sort_baits, and in turn causing this rule to fail while handling such bedfiles.

This PR adds a step to only process the standard first 3 columns of the bed file, therefore resolving the error.

This update was tested on toy bed files and on real samples in gambl, both tests were behaving as expected and generating the expected bed outputs.