CGATOxford / cgat

Do not use - please refer to our newest code: https://github.com/cgat-developers/cgat-apps
BSD 3-Clause "New" or "Revised" License
124 stars 66 forks source link

bam_vs_bed: split alignments #189

Closed IanSudbery closed 9 years ago

IanSudbery commented 9 years ago

bam_vs_bed currently treats alignments as starting at the start location and continuing to the end location, irrespective of if the alignment is split (/spliced).

Adding split to the bedtools intersect call will treat the two halfs of aligment as seperate intervals, and thus will not count towards anything in the spliced out region. However, this would mean the read might be counted twice.

Suggestions?

AndreasHeger commented 9 years ago

Thorough counting in an RNA-seq context will be hard without redesigning the counting from scratch.

In the meantime, we can enable the option to "--split" - and add to the documentation that this might cause one read to be counted multiple times.