biomedicalinformaticsgroup / Sargasso

Sargasso disambiguates mixed-species high-throughput sequencing data.
http://biomedicalinformaticsgroup.github.io/Sargasso/

Cope with unequal paired-end read lengths #55

Closed lweasel closed 6 years ago

lweasel commented 6 years ago

hits_checker.py assumes that both reads of every pair are the same length. However, this is not necessarily the case, e.g. data where all first reads are 76bp and all second reads are 75bp. Depending on which strand the pair maps to, the pair may be considered to have different lengths for different species, and the code that determines whether the minmatch or mismatch thresholds have been violated can go haywire.
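
To illustrate the failure mode, here is a minimal sketch. It is not the actual hits_checker.py code: the `Hit` record, `pair_lengths` and `violates_thresholds` are hypothetical names, and the thresholds are assumed to be fractions of read length. It shows that taking the length from whichever read comes first gives 76 for one species and 75 for the other, whereas keying the length on the mate and checking each read against its own length gives a consistent answer.

```python
from collections import namedtuple

# Hypothetical, simplified alignment record (not Sargasso's real data model).
Hit = namedtuple("Hit", ["is_read1", "read_length", "matches", "mismatches"])


def pair_lengths(hits):
    """Return (read 1 length, read 2 length), whatever order the hits are in."""
    lengths = {hit.is_read1: hit.read_length for hit in hits}
    return lengths[True], lengths[False]


def violates_thresholds(hits, mismatch_threshold, minmatch_threshold):
    """Check each read against thresholds scaled by that read's own length.

    Assumption: both thresholds are fractions of read length (0.0 to 1.0).
    """
    for hit in hits:
        max_mismatches = mismatch_threshold * hit.read_length
        min_matches = minmatch_threshold * hit.read_length
        if hit.mismatches > max_mismatches or hit.matches < min_matches:
            return True
    return False


# Same pair mapped to two species; strand differs, so the hit order differs.
species_a = [Hit(True, 76, 76, 0), Hit(False, 75, 75, 0)]
species_b = [Hit(False, 75, 74, 1), Hit(True, 76, 76, 0)]

# Naive approach: take the length from the first hit seen for each species.
naive_a = species_a[0].read_length
naive_b = species_b[0].read_length

# Per-mate approach: lengths no longer depend on hit order.
lengths_a = pair_lengths(species_a)
lengths_b = pair_lengths(species_b)
```

With the naive approach, `naive_a` and `naive_b` disagree even though they describe the same pair; with the per-mate approach, `lengths_a` and `lengths_b` are identical.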

hxin commented 6 years ago

Will be fixed with #64.