tleonardi / nanocompore

RNA modifications detection from Nanopore dRNA-Seq data
https://nanocompore.rna.rocks
GNU General Public License v3.0

Downsample to the same coverage as the lower coverage sample #69

Open tleonardi opened 5 years ago

tleonardi commented 5 years ago

Very different coverage between samples might cause trouble during GMM clustering. At the moment we can downsample to a fixed coverage level, but it would be great to downsample each reference independently, so that the sample with higher coverage is brought down to the same level as the sample with lower coverage.
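A minimal sketch of the per-reference downsampling described here, not nanocompore's actual code. It assumes the reads are already grouped as a nested dict (`sample -> reference id -> list of read ids`); the function name `downsample_to_lowest` and that data layout are hypothetical and only serve the illustration.

```python
import random


def downsample_to_lowest(reads_by_sample, seed=42):
    """Downsample each reference to the coverage of the lowest-coverage sample.

    reads_by_sample: dict mapping sample name -> dict mapping reference id
    -> list of read ids (hypothetical layout, for illustration only).
    References missing from any sample are dropped.
    """
    rng = random.Random(seed)  # fixed seed so the selection is reproducible

    # Keep only references that are present in every sample
    shared_refs = set.intersection(*(set(refs) for refs in reads_by_sample.values()))

    downsampled = {sample: {} for sample in reads_by_sample}
    for ref in sorted(shared_refs):
        # Target coverage for this reference = lowest coverage among samples
        target = min(len(refs[ref]) for refs in reads_by_sample.values())
        for sample, refs in reads_by_sample.items():
            reads = list(refs[ref])
            if len(reads) > target:
                reads = rng.sample(reads, target)  # random subset, no replacement
            downsampled[sample][ref] = reads
    return downsampled


example = {
    "WT": {"tx1": [f"wt_{i}" for i in range(100)], "tx2": ["wt_a", "wt_b"]},
    "KO": {"tx1": [f"ko_{i}" for i in range(20)], "tx2": [f"ko_{i}" for i in range(50)]},
}
result = downsample_to_lowest(example)
for sample, refs in result.items():
    print(sample, {ref: len(reads) for ref, reads in refs.items()})
```

In this sketch each reference gets its own target, so a transcript that is well covered in both samples is not penalised by another transcript that is poorly covered in one of them.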

a-slide commented 5 years ago

Are you thinking of doing it on a position-by-position basis?

a-slide commented 5 years ago

I would actually suggest moving the reference-level downsampling from Whitelist directly to txComp, at the position level. The advantage of having it in the Whitelist is that it saves time, as we don't read the data discarded by the downsampling.
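For comparison, a rough sketch of what position-level downsampling could look like once the data has already been loaded. This is an assumption-laden illustration, not the txComp implementation: the function name `downsample_positions` and the `position -> list of values` layout are hypothetical.

```python
import random


def downsample_positions(cond_a, cond_b, seed=42):
    """Equalise coverage between two conditions, one position at a time.

    cond_a, cond_b: dicts mapping position -> list of per-read values
    (hypothetical layout). Only positions covered in both are kept.
    """
    rng = random.Random(seed)
    out_a, out_b = {}, {}
    for pos in sorted(set(cond_a) & set(cond_b)):
        # Target = coverage of the lower-coverage condition at this position
        n = min(len(cond_a[pos]), len(cond_b[pos]))
        out_a[pos] = rng.sample(cond_a[pos], n)
        out_b[pos] = rng.sample(cond_b[pos], n)
    return out_a, out_b
```

Note that this variant needs every read to be parsed before any of it is discarded, which is the time cost mentioned above relative to filtering reads up front.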