wdecoster / nanocomp

Comparison of multiple long read datasets
MIT License
112 stars 9 forks source link

non-normalised histogram comparison? #69

Open celine304 opened 1 year ago

celine304 commented 1 year ago

Hi Wouter,

I'm trying to compare sequencing data from two different sequencing platforms (Pacbio sequel and revio) on the same graph but I can only get a graph where both have the same surface area but the read number for one is much higher than the other. Is there a way to not normalise the data like this as it does for nanoplot but show them in the same graph?

NanoComp_OverlayHistogram

Thanks, Celine

wdecoster commented 1 year ago

Hmm I would expect it not to normalize the dataset... remarkable. Let me look into that, but I will only have time for that after my holidays.

celine304 commented 1 year ago

Thanks Wouter, enjoy your holidays :)

wdecoster commented 1 year ago

Hrm it looks okay in a toy test with 50 and 300 reads:

image

In your plot, how much higher should the read number be for revio? There is at least a large difference in the ~5kb range...

celine304 commented 1 year ago

The sequel run had 2.2 million reads compared to 8.6 million for the revio