tleonardi / nanocompore

RNA modifications detection from Nanopore dRNA-Seq data
https://nanocompore.rna.rocks
GNU General Public License v3.0
78 stars 12 forks source link

What type of signal is plotted #129

Closed lingolingolin closed 4 years ago

lingolingolin commented 4 years ago

Hi @a-slide @tleonardi ,

I am not clear where the signal values plotted by nanocompore plot_signal function at each reference position come from. https://nanocompore.rna.rocks/demo/SampCompDB_usage/

Because the signals can be either event_level_mean or samples associated with the event_leve_mean value from nanopolish eventalign results. I attached one snapshot of nanopolish eventalign results in order to help making sense of my question.

Thanks a lot. :-)

image

lingolingolin commented 4 years ago

I looked into the codes. it seems to be the raw signals, i.e., the samples. But how nanocompore treat NNNN and mismatch kmers during statistical analyses and plotting? Thanks again.

a-slide commented 4 years ago

Yes it is the samples median, which is more precise that the event level mean. NNNN and mismatching kmers are normally skipped in the analyses.

If you want to visualise the proportion of these kmers you can use the plot_bleeding_hulk function in SampCompDB API. It will give you something like that:

output_45_0

lingolingolin commented 4 years ago

@a-slide ,cool. thanks very much for clarifying it. but do you mean sometimes 'nnnnn' and 'mismatching' kmers are kept? what is the 'missing' situation?

a-slide commented 4 years ago

NNNNN, mismatching and missing kmers are not used in the statistical test. Missing kmers are when a the reference kmers is not found in the signal and Nanopolish just skip it