Speed up by filtering the comparison positions at SampComp step

tleonardi / nanocompore

RNA modifications detection from Nanopore dRNA-Seq data

GNU General Public License v3.0

Hi, per-position filtering by coverage is already implemented as part of the whitelisting functions. You get more flexibility if you run Nanocompore from Python and call Whitelist explicitly. For example:

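from nanocompore.SampComp import SampComp
from nanocompore.Whitelist import Whitelist

# Eventalign-collapse output files, grouped by condition and then by replicate
fn_dict = {
    "Cond1": {
        "Cond1_1": "/path/to/out_eventalign_collapse.tsv",
        "Cond1_2": "/path/to/out_eventalign_collapse.tsv",
    },
    "Cond2": {
        "Cond2_1": "/path/to/out_eventalign_collapse.tsv",
        "Cond2_2": "/path/to/out_eventalign_collapse.tsv",
    },
}
fasta = "/path/to/fasta_file"
outdir = "/path/to/out_dir"

# Build the whitelist explicitly; <options> is a placeholder for the Whitelist keyword arguments
wl = Whitelist(eventalign_fn_dict=fn_dict, fasta_fn=fasta, <options>)

# Pass the whitelist to SampComp; <options> is a placeholder for the SampComp keyword arguments
s = SampComp(eventalign_fn_dict=fn_dict, fasta_fn=fasta, outpath=outdir, whitelist=wl, <options>)

# Calling the SampComp object runs the comparison
db = s()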

The options supported by Whitelist are described in the Nanocompore documentation. However, at the moment there is no way to explicitly provide a list of positions of interest.
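To make the idea of per-position coverage filtering concrete, here is a minimal standalone sketch. It is not Nanocompore's implementation and does not use its API: the `coverage` mapping, the `positions_passing_coverage` helper, and the threshold of 30 reads are all hypothetical, chosen only for illustration.

```python
from typing import Dict, List

# Hypothetical input: coverage[sample_id][position] = number of reads covering that position
Coverage = Dict[str, Dict[int, int]]


def positions_passing_coverage(coverage: Coverage, min_coverage: int = 30) -> List[int]:
    """Return the sorted positions covered by at least min_coverage reads in every sample."""
    if not coverage:
        return []
    # Only positions observed in all samples can pass the filter
    shared = set.intersection(*(set(per_pos) for per_pos in coverage.values()))
    return sorted(
        pos
        for pos in shared
        if all(per_pos[pos] >= min_coverage for per_pos in coverage.values())
    )


# Toy coverage values for two conditions with two replicates each
coverage = {
    "Cond1_1": {10: 45, 11: 50, 12: 8},
    "Cond1_2": {10: 40, 11: 35, 12: 60},
    "Cond2_1": {10: 33, 11: 12, 12: 70},
    "Cond2_2": {10: 90, 11: 80, 12: 75},
}

kept = positions_passing_coverage(coverage, min_coverage=30)
print(kept)  # [10]
```

Positions 11 and 12 are dropped because at least one sample falls below the threshold there, so only position 10 would go on to the comparison step.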

Speed up by filtering the comparison positions at SampComp step #146