iprada / Circle-Map

A method for circular DNA detection based on probabilistic mapping of ultrashort reads
MIT License
62 stars 20 forks source link

how to avoid to getting repeat region ? #78

Open panxiaoguang opened 2 years ago

panxiaoguang commented 2 years ago

Hi Inigo,

I know that you use get_mate_intervals to build realign intervals, and you will use all the soft-clipped/hard-clipped reads and discordant reads in this function. But we know that if some eccDNA reads were from repeat regions (actually it's common), the aligner such as bwa will have multi-align reads including supplement reads and discordant reads mostly similar to eccDNA reads. Is it right?

So how can we avoid getting repeat regions in eccDNA? Will it work if I use --only_discordant?

Best wishes. Pan xiaoguang

iprada commented 2 years ago

Hi,

Read alignments to repeats and multi mapped regions should be handled by the probabilistic model. Reads with low probabilities are filtered out by default.

best,

Inigo