abyzovlab / CNVpytor

a python extension of CNVnator -- a tool for CNV analysis from depth-of-coverage by mapped reads
MIT License

CNVpytor Parameters #206

Closed zainabae closed 8 months ago

zainabae commented 8 months ago

Hello,

I have a question about the default parameter selection.

In my thesis project, I used the default parameters for CNV filtering. However, a reviewer has asked me to explain the bioinformatic rationale behind these defaults, specifically pN > 50%, q0 > 50%, and excluding CNVs smaller than 3 bins. Why are the default thresholds set at these values? Are they based on a specific equation or a sensitivity analysis?

arpanda commented 8 months ago

Here is the description for q0 and pN:

pN and q0 are used to filter out regions of the genome that are low-confidence and noisy, such as those with gaps or low mapping quality.

The minimum number of bins in a call also helps to reduce noise and false-positive calls, which can arise for many reasons, such as repeat regions or complex sequence structure. The threshold increases confidence in each call. The specific number is based on basic intuition.
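To make the three filters concrete, here is a minimal sketch in plain Python. It does not use the CNVpytor API: the `Call` record, the `passes_filters` helper, and the example calls are all hypothetical. It assumes pN and q0 are per-call fractions between 0 and 1 (gap content and low-mapping-quality reads, respectively) and applies the thresholds mentioned in the question.

```python
from dataclasses import dataclass


@dataclass
class Call:
    """Hypothetical CNV call record (not a CNVpytor type)."""
    chrom: str
    start: int  # 1-based, inclusive
    end: int    # inclusive
    pN: float   # assumed: fraction of the call covered by reference gaps
    q0: float   # assumed: fraction of reads with zero mapping quality


def passes_filters(call, bin_size, max_pN=0.5, max_q0=0.5, min_bins=3):
    """Return True if the call survives all three filters."""
    n_bins = (call.end - call.start + 1) / bin_size
    return call.pN <= max_pN and call.q0 <= max_q0 and n_bins >= min_bins


# Made-up calls for illustration only, with a 1,000 bp bin size.
calls = [
    Call("chr1", 10001, 15000, pN=0.0, q0=0.1),  # 5 bins, clean: kept
    Call("chr1", 20001, 22000, pN=0.0, q0=0.0),  # only 2 bins: dropped
    Call("chr2", 1, 10000, pN=0.8, q0=0.0),      # mostly gaps: dropped
    Call("chr2", 50001, 60000, pN=0.0, q0=0.9),  # poorly mapped: dropped
]

kept = [c for c in calls if passes_filters(c, bin_size=1000)]
print([(c.chrom, c.start, c.end) for c in kept])
```

Varying `max_pN`, `max_q0`, and `min_bins` in a sketch like this and counting how many calls survive is one simple way to run the kind of sensitivity check a reviewer might ask for.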

I'm unsure if this addresses your question adequately.

-Arijit