egaffo / circompara2

Improved bioinformatic pipeline to identify and quantify circRNA expression from RNA-seq data by combining multiple circRNA detection methods
Other
8 stars 0 forks source link

CLR/CLP Calculation for PE and SE Reads #27

Closed ChengxuanChen10 closed 2 months ago

ChengxuanChen10 commented 2 months ago

Hello,

On the wiki page, there's a mention of the calculation of the circular-to-linear expression ratio (CLR) and the circular-to-linear expression proportion (CLP) with formulas presented as follows: CLR = BJR / LSRBJ CLP = BJR / (LSRBJ + BJR)

Should the formula for CLR differ between paired-end (PE) reads and single-end (SE) reads, as CLR = 2*BJR / LSRBJ for PE reads and CLR = BJR / LSRBJ for SE reads?

egaffo commented 2 months ago

Not really. The read bearing the backsplice junction (BSJ) is (almost) always only one of the two read mates; therefore, you don't want to count the mate that does not contain the BSJ. Moreover, the counts reported by CirComPara2 refer to sequenced fragments, i.e., read pairs. You may have found the formula CLR = 2BJR / LSRBJ in some papers, but, to my understanding, it was not due to PE reads. I think the BJRs are counted twice because you are comparing two different splice points (the LSBJs, which will have different reads encompassing the splice junctions) against one splice point that actually includes two canonical splice points. In my opinion, the CLR = 2BJR / LSRBJ interpretation is legit.

ChengxuanChen10 commented 2 months ago

Thank you!