tkzeng / Pangolin

Pangolin is a deep-learning method for predicting splice site strengths.
GNU General Public License v3.0
64 stars 32 forks source link

interpretation of pangolin results #4

Closed MiqG closed 2 years ago

MiqG commented 2 years ago

Hi!

thanks for making such a great tool!

After reading the paper and as I have started using pangolin, I have some questions on how one should interpret the results.

For example, following your usage example, I ran the following version of your BRCA example: (Note that I have added a line without a mutation.)

gene,CHROM,POS,REF,ALT
BRCA1,17,41276135,T,T
BRCA1,17,41276135,T,G
BRCA1,17,41276135,T,C

This gives the following output:

gene,CHROM,POS,REF,ALT,Pangolin
BRCA1,17,41276135,T,T,ENSG00000012048.23_1|-50:0.0|-50:0.0|Warnings:
BRCA1,17,41276135,T,G,ENSG00000012048.23_1|-3:0.15|-16:-0.04|Warnings:
BRCA1,17,41276135,T,C,ENSG00000012048.23_1|-1:0.86|-3:-0.83|Warnings:

From your README (gene|pos:largest_increase|pos:largest_decrease|), I understand pos are the positions with maximum and minimum splicing scores increase and decrease relative to the site where we introduced the mutation.

Thank you very much in advance for your help! Cheers,

Miquel

Ni-Ar commented 2 years ago

I have the exact same question regarding interpretation of the output pos:scores nomenclature.

tkzeng commented 2 years ago
MiqG commented 2 years ago

Alright! Thanks, @tkzeng ! I will keep experimenting.