tkzeng / Pangolin

Pangolin is a deep-learning method for predicting splice site strengths.
GNU General Public License v3.0
61 stars 32 forks source link

Could pangolin predict the splice site ab initio? #10

Open permia opened 1 year ago

permia commented 1 year ago

Hi, I am trying to predit the splice site of an RNA virus artificially expressed in host genome. Because the organism I studied is not model organism, there is no VCF file. The pangolin seem to must use a vcf file as the positional arguments. Could pangolin predict the splice site ab initio?

tkzeng commented 1 year ago

Yes, please see this script: https://github.com/tkzeng/Pangolin/blob/main/scripts/custom_usage.py

You will need to obtain the sequence around the sites you are interested in predicting. For this use case, you can take the mean or maximum predictions over all tissues.