XieResearchGroup / DISAE

MSA-Regularized Protein Sequence Transformer toward Predicting Genome-Wide Chemical-Protein Interactions: Application to GPCRome Deorphanization
Other
11 stars 4 forks source link

question about the distilled triplets #6

Open wawpaopao opened 2 years ago

wawpaopao commented 2 years ago

I am still a bit confused about this file.Take the first senqunece 'A0A016RYG7' in the 'gpcr_uniprot2triplets.json' as an example,there are 210 conserved positions. I don't understand how to get the distilled triplets of 'A0A016RYG7' like 'pnt ntp tpl pls' at the start because it seems no relation between the sequence of 'A0A016RYG7' in the Uniprot. I mean there is no 'pnt' in the sequence.

Thank you very much!

lxie21 commented 2 years ago

Here is the explanation from the developer:

"I get the sequence from Uniprot and find that there are “P” “N” “T”.

We find conserved position first then slid on the distilled sequence to get triplets."

On Fri, Dec 17, 2021 at 11:15 AM wawpaopao @.***> wrote:

I am still a bit confused about this file.Take the first senqunece 'A0A016RYG7' in the 'gpcr_uniprot2triplets.json' as an example,there are 210 conserved positions. I don't understand how to get the distilled triplets of 'A0A016RYG7' like 'pnt ntp tpl pls' at the start because it seems no relation between the sequence of 'A0A016RYG7' in the Uniprot. I mean there is no 'pnt' in the sequence.

Thank you very much!

— Reply to this email directly, view it on GitHub https://github.com/XieResearchGroup/DISAE/issues/6, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABZSBCUGE23IS3IYURDV3BDURNOZ7ANCNFSM5KJGOMTQ . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you are subscribed to this thread.Message ID: @.***>