agemagician / ProtTrans

ProtTrans is providing state of the art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit and hundreds of Google TPUs using Transformers Models.
Academic Free License v3.0
1.13k stars 153 forks source link

Prediction meanings #156

Open lhkhiem28 opened 2 months ago

lhkhiem28 commented 2 months ago

Thanks for an amazing repo

I recently checked the Prediction/ProtBert-BFD-Predict-SS3.ipynb and I am not sure about the meaning of the predictions of the model Rostlab/prot_bert_bfd_ss3 For example: {'entity': 'C', 'index': 1, 'score': 0.9472618103027344, 'word': 'M'}

I guess an entity of E means Sheet, H means Helix, and C means Others. Am I correct?

Could you explain this? Thank you so much.

mheinzinger commented 2 months ago

Yes, indeed, E=sheet, H=helix, C=others.