Closed Leo-T-Zang closed 4 months ago
Hi Leo, The last column is the prediction score of the model. The best Fmax score is usually achieved using a threshold between 0.2-0.3. However, if you like to see more specific annotations you might want to lower it. Depending on the protein sequence the predicted GO terms will be different.
Hi @coolmaksat,
Thanks a lot for your reply. If I understand correctly, higher score indicates more general GO terms like general function annotation, and lower score indicates very specific functions?
Not necessarily, some specific classes also have high scores, specifity of the classes mostly depend on the number of annotations. These scores represent prediction model's confidence.
Oh, I see. So you are saying that higher score means more confident prediction, but set threshold of 0.2-0.3 is good enough for best Fmax. Sorry if I misunderstand anything.
I guess my question is more from a user perspective: say if now I have some GO terms predicted from you model, should I select top 5 or top 10 for final annotation or use predicted score threhold 0.3 to filter predictions?
Thanks a lot !!
It is better to set a threshold, for example 0.3, but I would make this decision based on some indirect evaluation
Thanks!
Hi DeepGO2 Team,
Thanks for open-sourcing this useful tool!
I have question regarding how to understand the output predictions in tsv files, for example in example_preds_bp:
How do we understand the values in the last column and why each sequence has different output GO terms?
Thanks a lot!