Closed fergusonscripps closed 1 year ago
Thanks! You are correct that the output also returns the vector of likelihoods for SOS and EOS (not real aa's), however; the vector of likelihoods for each position only contain values for the 20 aa's. This was a design choice based on how we are using the model.
The likelihoods are not ordered alphabetically, but you can view the order with the following:
print(model.tokenizer.vocab_to_aa)
I hope this answers your questions!
Great thank you!
Sent from Outlook for iOShttps://aka.ms/o0ukef
From: Tobias Hegelund Olsen @.> Sent: Wednesday, July 26, 2023 6:30:14 AM To: oxpig/AbLang @.> Cc: James Ferguson @.>; Author @.> Subject: Re: [oxpig/AbLang] Clarification Question (Issue #8)
Closed #8https://github.com/oxpig/AbLang/issues/8 as completed.
— Reply to this email directly, view it on GitHubhttps://github.com/oxpig/AbLang/issues/8#event-9924188889, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AZ7U2ZTY7OZACMXXD2LERYDXSELWNANCNFSM6AAAAAA2XVBZWE. You are receiving this because you authored the thread.Message ID: @.***>
Cool code trying it for a few things in the lab.
For the "Residue likelihood" I wanted to check. The first and last probabilitiy arrays are the SOS and EOS and do not correspond to a "real" amino acid residue?
Are the probabilities in alphabetical for the 20 essential amino acids?