Thanks for providing the code. I have a question over your choice of MASK_TOKEN_INDEX, where you resort to MASK_TOKEN_INDEX = residue_constants.restypes_with_x.index('X'). That is, the mask token index is equivalent to the X amino acid.
However, this may cause some errors when calculating the loss. From my point of view, you forced the predicted probabilities of unknown residue (X) to be close to 0.
However, when you processed the data, any unknown residues were indexed using X's index.
Therefore, it seems weird to me that the GT for those unknown residues is X' index but their predicted prob. of X is tuned to 0.
Though no loss imposed in this case (so it is acceptable?).
Hi, Jason,
Thanks for providing the code. I have a question over your choice of
MASK_TOKEN_INDEX
, where you resort toMASK_TOKEN_INDEX = residue_constants.restypes_with_x.index('X')
. That is, the mask token index is equivalent to theX
amino acid.However, this may cause some errors when calculating the loss. From my point of view, you forced the predicted probabilities of unknown residue (
X
) to be close to 0.However, when you processed the data, any unknown residues were indexed using
X
's index.Therefore, it seems weird to me that the GT for those unknown residues is
X
' index but their predicted prob. ofX
is tuned to 0. Though no loss imposed in this case (so it is acceptable?).