Can this be a problem when choosing the same token id for both mask token and unknown residues?

Hi, Jason,

Thanks for providing the code. I have a question over your choice of MASK_TOKEN_INDEX, where you resort to MASK_TOKEN_INDEX = residue_constants.restypes_with_x.index('X'). That is, the mask token index is equivalent to the X amino acid.

However, this may cause some errors when calculating the loss. From my point of view, you forced the predicted probabilities of unknown residue (X) to be close to 0.

However, when you processed the data, any unknown residues were indexed using X's index.

Therefore, it seems weird to me that the GT for those unknown residues is X' index but their predicted prob. of X is tuned to 0. Though no loss imposed in this case (so it is acceptable?).

jasonkyuyim / multiflow

Can this be a problem when choosing the same token id for both mask token and unknown residues? #8