Closed sungho-ham closed 7 months ago
I found that my earlier suggestion was wrong and that it has already been fixed (https://github.com/NVIDIA-Merlin/Transformers4Rec/pull/723#pullrequestreview-1490574571). However, as far as I understand, the results are still not correct, so I am reporting the problem again following the bug report format.
Description
When `clm` is used for masking, it generates the wrong masking schema. This can be checked with a simple test. For an input of length 2, there should be no difference between the last-item-only and all-items conditions. However, the two conditions produce different results.
Related Code
https://github.com/NVIDIA-Merlin/Transformers4Rec/blob/348c9636399535c566d20e8ebff2b7aa0775f136/transformers4rec/torch/masking.py#L298
I think the following code is correct:
mask_labels = labels != self.padding_idx
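
To make the expectation in the description concrete, here is a minimal sketch of the two masking conditions. It is not the Transformers4Rec implementation: plain Python lists stand in for tensors so it runs without dependencies, the helper names (`shift_labels`, `all_items_mask`, `last_item_mask`) are hypothetical, and a padding id of 0 is assumed. It only shows that, for a session with two items, both conditions should select the same single target position.

```python
PADDING_IDX = 0  # assumed padding id, for illustration only


def shift_labels(item_ids):
    """Next-item targets: drop the first item and pad the end."""
    return item_ids[1:] + [PADDING_IDX]


def all_items_mask(item_ids):
    """Mask every position that has a real (non-padding) target."""
    labels = shift_labels(item_ids)
    return [label != PADDING_IDX for label in labels]


def last_item_mask(item_ids):
    """Mask only the position whose target is the last real item."""
    labels = shift_labels(item_ids)
    n_targets = sum(label != PADDING_IDX for label in labels)
    mask = [False] * len(labels)
    if n_targets > 0:
        mask[n_targets - 1] = True
    return mask


# A session with two items, padded to length 4.
short_session = [5, 7, 0, 0]
print(all_items_mask(short_session))
print(last_item_mask(short_session))

# A session with three items, where the two conditions should differ.
longer_session = [5, 7, 9, 0]
print(all_items_mask(longer_session))
print(last_item_mask(longer_session))
```

With two items there is only one next-item target, so both masks should be identical. The masks are only expected to diverge once the session has three or more items.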