When I run the code, cls_loss becomes NaN. I think this is because the 'mask' in train.py can be zero, but I am not sure. I also have a second question: when running the code, I get the error "cuda runtime error: an illegal memory access was encountered". I hope to hear from you soon. Thank you very much.
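To test the mask hypothesis, here is a minimal sketch of how a zero mask can turn a masked average into NaN, and one possible guard. I have not seen how train.py actually computes cls_loss, so `masked_mean`, the tensor values, and the assumption that the mask is binary (0/1) and used as a divisor are all hypothetical.

```python
import torch


def masked_mean(per_elem_loss: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Average per-element losses over the positions where mask is 1.

    Hypothetical helper: assumes a binary mask, not the repo's real code.
    """
    mask = mask.to(per_elem_loss.dtype)
    total = (per_elem_loss * mask).sum()
    # Clamp the denominator so an all-zero mask gives 0 instead of 0/0.
    count = mask.sum().clamp(min=1.0)
    return total / count


loss = torch.tensor([0.5, 1.5, 2.0])   # made-up per-element losses
empty_mask = torch.zeros(3)            # no position selected

naive = (loss * empty_mask).sum() / empty_mask.sum()  # 0/0 -> NaN
safe = masked_mean(loss, empty_mask)                  # guarded version
print(naive, safe)
```

If the NaN disappears with a guard like this, the mask is a likely cause. The illegal memory access is probably a separate problem; running with the environment variable `CUDA_LAUNCH_BLOCKING=1` makes kernel launches synchronous, so the error should be reported at the call that actually fails.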