Open jin-yc10 opened 8 months ago
Yes, NaN values appear when DCNv4 is used in place of the normal convolutional downsampling operation. That is the problem I encountered.
How did you solve this problem?
I also encountered this problem. Is there a solution?
I encountered a NaN loss when training a classification task on ImageNet.
CUDA: 11.6, Torch: 1.12.1+cu116, timm: 0.6.11
Any help is appreciated! Thanks