I tried to train the efficientnetb0 model on my own dataset, but got a loss of 0.0. After some digging, I found that the output of the forward function was just a tensor of NaNs. I also tried the code for efficientnetb1, and it works flawlessly. I can't figure out what causes this bug. What could I try?
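One thing that could be tried is finding the first layer whose output turns into NaNs. The post does not say which framework is used, so the sketch below assumes the model is a PyTorch `nn.Module`. The helper name `find_first_nan_module` is hypothetical and not part of any library. It registers a forward hook on every leaf module, runs one forward pass, and reports the first module whose output contains a NaN.

```python
import torch
import torch.nn as nn


def find_first_nan_module(model: nn.Module, example_input: torch.Tensor):
    """Return the name of the first leaf module whose output contains NaN, or None.

    Hypothetical debugging helper; assumes the model takes a single tensor input.
    """
    first_bad = []  # holds at most one module name
    handles = []

    def make_hook(name):
        def hook(module, inputs, output):
            # Only record the first offending module, and only check plain tensors.
            if not first_bad and isinstance(output, torch.Tensor):
                if torch.isnan(output).any():
                    first_bad.append(name)
        return hook

    # Hook leaf modules only, so the reported name is as specific as possible.
    for name, module in model.named_modules():
        if len(list(module.children())) == 0:
            handles.append(module.register_forward_hook(make_hook(name)))

    try:
        # The forward pass runs in the model's current train/eval mode.
        # Note: in train mode, BatchNorm layers still update their running stats.
        with torch.no_grad():
            model(example_input)
    finally:
        # Always remove the hooks so the model is left unchanged.
        for handle in handles:
            handle.remove()

    return first_bad[0] if first_bad else None
```

If the reported module is the very first layer, the input batch or the loaded weights are worth checking for NaNs next, for example with `torch.isnan(p).any()` over `model.named_parameters()`.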