Hello, I will replace the backbone network of RESNET with your resnest. During the training process, the loss value decreases very slowly (LR here is the same as that of RESNET). Similarly, in 20 epoches, the loss of RESNET is 0.06 and that of resnest is about 0.5. What's going on next?
Hello, I will replace the backbone network of RESNET with your resnest. During the training process, the loss value decreases very slowly (LR here is the same as that of RESNET). Similarly, in 20 epoches, the loss of RESNET is 0.06 and that of resnest is about 0.5. What's going on next?