Closed zh-jp closed 7 months ago
Thank you for your great work! I find the backone in your code isn't the general Resnet. They are very different from the general.
And I try to replace resnet32 in the paper mentioned with resnet34 but the loss cannot converge and turn to nan finally.
This is bash command I tried (resnet32 has been changed to resnet34 realized by torchvision)
python cifar_train.py --arch resnet32 --gpu 0 --imb_type exp --imb_factor 0.01 --loss_type LDAM --train_rule DRW
Could you please provide further explanation?
My apologize... the loss to become 'nan' is occured by lacking of class NormedLinear(nn.Module). And the paper said but I didn't notice:
class NormedLinear(nn.Module)
Thank you for your great work! I find the backone in your code isn't the general Resnet. They are very different from the general.
And I try to replace resnet32 in the paper mentioned with resnet34 but the loss cannot converge and turn to nan finally.
This is bash command I tried (resnet32 has been changed to resnet34 realized by torchvision)
Could you please provide further explanation?