kaidic / LDAM-DRW

[NeurIPS 2019] Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss
https://arxiv.org/pdf/1906.07413.pdf
MIT License

Using a general ResNet causes the loss to become 'nan' #20

Closed zh-jp closed 7 months ago

zh-jp commented 7 months ago

Thank you for your great work! I noticed that the backbone in your code isn't a general ResNet; it differs considerably from the standard version.

I tried replacing the resnet32 mentioned in the paper with resnet34, but the loss does not converge and eventually becomes nan.

This is the bash command I tried (resnet32 was swapped for the resnet34 implemented in torchvision):

python cifar_train.py --arch resnet32 --gpu 0 --imb_type exp --imb_factor 0.01 --loss_type LDAM --train_rule DRW

Could you please provide further explanation?

zh-jp commented 7 months ago

My apologies... the loss becoming 'nan' was caused by the missing NormedLinear(nn.Module) class. The paper states this, but I didn't notice it: [screenshot of the relevant passage from the paper]
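For context, the repository's classifier head L2-normalizes both the features and the weight matrix, so the logits are cosine similarities bounded in [-1, 1]; the LDAM loss subtracts per-class margins from these bounded logits and rescales them by a constant. A plain nn.Linear head from torchvision's ResNet produces unbounded logits, which can make the loss diverge to nan. A minimal sketch of such a normalized head, modeled on the repo's NormedLinear (treat the initialization details as an approximation):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class NormedLinear(nn.Module):
    """Cosine-similarity classifier head.

    Both the input features and the weight columns are L2-normalized,
    so each output logit is a cosine similarity in [-1, 1]. LDAM then
    applies class-dependent margins to these bounded logits and scales
    them by a constant factor (e.g. s = 30) before the softmax.
    """

    def __init__(self, in_features, out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.Tensor(in_features, out_features))
        # Initialize each weight column to (approximately) unit L2 norm:
        # renorm_ caps each column's norm at 1e-5, mul_ rescales to ~1.
        self.weight.data.uniform_(-1, 1).renorm_(2, 1, 1e-5).mul_(1e5)

    def forward(self, x):
        # Normalize features row-wise and weights column-wise, then take
        # the dot product -> a matrix of cosine similarities.
        return F.normalize(x, dim=1).mm(F.normalize(self.weight, dim=0))
```

So when substituting a torchvision backbone, the final fully connected layer also needs to be replaced with a head like this for the LDAM loss to train stably.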