I think the Resnet used in UCIR's code is more standard, so I run the experiment on cifar100(5 step) using your ucir_resnet.py, the result is worse than before a lot. Do I need to modify some other configurations to get a better result?
that is what I modify on your ucir_resnet.py
def forward(self, x, **kwargs):
x = self.conv1(x)
x = self.bn1(x)
x = self.relu(x)
fea1 = self.layer1(x)
fea2 = self.layer2(fea1)
fea3 = self.layer3(fea2)
attentions = [fea1, fea2, fea3]
raw_features = self.end_features(fea3)
features = self.end_features(F.relu(fea3, inplace=False))
return {"raw_features": raw_features, "features": features, "attention": attentions}
def end_features(self, x):
x = self.avgpool(x)
x = x.view(x.size(0), -1)
return x
I think the Resnet used in UCIR's code is more standard, so I run the experiment on cifar100(5 step) using your ucir_resnet.py, the result is worse than before a lot. Do I need to modify some other configurations to get a better result?
that is what I modify on your ucir_resnet.py