Open wangwuwu opened 5 months ago
x4 = self.squeeze_module(x4): RuntimeError: Given groups=1, weight of size [64, 3712, 3, 3], expected input[4, 1856, 128, 128] to have 3712 channels, but got 1856 channels instea
Hi, sorry for that. There are indeed still some bugs in using ConvNets, I'll try to spare time to fix them. You can first try the vision transformer backbones.
x4 = self.squeeze_module(x4): RuntimeError: Given groups=1, weight of size [64, 3712, 3, 3], expected input[4, 1856, 128, 128] to have 3712 channels, but got 1856 channels instea