Is it feasible to use resnet as student network and CLIP as teacher network? Their output tensors are different. Is it feasible for me to reshape?
My code is as follows:
z = self.encoder_q(img) //encode is resnet network
print("z0的shape=")
print(z.shape)# //[16,128] ,16 is batchsize
z=z.unsqueeze(2)
print("z1的shape=")
print(z.shape) //[16,128,1]
z=self.decoder(z) //一个卷积层,
print("z2的shape=")
print(z.shape) //[16,38400,1]
z=z.view(8, 50, 768) //Reshape is used here
Is it feasible to use resnet as student network and CLIP as teacher network? Their output tensors are different. Is it feasible for me to reshape? My code is as follows: z = self.encoder_q(img) //encode is resnet network print("z0的shape=") print(z.shape)# //[16,128] ,16 is batchsize z=z.unsqueeze(2) print("z1的shape=") print(z.shape) //[16,128,1] z=self.decoder(z) //一个卷积层, print("z2的shape=") print(z.shape) //[16,38400,1] z=z.view(8, 50, 768) //Reshape is used here
x_rec = self.decoder(z)