I would like to extract the "embedding" layer of the VGG network implemented in models.
By example, in the case of for resnet-18 for images, I would take the avgpool like
model = models.resnet18(pretrained=True)
layer = model._modules.get('avgpool')
self.layer_output_size = 512
I would like to extract the "embedding" layer of the VGG network implemented in models. By example, in the case of for
resnet-18
for images, I would take theavgpool
likeIs that correct for VGGSound?
Thank you.