I tried running the feature_extractor, but the output size of region_vectors from layer3 is incompatible with region_vectors from other layers. My input is size [64, 3, 240, 320], region_vector of layer 1 has size[64, 256, 3, 4], layer2 [64, 512, 3, 4], layer 3 [64, 1024, 4, 5],
layer4 [64, 2048, 3, 4]. Could you provide more information on the settings on how the extracted frames are preprocessed before extracting the resnet features? Thank you!
I tried running the feature_extractor, but the output size of region_vectors from layer3 is incompatible with region_vectors from other layers. My input is size [64, 3, 240, 320], region_vector of layer 1 has size[64, 256, 3, 4], layer2 [64, 512, 3, 4], layer 3 [64, 1024, 4, 5], layer4 [64, 2048, 3, 4]. Could you provide more information on the settings on how the extracted frames are preprocessed before extracting the resnet features? Thank you!