Hi, thank you very much for sharing your code! Quick question about I3D features.
You mention that 'I3D takes non-overlapping 16-frame chuncks as input for both two stream' in 4.2 Implementatino Details. When I borrow the released codes in https://github.com/piergiaj/pytorch-i3d for extracting I3D features, I notice that the I3D network has three layers (Conv3d_1a_7x7, MaxPool3d_4a_3x3, MaxPool3d_5a_2x2) whose stride is 2. Thus, the final features I get are sampled by 8. I wonder if I missed some processes and how to get the features sampled by 16.
is:issue is:open
Hi, thank you very much for sharing your code! Quick question about I3D features.
You mention that 'I3D takes non-overlapping 16-frame chuncks as input for both two stream' in 4.2 Implementatino Details. When I borrow the released codes in https://github.com/piergiaj/pytorch-i3d for extracting I3D features, I notice that the I3D network has three layers (Conv3d_1a_7x7, MaxPool3d_4a_3x3, MaxPool3d_5a_2x2) whose stride is 2. Thus, the final features I get are sampled by 8. I wonder if I missed some processes and how to get the features sampled by 16.