Number of frame - Githubissues

noureldien / timeception

Timeception for Complex Action Recognition, CVPR 2019 (Oral Presentation)

https://noureldien.com/research/timeception/

GNU General Public License v3.0

157 stars, 33 forks

Number of frame #8

Closed. CuongNguyen218 closed this issue 5 years ago

CuongNguyen218 commented 5 years ago

Hi Noureldien, I have a question about R3D (3D ResNet). When I apply 3D ResNet to an input of size T x H x W x C, the time dimension T always shrinks after some layers. What value of T do you set at the input so that the output of the 3D ResNet is 1024 x H x W x C?

noureldien commented 5 years ago

Hi,

Thanks for your question. One Timeception layer downsamples the timesteps by a factor of two. So, to get an output with T=1, you can use one Timeception layer with input T=2, or two Timeception layers with input T=4. Please run the code and experiment with it. It is easy to work out.

Thanks
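The arithmetic in the reply above can be checked with a small sketch. It does not use the Timeception repository's code; it only assumes, as the reply states, that each Timeception layer divides the number of timesteps by two. The function names `timesteps_after` and `required_input_timesteps` are hypothetical helpers introduced here for illustration, and the sketch further assumes T is evenly divisible at every layer.

```python
def timesteps_after(n_layers: int, t_in: int, factor: int = 2) -> int:
    """Hypothetical helper: timesteps left after n_layers layers,
    assuming each layer downsamples the temporal dimension by `factor`."""
    t = t_in
    for layer in range(n_layers):
        if t % factor != 0:
            # Assumption of this sketch: only exact divisions are allowed.
            raise ValueError(
                f"T={t} is not divisible by {factor} at layer {layer + 1}"
            )
        t //= factor
    return t


def required_input_timesteps(n_layers: int, t_out: int = 1, factor: int = 2) -> int:
    """Hypothetical helper: input T needed to end up with t_out timesteps."""
    return t_out * factor ** n_layers


# The two cases mentioned in the reply:
print(timesteps_after(1, 2))        # one layer, input T=2
print(timesteps_after(2, 4))        # two layers, input T=4
print(required_input_timesteps(2))  # input T needed for two layers and output T=1
```

Under these assumptions, n layers with output T=1 need an input of T = 2^n timesteps. The actual tensor shapes still depend on the backbone and the layer configuration, so running the repository code remains the reliable check.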