Hi, I have a question about the feature extractor in this paper.
Assume one video whose temporal length is T.
Do you first split it into 32 segments, as the paper says, so that segment_length is T//32?
And do you then run the feature extractor over each segment with a sliding window of step 1? In that case each segment would yield T//32-15 features. Are those then merged into one feature per segment?
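
To make the arithmetic concrete, here is a minimal sketch of how I understand it. The 16-frame clip length is my assumption (it is what would give the -15), averaging is only my guess at what "merge" means, and the function names are made up for illustration, not taken from the paper or its code.

```python
def windows_per_segment(T, num_segments=32, clip_len=16, stride=1):
    """Count sliding-window clips that fit in one segment.

    Assumption: each feature is extracted from a clip of `clip_len`
    consecutive frames, and windows do not cross segment boundaries.
    """
    segment_length = T // num_segments
    if segment_length < clip_len:
        return 0  # segment too short to hold even one clip
    return (segment_length - clip_len) // stride + 1


def merge_features(clip_features):
    """Merge per-clip feature vectors into one by element-wise mean.

    Assumption: "merge" means averaging; the paper may do otherwise.
    """
    n = len(clip_features)
    dim = len(clip_features[0])
    return [sum(f[i] for f in clip_features) / n for i in range(dim)]


# Example: a video with T = 3200 frames
T = 3200
count = windows_per_segment(T)          # segment_length = 100
assert count == T // 32 - 15            # 100 - 16 + 1 = 85
```

With step 1 and 16-frame clips, the count per segment is segment_length - 16 + 1, which is the T//32-15 I mention above.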
Looking forward to your reply