happyharrycn / actionformer_release

Code release for ActionFormer (ECCV 2022)
MIT License
415 stars 77 forks source link

Can we apply actionformer to video streams? #125

Closed zwfcrazy closed 8 months ago

zwfcrazy commented 9 months ago

As mentioned in the paper, for inference, the whole video clip is fed into the network because no position encoding is added. What if the input is a non-stopping video stream? Can we make some adjustments to actionformer, like adding position encoding? Will it work?

tzzcl commented 9 months ago

Hello, I think they may refer to online action recognition/detection methods.