happyharrycn / actionformer_release

Code release for ActionFormer (ECCV 2022)
MIT License
415 stars 77 forks source link

About Multi Head Conv Attention #112

Closed ddz16 closed 1 year ago

ddz16 commented 1 year ago

Hello, I would like to know the reason why you use Multi-Head Conv Attention instead of Multi-Head Attention. It seems that the current parameter configuration (n_qx_stride=1, n_kv_stride=1) does not reflect the effect of MHCA.

image

happyharrycn commented 1 year ago

While the default parameters for this class are n_qx_stride=1, n_kv_stride=1, these parameters are overwritten based on the model config file. See the code here and here

ddz16 commented 1 year ago

Thanks a lot.