Hello, I would like to know why you use Multi-Head Conv Attention (MHCA) instead of standard Multi-Head Attention. With the current parameter configuration (n_qx_stride=1, n_kv_stride=1), the convolutional attention does not seem to have any effect.
While the default parameters for this class are n_qx_stride=1 and n_kv_stride=1, they are overwritten based on the model config file. See the code here and here.
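To make the role of the strides concrete, here is a minimal NumPy sketch of a strided 1-D convolutional projection, the kind MHCA typically applies to the query (and key/value) inputs before attention. All names and shapes below are illustrative assumptions, not the repository's actual implementation: with stride 1 the projection preserves the sequence length (behaving like a standard linear projection with local context), while a stride greater than 1 downsamples the sequence, which is where MHCA differs from plain MHA.

```python
import numpy as np

def strided_conv1d(x, w, stride):
    """Hypothetical 1-D convolution over time with 'same'-style padding.

    x: (T, d_in) input sequence, w: (k, d_in, d_out) conv kernel.
    Returns a (ceil(T / stride), d_out) array: stride=1 keeps the
    sequence length, stride>1 downsamples it.
    """
    k = w.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))          # pad along time only
    t_out = (x.shape[0] + stride - 1) // stride   # ceil(T / stride)
    return np.stack([
        # each output step mixes a local window of k time steps
        np.tensordot(xp[t * stride:t * stride + k], w, axes=([0, 1], [0, 1]))
        for t in range(t_out)
    ])

rng = np.random.default_rng(0)
T, d = 8, 4
x = rng.standard_normal((T, d))
w = rng.standard_normal((3, d, d))

q_stride1 = strided_conv1d(x, w, stride=1)  # shape (8, 4): length preserved
q_stride2 = strided_conv1d(x, w, stride=2)  # shape (4, 4): queries downsampled 2x
```

So with n_qx_stride=1 and n_kv_stride=1 the convolutional projections leave the sequence length untouched, and the layer behaves much like ordinary multi-head attention; the downsampling effect of MHCA only appears when the config file sets a stride greater than 1.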