Closed by triton99 3 months ago
Hi @X-Lai, thanks for sharing this great work!
What is the purpose of `attn_masks` in your transformer decoder? In your paper, you describe the model as a mask-attention-free transformer.
https://github.com/dvlab-research/Mask-Attention-Free-Transformer/blob/4b5048c0e08c2fc42f660dfea3209043179ace1b/maft/model/transformer.py#L108-L115
Thank you.
If I understand correctly, only the first cross-attention layer is unmasked; all subsequent layers use the attention masks.
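For readers skimming the thread, here is a minimal sketch (not the repo's actual code) of the pattern described above: the first cross-attention layer attends to all points with `attn_mask=None`, while each later layer restricts attention using a mask predicted from the previous layer's queries. Names such as `pred_to_attn_mask` and `mask_head` are hypothetical.

```python
import torch
import torch.nn as nn

num_layers, num_queries, num_points, d_model, nhead = 3, 8, 128, 32, 4

cross_attn_layers = nn.ModuleList(
    nn.MultiheadAttention(d_model, nhead, batch_first=True)
    for _ in range(num_layers)
)
mask_head = nn.Linear(d_model, d_model)  # hypothetical per-query mask predictor


def pred_to_attn_mask(queries, feats):
    # Boolean mask with PyTorch convention: True = position is NOT attended.
    logits = torch.einsum("bqc,bpc->bqp", mask_head(queries), feats)
    mask = logits.sigmoid() < 0.5
    # Repeat per head: MultiheadAttention expects shape (batch*nhead, Q, P).
    mask = mask.repeat_interleave(nhead, dim=0)
    # Guard: if a query would mask out everything, let it attend everywhere.
    mask[mask.all(dim=-1)] = False
    return mask


queries = torch.randn(1, num_queries, d_model)
feats = torch.randn(1, num_points, d_model)

attn_mask = None  # first layer: unmasked cross-attention
for layer in cross_attn_layers:
    queries, _ = layer(queries, feats, feats, attn_mask=attn_mask)
    attn_mask = pred_to_attn_mask(queries, feats)  # consumed by the next layer
```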