pengzhangzhi / Awesome-Mamba

Awesome list of papers that extend Mamba to various applications.
110 stars 8 forks source link

The Hidden Attention of Mamba Models #7

Open lkenn012 opened 3 months ago

lkenn012 commented 3 months ago

Thank you for creating this excellent resource for the Mamba architecture.

Here is a recent paper investigating the interpretability of these models, analogous to the attention mechanism in Transformers. I think it is highly relevant for understanding the mechanisms of Mamba models. https://arxiv.org/pdf/2403.01590.pdf