-
Would it be possible to add functionality for **Grad-CAM** or **attention maps** similar to those used in DINO?
Thank you!
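For reference, a minimal sketch of DINO-style [CLS] attention maps, assuming the facebookresearch/dino hub model and its `get_last_selfattention` helper; the input tensor and sizes below are placeholders:

```python
# Sketch: DINO-style [CLS] attention maps via the hub model's
# get_last_selfattention helper (from facebookresearch/dino).
import torch
import torch.nn.functional as F

model = torch.hub.load("facebookresearch/dino:main", "dino_vits16")
model.eval()

patch_size = 16
img = torch.randn(1, 3, 224, 224)  # placeholder; use a real, normalized image

with torch.no_grad():
    attn = model.get_last_selfattention(img)  # (1, heads, tokens, tokens)

heads = attn.shape[1]
w = h = 224 // patch_size  # 14x14 patch grid
# attention of the [CLS] token (index 0) to every patch token
cls_attn = attn[0, :, 0, 1:].reshape(heads, w, h)
# upsample each head's map to image resolution for overlaying
maps = F.interpolate(cls_attn.unsqueeze(0), scale_factor=patch_size,
                     mode="nearest")[0]  # (heads, 224, 224)
```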
-
### ⚠️ Please check that this feature request hasn't been suggested before.
- [X] I searched previous [Ideas in Discussions](https://github.com/axolotl-ai-cloud/axolotl/discussions/categories/ideas) …
-
### Feature request
Hi, in the Medusa paper, they adopt tree attention and use typical sampling to increase the speedup, but in the current code base, I think it only uses argmax() and no tree attention. …
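For context, a minimal sketch of the typical-acceptance rule from the Medusa paper (accept a candidate token when its probability clears an entropy-dependent threshold, rather than taking the argmax); the `epsilon`/`delta` values are illustrative, not taken from this code base:

```python
# Sketch of Medusa-style typical acceptance for a single candidate token.
import torch

def typical_accept(logits: torch.Tensor, candidate_id: int,
                   epsilon: float = 0.3, delta: float = 0.09) -> torch.Tensor:
    """Accept the candidate if p(candidate) >= min(epsilon, delta * exp(-H(p))),
    where H(p) is the entropy of the next-token distribution.
    logits: (vocab_size,) for one position; epsilon/delta are illustrative."""
    probs = torch.softmax(logits, dim=-1)
    entropy = -(probs * torch.log(probs + 1e-9)).sum()
    threshold = torch.minimum(torch.tensor(epsilon), delta * torch.exp(-entropy))
    return probs[candidate_id] >= threshold
```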
-
Hello, I have recently implemented a cross-attention application with multi-modal fusion, but because the image resolution is very large, a CUDA OOM occurs when computing the attention between q and k, so I found your paper…
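As a point of comparison, a minimal sketch that sidesteps this OOM by never materializing the full attention-score matrix, using PyTorch's fused `scaled_dot_product_attention` (which can dispatch to FlashAttention-style kernels); all shapes here are illustrative:

```python
# Sketch: fused attention that never builds the (Lq x Lkv) score matrix in
# HBM, via torch.nn.functional.scaled_dot_product_attention (PyTorch >= 2.0).
import torch
import torch.nn.functional as F

B, H, Lq, Lkv, D = 1, 8, 64_000, 64_000, 64  # e.g. tokens from a high-res image
q = torch.randn(B, H, Lq, D, device="cuda", dtype=torch.float16)
k = torch.randn(B, H, Lkv, D, device="cuda", dtype=torch.float16)
v = torch.randn(B, H, Lkv, D, device="cuda", dtype=torch.float16)

# dispatches to a flash / memory-efficient kernel when available
out = F.scaled_dot_product_attention(q, k, v)  # (B, H, Lq, D)
```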
-
Will there be 3090 support in Flash Attention 3 in the future?
-
"We do not have so many requests, actually.
We also have some internal discussions, but there are a lot of alternatives for the faster (lightweight) encoder and Squeezeformer does not come to a hig…
-
Maybe it is too niche, but we would be interested in a fused circular windowed attention.
Currently, we pad q, k, v, use the fused kernel, and crop (a sketch of this workaround follows below).
It would either help if k, v could have different di…
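A minimal sketch of that pad-then-crop workaround, assuming flash-attn 2's sliding-window API (`flash_attn_func` with `window_size`); a natively circular fused kernel would avoid the wasted compute on the padded queries:

```python
# Sketch: circular windowed attention emulated by wrap-padding q, k, v,
# running flash-attn's sliding-window kernel, and cropping the output.
import torch
from flash_attn import flash_attn_func  # flash-attn >= 2.3 for window_size

def circular_window_attn(q, k, v, w):
    """q, k, v: (B, L, H, D) fp16/bf16 CUDA tensors; every query attends to
    the w positions on each side, wrapping around the sequence ends."""
    wrap = lambda t: torch.cat([t[:, -w:], t, t[:, :w]], dim=1)  # circular pad
    out = flash_attn_func(wrap(q), wrap(k), wrap(v), window_size=(w, w))
    return out[:, w:-w]  # drop the padded queries
```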
-
Hi, I have an attention_mask mismatch problem in the cross attention.
Can you please explain this line:
`requires_attention_mask = "encoder_outputs" not in model_kwargs`?
Why does it come after this:
…
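A hedged illustration (not the transformers source) of what that line gates: when the caller already passes precomputed `encoder_outputs`, `generate()` cannot reliably infer an encoder attention mask from `input_ids`, so it skips building the default mask and expects `attention_mask` to be passed explicitly:

```python
# Sketch (not the transformers source) of the default-mask logic around
# `requires_attention_mask = "encoder_outputs" not in model_kwargs`.
import torch

def prepare_default_mask(input_ids, pad_token_id, model_kwargs):
    # With precomputed encoder_outputs there may be no encoder input_ids to
    # infer a mask from, so generate() does not require (or build) one.
    requires_attention_mask = "encoder_outputs" not in model_kwargs
    if requires_attention_mask and model_kwargs.get("attention_mask") is None:
        # default: attend to real tokens, ignore padding
        model_kwargs["attention_mask"] = (input_ids != pad_token_id).long()
    return model_kwargs
```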
-