-
### 🚀 The feature, motivation and pitch
Flash Attention 3 (https://github.com/Dao-AILab/flash-attention) has been in beta for some time. I tested it on H100 GPUs with CUDA 12.3 and also attempted a…
-
With https://github.com/tenstorrent/tt-metal/pull/12309, causal SDPA no longer accepts an attention mask. It instead generates its own causal mask. The PR only removed the attention mask from calls to…
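Not tt-metal's API, but the same pattern can be illustrated with PyTorch's SDPA: with `is_causal=True` the kernel builds the causal mask internally, matching what passing an explicit lower-triangular mask would produce (shapes below are made up for illustration).
```python
import torch
import torch.nn.functional as F

# Illustration with PyTorch's SDPA, not tt-metal's API.
q = torch.randn(1, 8, 128, 64)
k = torch.randn(1, 8, 128, 64)
v = torch.randn(1, 8, 128, 64)

# Causal mask generated internally; no attn_mask tensor is passed.
out_internal = F.scaled_dot_product_attention(q, k, v, is_causal=True)

# Equivalent result with an explicit lower-triangular boolean mask,
# i.e. the pattern the PR removes from the causal path.
causal_mask = torch.tril(torch.ones(128, 128, dtype=torch.bool))
out_explicit = F.scaled_dot_product_attention(q, k, v, attn_mask=causal_mask)

assert torch.allclose(out_internal, out_explicit, atol=1e-6)
```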
-
Is there any way to add Flash Attention 2 support for this model? If there is a way to do it, I would love to get involved and help out!
I've tried to implement it by looking at [MusicGen's one](https://git…
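For reference (not confirmed for this model), this is how Flash Attention 2 is normally enabled in transformers for architectures that already support it; the checkpoint name below is a placeholder:
```python
import torch
from transformers import AutoModelForCausalLM

# Placeholder checkpoint; this only works for architectures
# that already have Flash Attention 2 support in transformers.
model = AutoModelForCausalLM.from_pretrained(
    "some-org/some-model",
    torch_dtype=torch.float16,
    attn_implementation="flash_attention_2",
    device_map="cuda",
)
```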
-
### 🐛 Describe the bug
When I use flex attention on a single RTX 4090, I get an error.
A minimal repro:
```python
import torch
from torch.nn.attention.flex_attention import flex_attention
flex_at…
```
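Since the repro above is cut off, here is a minimal sketch of what a flex_attention call generally looks like; the shapes, dtype, and identity score_mod are assumptions, not the original repro:
```python
import torch
from torch.nn.attention.flex_attention import flex_attention

# Assumed shapes: (batch, heads, seq_len, head_dim); the original repro is truncated.
q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

def identity_score_mod(score, b, h, q_idx, kv_idx):
    # No-op score modification, just to exercise the API.
    return score

out = flex_attention(q, k, v, score_mod=identity_score_mod)
print(out.shape)  # torch.Size([1, 8, 128, 64])
```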
-
Attention Markers can be incorrect at times because the prediction is not simulated the same way it actually plays out in real time. Things like PlanningView and OnDrawSelected tend to find targets b…
-
Where in the network did you place the attention module in your experiments?
-
# 🚀 Feature
Support Flash Attention 3
## Motivation
Flash Attention 3 has been shown to be significantly faster than Flash Attention 2 on H100 GPUs.
## Pitch
Offer Flash Attention 3 support
-
### The model to consider.
https://huggingface.co/dunzhang/stella_en_1.5B_v5
```python
last_hidden_state = model(**input_data)[0]
```
In the model's `__init__`:
```python
vector_linear = torch.nn.Linear(in_features=model.conf…
```
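For context, a minimal sketch of the pattern those lines describe: pooling the last hidden state and projecting it through `vector_linear`. The dimensions here are assumptions for illustration, not values read from the model config:
```python
import torch

# Assumed sizes, not taken from the actual model config.
hidden_size, vector_dim = 1536, 1024
vector_linear = torch.nn.Linear(in_features=hidden_size, out_features=vector_dim)

# last_hidden_state: (batch, seq_len, hidden_size); attention_mask: (batch, seq_len)
last_hidden_state = torch.randn(2, 8, hidden_size)
attention_mask = torch.ones(2, 8)

# Mean-pool over non-padding tokens, then project to the embedding dimension.
pooled = (last_hidden_state * attention_mask.unsqueeze(-1)).sum(1) / attention_mask.sum(1, keepdim=True)
embeddings = vector_linear(pooled)  # (batch, vector_dim)
```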
-
Great job! I wonder if you could kindly open-source the code for visualizing the attention map? Looking forward to your response! Thanks so much!
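Not the authors' code, but a generic sketch of how an attention map is often rendered as a heatmap; the weights below are random placeholders:
```python
import torch
import matplotlib.pyplot as plt

# Fake attention weights for illustration: (query_len, key_len), rows sum to 1.
attn = torch.softmax(torch.randn(10, 10), dim=-1)

plt.imshow(attn.numpy(), cmap="viridis")
plt.colorbar(label="attention weight")
plt.xlabel("key position")
plt.ylabel("query position")
plt.title("Attention map")
plt.savefig("attention_map.png", dpi=150)
```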
-
### 🚀 The feature, motivation and pitch
I am working on adjustments to radix attention now. Thank you for your support of radix attention. Currently, caching for A that allows for more efficien…