-
We recently released [FlexAttention](https://pytorch.org/blog/flexattention/), which automatically generates fused FlashAttention kernels for a diverse range of attention variants.
For example, the…
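A minimal sketch of what a FlexAttention call can look like, assuming a recent PyTorch (2.5+) where `torch.nn.attention.flex_attention` is available; the causal `score_mod` below is just one illustrative variant:
```
import torch
from torch.nn.attention.flex_attention import flex_attention

def causal_score_mod(score, b, h, q_idx, kv_idx):
    # Keep scores for positions the query may attend to; send future
    # positions to -inf before the softmax.
    return torch.where(q_idx >= kv_idx, score, -float("inf"))

q, k, v = (torch.randn(1, 8, 1024, 64, device="cuda", dtype=torch.float16)
           for _ in range(3))

# flex_attention is typically wrapped in torch.compile to get the fused kernel.
out = torch.compile(flex_attention)(q, k, v, score_mod=causal_score_mod)
```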
-
We aim to reach 80%+ of XeTLA performance,
using `python/tutorials/06-fused-attention.py` as the test case (a rough benchmarking sketch follows the issue list below).
- #912
- #913
- #914
- #915
- #916
- #917
- #1102
- #1103
- #1192
(batch head n_ctx d…
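A rough sketch of how such a number is typically measured, using the standard attention FLOP count and `triton.testing.do_bench`; the shapes and the use of torch SDPA as the timed function are placeholders, not this issue's exact setup:
```
import torch
import triton

BATCH, H, N_CTX, D_HEAD = 4, 48, 4096, 64  # placeholder shapes
q, k, v = (torch.randn(BATCH, H, N_CTX, D_HEAD, device="cuda", dtype=torch.float16)
           for _ in range(3))

# Median runtime in ms of the attention forward pass under test.
ms = triton.testing.do_bench(
    lambda: torch.nn.functional.scaled_dot_product_attention(q, k, v))

# Two batched matmuls (QK^T and P@V): 4 * B * H * N_CTX^2 * D_HEAD FLOPs.
flops = 4 * BATCH * H * N_CTX * N_CTX * D_HEAD
print(f"{flops / (ms * 1e-3) / 1e12:.1f} TFLOP/s")
```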
-
### Request description
An end-to-end (E2E) test suite for Attention that includes a reference implementation.
### What component(s) does this issue relate to?
Compiler
### Additional context
I ra…
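A minimal sketch of what such an E2E test could look like; `fused_attention` is a placeholder for the kernel under test, and the device and tolerances are illustrative:
```
import math
import torch

def reference_attention(q, k, v):
    # Plain softmax(QK^T / sqrt(d)) @ V reference, computed in fp32.
    scores = (q.float() @ k.float().transpose(-2, -1)) / math.sqrt(q.shape[-1])
    return (scores.softmax(dim=-1) @ v.float()).to(q.dtype)

def test_attention(fused_attention, shape=(2, 8, 1024, 64), dtype=torch.float16):
    device = "cuda"  # or "xpu" on the Intel backend
    q, k, v = (torch.randn(*shape, device=device, dtype=dtype) for _ in range(3))
    out = fused_attention(q, k, v)
    torch.testing.assert_close(out, reference_attention(q, k, v), atol=2e-2, rtol=0)
```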
-
At what position in the network was the attention module added when you conducted the experiments?
-
My device is a 4090 on the Hopper architecture, consistent with the H100 architecture. But the homepage says "Requirements: H100 / H800 GPU, CUDA >= 12.3."
I would like to know if flash attentio…
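For reference, the architecture can be checked directly: an RTX 4090 is Ada Lovelace and reports compute capability (8, 9), while H100/H800 are Hopper and report (9, 0):
```
import torch

major, minor = torch.cuda.get_device_capability()
print(torch.cuda.get_device_name(), (major, minor))
print("Hopper (sm_90):", (major, minor) == (9, 0))  # False on a 4090 (sm_89, Ada)
```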
-
The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask`…
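A minimal sketch of silencing that warning by passing the tokenizer's mask through to `generate`; the model name is a placeholder:
```
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "gpt2"  # placeholder model
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

inputs = tokenizer("Hello, world", return_tensors="pt")
out = model.generate(
    input_ids=inputs["input_ids"],
    attention_mask=inputs["attention_mask"],  # pass the mask explicitly
    pad_token_id=tokenizer.eos_token_id,      # make padding unambiguous
    max_new_tokens=20,
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```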
-
We do not have grouped attention for now, @rosenrodt. @asroy Do we need instances for grouped bmm+softmax+gemm+permute?
_Originally posted by @shaojiewang in https://github.com/ROCmSoftwarePlatform/composa…
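For context, the pattern under discussion (batched GEMM, softmax, GEMM, then a layout permute) looks roughly like this in plain PyTorch; the shapes are illustrative:
```
import torch

B, H, M, N, K, O = 2, 8, 512, 512, 64, 64  # illustrative sizes
q = torch.randn(B * H, M, K)
k = torch.randn(B * H, N, K)
v = torch.randn(B * H, N, O)

s = torch.bmm(q, k.transpose(1, 2)) * K ** -0.5  # batched GEMM: (B*H, M, N)
p = s.softmax(dim=-1)                            # softmax over keys
o = torch.bmm(p, v)                              # batched GEMM: (B*H, M, O)
o = o.view(B, H, M, O).permute(0, 2, 1, 3)       # permute to (B, M, H, O)
```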
-
Hi @cubiq,
Since the `SD3 Attention Seeker L/G` node adjusts CLIP L and CLIP G, does that mean it could also work with SDXL?
I tried it and it does something, but I don't know if it's working p…
-
Hi, thank you for the interesting work :)
I was wondering how the "attention heatmap" in the paper was drawn.
If I have understood your method correctly, the learnable parameters are only added to …
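Not necessarily the paper's exact procedure, but a common way such heatmaps are drawn is to take the attention weights, average them over heads, and plot the matrix:
```
import matplotlib.pyplot as plt
import torch

attn = torch.rand(8, 64, 64)  # (heads, query_len, key_len), placeholder weights
heatmap = attn.mean(dim=0)    # average over heads

plt.imshow(heatmap.numpy(), cmap="viridis")
plt.xlabel("key position")
plt.ylabel("query position")
plt.colorbar()
plt.savefig("attention_heatmap.png")
```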
-
Repro
```
import flash_attn
import torch
from einops import rearrange

def snr(a: torch.Tensor, b: torch.Tensor):
    # Signal-to-noise ratio between two tensors; identical tensors give +inf.
    if torch.equal(a, b):
        return float("inf")
    if a.dtype == t…