-
Hello Author:
In the `CrossAttention` class in utils.py, there is only one input parameter, x, so it actually computes self-attention. Is your code inconsistent with what the paper describes?
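For context, here is a minimal sketch (my own illustration, not the repository's code) of the distinction being asked about: with a single input x, queries, keys, and values all come from x, which is self-attention; a cross-attention module would additionally accept a context tensor.

```python
import torch
import torch.nn as nn

class CrossAttention(nn.Module):
    """Hypothetical sketch: cross-attention when `context` is given, self-attention otherwise."""
    def __init__(self, dim, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x, context=None):
        # With only x, keys/values come from x itself -> self-attention.
        context = x if context is None else context
        out, _ = self.attn(x, context, context)
        return out

attn = CrossAttention(dim=64)
x = torch.randn(2, 10, 64)    # queries
ctx = torch.randn(2, 20, 64)  # separate context for keys/values
print(attn(x).shape, attn(x, ctx).shape)  # both torch.Size([2, 10, 64])
```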
-
All Flux models (dev, schnell, and fp8 versions) report this error during conversion, whether using Dynamic or Static conversion:
File "K:\ComfyUI_windows_portable\python_embeded\Lib\site-packages\…
-
Thanks for the incredibly clean repository!
I am Sayak from the [Diffusers](https://github.com/huggingface/diffusers) team at Hugging Face. My question is probably very naive, so I apologize for th…
-
In the Hugging Face "eager" Mistral implementation, a sliding window of size 2048 will mask 2049 tokens. This is also true for flash attention. In the current vLLM implementation, a window of 2048 will mas…
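For reference, a toy sketch (my own, with made-up sizes) of the off-by-one such reports usually come down to: for a sliding window of size W, does a query at position i keep W previous tokens plus itself (W + 1 positions) before masking the rest, or exactly W positions in total?

```python
import torch

seq_len, W = 8, 4                          # toy sizes, not the real 2048 window
i = torch.arange(seq_len).unsqueeze(1)     # query positions
j = torch.arange(seq_len).unsqueeze(0)     # key positions

keep_inclusive = (j <= i) & (j >= i - W)   # keeps i - W .. i      -> W + 1 visible tokens
keep_exclusive = (j <= i) & (j >  i - W)   # keeps i - W + 1 .. i  -> W visible tokens

print(keep_inclusive[-1].sum().item())     # 5
print(keep_exclusive[-1].sum().item())     # 4
```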
-
I have been informed that while Flash Attention is there, it's not being used -
https://github.com/oobabooga/text-generation-webui/issues/3759#issuecomment-2031180332
The post has a link to what has …
-
### 🐛 Describe the bug
I am running a FlexAttention operation and it returns different output shapes with and without compile. The correct output shapes are those returned without compile.
```P…
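A minimal shape-comparison sketch (my own, with made-up tensor sizes; assumes PyTorch ≥ 2.5 where flex_attention is available):

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

# Made-up sizes; run on a device/backend where compiled flex_attention is supported.
B, H, S, D = 2, 4, 128, 64
q, k, v = (torch.randn(B, H, S, D) for _ in range(3))

eager_out = flex_attention(q, k, v)                     # without compile
compiled_out = torch.compile(flex_attention)(q, k, v)   # with compile

# Both are expected to be (B, H, S, D); the report is that they differ.
print(eager_out.shape, compiled_out.shape)
```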
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
**Describe the bug**
I tried to use the LLaVA example and ran into a key-mismatch error. I am on the latest commit of the main branch (094d66b).
[rank0]: RuntimeError: Error(s) in loading state_dict for LLaVAMode…
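For what it's worth, this kind of error comes from strict state_dict loading: keys in the checkpoint don't line up with the module names in the model. A generic toy illustration (not the actual LLaVA model) of how to surface the mismatched keys instead of failing:

```python
import torch.nn as nn

# Checkpoint saved from a model whose parameter names differ from the target model.
saved = nn.Linear(4, 4)
state_dict = {f"proj.{k}": v for k, v in saved.state_dict().items()}  # keys: proj.weight, proj.bias

model = nn.Sequential(nn.Linear(4, 4))  # expects keys: 0.weight, 0.bias
# strict=True would raise "Error(s) in loading state_dict"; strict=False reports the mismatch.
result = model.load_state_dict(state_dict, strict=False)
print("missing keys:   ", result.missing_keys)     # ['0.weight', '0.bias']
print("unexpected keys:", result.unexpected_keys)  # ['proj.weight', 'proj.bias']
```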
-
Hi @cubiq,
Since the `SD3 Attention Seeker L/G` node adjusts CLIP L and CLIP G, does that mean it could also work with SDXL?
I tried it and it does something, but I don't know if it's working p…
-
Thank you for this amazing work!
I was wondering whether the FP8 implementation of FlashAttention-3 will be available for the public to use. My main concern is accuracy (block quant may have alleviated thi…
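Not FlashAttention-3's actual quantization scheme, but a generic sketch of why per-block scales tend to reduce FP8 error compared with a single per-tensor scale (assumes a PyTorch build with float8_e4m3fn):

```python
import torch

x = torch.randn(4096) * torch.rand(4096)   # values with varying magnitude
FP8_MAX = 448.0                             # largest magnitude representable in e4m3

def quant_dequant(t, scale):
    # Round-trip through float8_e4m3fn with the given scale(s).
    return (t / scale).to(torch.float8_e4m3fn).to(torch.float32) * scale

# One scale for the whole tensor.
per_tensor = quant_dequant(x, x.abs().max() / FP8_MAX)

# One scale per 128-element block.
blocks = x.view(-1, 128)
scales = blocks.abs().amax(dim=1, keepdim=True) / FP8_MAX
per_block = quant_dequant(blocks, scales).view(-1)

print("per-tensor error:", (x - per_tensor).abs().mean().item())
print("per-block error: ", (x - per_block).abs().mean().item())
```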