-
### Describe the bug
> RuntimeError: The size of tensor a (154) must match the size of tensor b (2304) at non-singleton dimension 1
### Reproduction
```python
# StableDiffusion3Pipeline
pipe.enab…
```
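For context, a minimal SD3 invocation looks roughly like the sketch below; the truncated `pipe.enab…` call from the report is unknown, and the model id and prompt here are only illustrative.

```python
import torch
from diffusers import StableDiffusion3Pipeline

# Illustrative setup only; the truncated `pipe.enab…` call from the report is
# unknown, so no memory-saving helper is shown here.
pipe = StableDiffusion3Pipeline.from_pretrained(
    "stabilityai/stable-diffusion-3-medium-diffusers",  # illustrative model id
    torch_dtype=torch.float16,
).to("cuda")

image = pipe(
    "a photo of an astronaut riding a horse",
    num_inference_steps=28,
).images[0]
```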
-
I reviewed the code of modeling_qwen.py and noticed that, within the lookahead process, the draft_ids matched from the TrieTree are such that the attention_mask and position ids associated with the…
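For illustration only (not the repo's code), here is a toy sketch of how position ids and an attention mask for trie-matched draft tokens appended after the prompt could be laid out; the `draft_parents` layout and the depth-based position scheme are assumptions, not taken from modeling_qwen.py.

```python
import torch

def draft_positions_and_mask(prompt_len: int, draft_parents: list):
    """Toy layout for draft tokens appended after the prompt.

    draft_parents[i] is the index of draft i's parent within the draft list,
    or -1 if its parent is the last prompt token; parents are assumed to be
    listed before their children. (Illustrative assumptions only.)
    """
    n = len(draft_parents)

    # Position id of each draft token: prompt length plus its depth in the trie.
    depth = [0] * n
    for i, p in enumerate(draft_parents):
        depth[i] = 0 if p < 0 else depth[p] + 1
    position_ids = torch.tensor([prompt_len + d for d in depth])

    # Each draft token may attend to the full prompt, its trie ancestors, and itself.
    mask = torch.zeros(n, prompt_len + n, dtype=torch.bool)
    mask[:, :prompt_len] = True
    for i, p in enumerate(draft_parents):
        mask[i, prompt_len + i] = True
        j = p
        while j >= 0:
            mask[i, prompt_len + j] = True
            j = draft_parents[j]
    return position_ids, mask
```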
-
## 🐞Describing the bug
Hello. I'm trying to convert a PyTorch model to a stateful CoreML model.
I wrote this code referring to the [WWDC 2024 session Mistral-7B model](https://github.com/huggingface/swift-t…
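For reference, a minimal stateful conversion sketch with coremltools 8, assuming the usual pattern from the coremltools docs (a registered buffer mutated in place and exposed via `ct.StateType`); the toy module and shapes below are made up, not the Mistral-7B code from the session.

```python
import torch
import coremltools as ct

class ToyCache(torch.nn.Module):
    """Toy module whose registered buffer becomes Core ML state (illustrative only)."""
    def __init__(self):
        super().__init__()
        self.register_buffer("k_cache", torch.zeros(1, 8, 64))

    def forward(self, x):
        # Mutate the buffer in place so the converter can treat it as state.
        self.k_cache.add_(x)
        return self.k_cache * 2.0

traced = torch.jit.trace(ToyCache().eval(), torch.zeros(1, 8, 64))

mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(shape=(1, 8, 64), name="x")],
    outputs=[ct.TensorType(name="y")],
    # The registered buffer is exposed as model state by name.
    states=[ct.StateType(wrapped_type=ct.TensorType(shape=(1, 8, 64)), name="k_cache")],
    minimum_deployment_target=ct.target.iOS18,
)
```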
-
Hello, could you share the code related to the missing "attention fusion" for reference?
-
I used [sd-perturbed-attention](https://github.com/pamparamm/sd-perturbed-attention) and had clear, mostly positive results with it. Integrated Perturbed-Attention Guidance was presented with limited …
-
Does fine-tuning support V100 servers with 8×32 GB? V100 doesn't natively support FlashAttention, right?
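FlashAttention indeed does not support V100 (compute capability 7.0, pre-Ampere). Assuming the fine-tuning stack sits on Hugging Face transformers, a common fallback is to request the standard attention implementation, roughly like this (the model id below is just a placeholder):

```python
import torch
from transformers import AutoModelForCausalLM

# On V100 FlashAttention-2 is unavailable, so request PyTorch's
# scaled-dot-product attention (or "eager") instead of "flash_attention_2".
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2-7B-Instruct",      # placeholder model id
    torch_dtype=torch.float16,
    attn_implementation="sdpa",
)
```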
-
### System Info
L4 GPU (AWS G6.12xl) with TensorRT-LLM 0.11.0, running with Triton backends
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified …
-
### Describe the bug
When the IP adapter is loaded, the IPAdapterAttnProcessor2_0 class is not set with training=False. I didn't see a huge difference in memory if it's manually set to False on those process…
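A rough sketch of setting those processors to eval manually, assuming a standard diffusers IP-Adapter setup (the checkpoint and adapter names below are only examples):

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16  # example checkpoint
).to("cuda")
pipe.load_ip_adapter("h94/IP-Adapter", subfolder="models", weight_name="ip-adapter_sd15.bin")

# IPAdapterAttnProcessor2_0 is an nn.Module (it owns to_k_ip/to_v_ip), so put
# every module-type attention processor into eval mode, i.e. training=False.
for proc in pipe.unet.attn_processors.values():
    if isinstance(proc, torch.nn.Module):
        proc.eval()
```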
-
### Bug description
It seems that they updated the Gemma v1 2B weights. Something to look into:
```
⚡ main ~/litgpt litgpt chat checkpoints/google/gemma-2b
{'access_token': None,
'checkpoint_…
```