-
# ❓ Questions and Help
Hi guys,
Thanks a lot for the amazing work. I am trying to use `xformers` on CLIP; following the `timm` tutorial, I've put together the following code:
```python
MODEL_NAM…
```
-
# 🌟 New model addition
My teammates and I (including @ice-americano) would like to use efficient self-attention methods such as Linformer, Performer, and Nystromformer.
## Model description
The…
-
From https://github.com/pytorch/pytorch/pull/133065#issuecomment-2288701447. Basically, there was a noticeable performance drop on the inference side after bumping the HF pin, [dashboard](https://…
-
Hi, this repository helped me a lot, thank you!
By the way, I have a question.
Is there a way to apply attention to only certain parts of the image?
In other words, is there a way to specify the part …
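Restricting attention to certain positions is usually done with an attention mask (or bias): pairs outside the allowed region are set to `-inf` before the softmax, so they receive zero weight. xformers' `memory_efficient_attention` exposes this through its `attn_bias` argument; the masking logic itself can be sketched in plain NumPy (the `masked_attention` helper below is illustrative, not part of this repository):

```python
import numpy as np

def masked_attention(q, k, v, mask):
    """Scaled dot-product attention restricted by a boolean mask.

    q, k, v: (n, d) arrays; mask: (n, n) boolean, True where query i may
    attend to key j. Each mask row must contain at least one True.
    """
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = np.where(mask, scores, -np.inf)        # disallowed pairs get -inf
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over allowed keys only
    return weights @ v

# Token 0 may attend only to itself; token 1 may attend to both tokens.
mask = np.array([[True, False],
                 [True, True]])
out = masked_attention(np.zeros((2, 4)), np.zeros((2, 4)), np.eye(2), mask)
print(out)  # row 0 copies v[0]; row 1 averages v[0] and v[1]
```

For image patches, the mask would be built from patch coordinates (True for patches inside the region of interest), then passed as a bias of `0`/`-inf` values.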
-
# 🐛 Bug
The documentation says one thing that doesn't match the equivalent code:
[equivalent code](https://github.com/facebookresearch/xformers/blob/97cc81f5e4aef5af7202791bd75a7e3fb5a1762e/xfor…
-
Hello,
As per [this line](https://github.com/aqlaboratory/openfold/blob/main/openfold/model/evoformer.py#L649), when neither Deepspeed nor LMA is selected, the custom memory-efficient [kernel](http…
-
### Describe the issue
Hello,
I am encountering an unexpected performance issue while using the MInference library with the Qwen/Qwen2-7B-Instruct model. I have followed the example provided in ru…
-
### 🚀 The feature, motivation and pitch
I'm working on ensembling multiple UNets with the method described in [Model Ensembling](https://pytorch.org/tutorials/intermediate/ensembling.html). This met…
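That tutorial's recipe stacks the per-model parameters with `torch.func.stack_module_state` and then `vmap`s a single functional forward pass over the stacked dimension. A minimal sketch, using a small `Linear` stand-in for the UNets (the stand-in model and shapes are illustrative assumptions):

```python
import torch
from torch.func import stack_module_state, functional_call, vmap

# Toy stand-ins for the ensemble members (all must share one architecture).
models = [torch.nn.Linear(4, 2) for _ in range(3)]

# Stack each parameter/buffer across models along a new leading dimension.
params, buffers = stack_module_state(models)

# A "meta" copy is used only for its structure; its own weights are never read.
base = torch.nn.Linear(4, 2).to("meta")

def call_one(p, b, x):
    # Run the base architecture with one member's parameters substituted in.
    return functional_call(base, (p, b), (x,))

x = torch.randn(5, 4)
# vmap over the stacked dimension: one forward pass per ensemble member.
out = vmap(call_one, in_dims=(0, 0, None))(params, buffers, x)
print(out.shape)  # torch.Size([3, 5, 2])
```

The result matches looping over the models and stacking their outputs, but the batched forward can be noticeably faster on GPU.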
-
WARNING[XFORMERS]: xFormers can't load C++/CUDA extensions. xFormers was built for:
PyTorch 2.3.0+cu121 with CUDA 1201 (you have 2.4.0+cu121)
Python 3.10.14 (you have 3.10.12)
Please rei…
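The warning fires because the prebuilt xformers wheel was compiled against a different PyTorch build (2.3.0+cu121) than the one running (2.4.0+cu121); the fix is reinstalling an xformers build that matches the installed PyTorch. The check behind the warning can be sketched in pure Python (`compatible` is a hypothetical helper, not xformers API):

```python
def split_tag(tag: str):
    # Split a PyTorch-style version tag such as '2.3.0+cu121' into
    # a numeric version tuple and the local build tag ('cu121').
    version, _, local = tag.partition("+")
    return tuple(int(p) for p in version.split(".")), local

def compatible(built_for: str, running: str) -> bool:
    # Binary extensions only load reliably when both the version tuple
    # and the CUDA build tag match exactly.
    return split_tag(built_for) == split_tag(running)

# The mismatch reported in the warning above:
print(compatible("2.3.0+cu121", "2.4.0+cu121"))  # False: built for 2.3.0, running 2.4.0
```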
-
> The experimental results under different lengths demonstrate that BurstAttention offers significant advantages for processing long sequences compared with these competitive baselines, especially ten…