-
https://arxiv.org/abs/2009.06732
-
1. Public code and paper link:
I have installed the following code: https://github.com/AILab-CVC/GroupMixFormer
Paper link: https://arxiv.org/abs/2311.15157
2. What does this work d…
-
Great work!
Currently, I am reproducing this work. I found that the `LlamaForCausalLM` used in the repository is out of date, and its memory cost is much higher than that of the `LlamaForCausalLM` from Hug…
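As a rough sanity check on such memory comparisons, a back-of-envelope estimate (my own sketch, not the repository's actual numbers) relates weight footprint to parameter count and dtype width:

```python
# Back-of-envelope sketch (not the repository's actual numbers): weight-only
# memory for a causal LM, given a parameter count and bytes per parameter.
def weight_memory_gb(n_params, bytes_per_param):
    """Return the raw weight footprint in GiB (ignores activations and KV cache)."""
    return n_params * bytes_per_param / 1024**3

# A hypothetical 7B-parameter model: fp32 vs. fp16 weights.
fp32_gb = weight_memory_gb(7e9, 4)  # ~26 GiB
fp16_gb = weight_memory_gb(7e9, 2)  # ~13 GiB
```

Activations, optimizer state, and the KV cache come on top of this, which is where an outdated implementation can diverge most.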
-
### Is your feature request related to a problem?
The sequential table transformer (#802) is great if later transformations depend on prior ones. Often, however, columns are transformed independently…
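To illustrate the request, a minimal sketch of independent per-column transforms (illustrative names, not this library's API) — each callable sees only its own column, so the transforms could in principle run in parallel:

```python
# Hypothetical sketch: apply each column's transform independently,
# with no dependence between columns (illustrative, not the library's API).
def transform_columns(table, transforms):
    """table: dict of column name -> list of values;
    transforms: dict of column name -> callable applied elementwise."""
    return {
        col: [transforms[col](v) for v in values] if col in transforms else values
        for col, values in table.items()
    }

table = {"price": [10.0, 20.0], "name": ["a", "b"]}
out = transform_columns(table, {"price": lambda v: v * 1.1})
```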
-
Platforms: linux
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1…
-
Hi @AlexeyAB,
Could we have support for [DeiT](https://github.com/facebookresearch/deit)?
Thanks
-
When attempting to deploy the model to SageMaker manually via a deployment script, or automatically via the Hugging Face Inference Endpoints UI, I receive the same error:
"ValueEr…
-
### 🐛 Describe the bug
Looks like it's dispatching to efficient attention backward and failing one of the shape checks (
```
TORCH_CHECK(
    max_seqlen_k
```
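For context, the "math" reference that the efficient backend is compared against can be sketched as plain softmax attention with an additive mask (my own NumPy sketch, not PyTorch's actual implementation):

```python
import numpy as np

def ref_attention(q, k, v, attn_mask=None):
    """Math-reference scaled dot-product attention (sketch, not PyTorch's code).

    q: (..., seq_q, d), k/v: (..., seq_k, d);
    attn_mask: additive mask broadcastable to (..., seq_q, seq_k).
    """
    d = q.shape[-1]
    scores = q @ np.swapaxes(k, -1, -2) / np.sqrt(d)
    if attn_mask is not None:
        scores = scores + attn_mask
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v
```

The dispatcher's shape checks (like the `max_seqlen_k` one above) guard the fused kernel; the math path has no such restriction, which is why mismatches show up as backward-pass failures only on some shapes.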
-
Fine-tuning Qwen2-57B-A14B-Instruct is extremely slow compared to fine-tuning Qwen2-72B-Instruct.
Here are the runtimes:
**Qwen/Qwen2-7B-Instruct:**
{'train_runtime': 100.8509, 'trai…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
xformers is installed and available in my conda env yet n…