-
Is there a strict GPU requirement for flash_attention? I tried to test on a V100, but this GPU does not support flash_attention, which results in a RuntimeError: No available …
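For what it's worth, the FlashAttention-2 kernels need an Ampere-class GPU (compute capability 8.0 or newer), and a V100 reports compute capability 7.0, so it cannot use them. Below is a minimal sketch of a runtime capability check with a fallback to PyTorch's SDPA attention; the checkpoint name is only an example, not one taken from this report.

```python
import torch
from transformers import AutoModelForCausalLM

# FlashAttention-2 requires compute capability >= 8.0 (Ampere or newer);
# a V100 reports (7, 0), so we fall back to PyTorch's SDPA implementation.
major, minor = torch.cuda.get_device_capability()
attn_impl = "flash_attention_2" if major >= 8 else "sdpa"

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B",   # example checkpoint, substitute your own
    torch_dtype=torch.float16,
    attn_implementation=attn_impl,
)
print(f"Loaded with attn_implementation={attn_impl}")
```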
-
# Description:
Hello! I appreciate the excellent work on benchmarking Performer and Longformer against the base Transformer. I’d like to propose the implementation of additional efficient Transformer…
-
### Feature request
This request aims to introduce functionality to delete specific adapter layers integrated with PEFT (Parameter-Efficient Fine-Tuning) within the Hugging Face Transformers librar…
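For reference, the standalone PEFT library already supports deleting a named adapter at the tuner level (`LoraModel.delete_adapter`); the sketch below, with an illustrative base checkpoint and adapter names, shows roughly what that looks like today when the model is wrapped as a `PeftModel`. The feature request would mirror this behaviour in the Transformers-side integration.

```python
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Illustrative base model and adapter names -- substitute your own.
base = AutoModelForCausalLM.from_pretrained("facebook/opt-350m", torch_dtype=torch.float16)

peft_config = LoraConfig(task_type="CAUSAL_LM", r=8, lora_alpha=16)
model = get_peft_model(base, peft_config, adapter_name="adapter_a")
model.add_adapter("adapter_b", peft_config)

# Deletion currently lives on the PEFT tuner (here a LoraModel): this removes
# the named adapter's injected layers and its config from the wrapped model.
model.base_model.delete_adapter("adapter_b")
print(list(model.base_model.peft_config))  # only 'adapter_a' should remain
```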
-
https://proceedings.mlsys.org/paper_files/paper/2023/file/523f87e9d08e6071a3bbd150e6da40fb-Paper-mlsys2023.pdf
-
Model generates only garbage.
Sample: https://github.com/aws-neuron/aws-neuron-samples/blob/master/torch-neuronx/transformers-neuronx/inference/llama-3-8b-32k-sampling.ipynb
Neuron SDK 2.19, PyTorc…
-
(allegro) D:\PyShit\Allegro>python single_inference.py ^
More? --user_prompt "A seaside harbor with bright sunlight and sparkling seawater, with many boats in the water. From an aerial view, the boats…
-
Hello, are there any plans to open-source the code for this paper soon?
-
### System Info
Environment:
OS: Ubuntu 24.04
Python version: 3.11.8
Transformers version: transformers==4.45.2
Torch version: torch==2.3.0
Model: Meta-Llama-3.1-70B-Q2_K-GGUF - https://hugg…
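As a point of reference, this is roughly how a GGUF checkpoint is loaded through the Transformers GGUF integration; the repo id and filename below are placeholders rather than the exact ones from this report, and the GGUF weights are dequantized into the requested torch dtype at load time.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo id and GGUF filename -- substitute the actual Q2_K file used here.
repo_id = "some-org/Meta-Llama-3.1-70B-Q2_K-GGUF"
gguf_file = "meta-llama-3.1-70b.Q2_K.gguf"

# Transformers dequantizes the GGUF weights while loading, so the in-memory
# model ends up in the requested torch dtype rather than the 2-bit format.
tokenizer = AutoTokenizer.from_pretrained(repo_id, gguf_file=gguf_file)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    gguf_file=gguf_file,
    torch_dtype=torch.float16,
)
```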
-
https://arxiv.org/abs/2001.04451
-
* https://arxiv.org/abs/2001.04451
* ICLR 2020
Large Transformer models routinely achieve state-of-the-art results on many tasks, but training these models can be prohibitively costly, especially on long sequences.
We introduce two techniques to improve the efficiency of Transformers.
One replaces dot-product attention with attention based on locality-sensitive hashing…
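A rough sketch of the locality-sensitive-hashing idea behind Reformer's attention follows; it is not the paper's actual implementation. Queries and keys (which Reformer ties together) are bucketed by random rotations, and attention is restricted to positions that fall in the same bucket.

```python
import torch

def lsh_buckets(x, n_buckets, n_rounds=1):
    # x: (seq_len, d). Random rotations followed by an argmax give each position
    # a bucket id; nearby vectors tend to land in the same bucket (angular LSH).
    d = x.shape[-1]
    rotations = torch.randn(d, n_rounds, n_buckets // 2)
    rotated = torch.einsum("sd,drb->srb", x, rotations)
    # Concatenate [rotated, -rotated] so the buckets cover both half-spaces.
    return torch.argmax(torch.cat([rotated, -rotated], dim=-1), dim=-1)  # (seq, rounds)

def bucketed_attention(q, k, v, n_buckets=8):
    # Toy single-round LSH attention: attend only to positions sharing a bucket.
    # (A real implementation sorts by bucket and chunks instead of masking a full
    # seq x seq score matrix, which is what gives the efficiency gain.)
    buckets = lsh_buckets(q, n_buckets)[:, 0]               # (seq,)
    scores = q @ k.T / q.shape[-1] ** 0.5                   # (seq, seq)
    mask = buckets[:, None] != buckets[None, :]
    scores = scores.masked_fill(mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

q = k = torch.randn(16, 32)   # Reformer shares the query and key projections
v = torch.randn(16, 32)
out = bucketed_attention(q, k, v)
print(out.shape)  # torch.Size([16, 32])
```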