-
### Describe the bug
I was unable to load the model with the following parameters; I am on the latest version of xinference, 0.8.1.
I started qwen-chat on the UI page. Model format: gptq
mode…
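
For anyone trying to reproduce this outside the web UI, a minimal sketch of the equivalent launch through the Python client might look like the following. The endpoint, model size, and quantization values are placeholder assumptions, since the original parameters are truncated above.

```python
# Hypothetical reproduction via the Python client instead of the web UI.
# Model size and quantization below are placeholders, not the reporter's actual settings.
from xinference.client import Client

client = Client("http://localhost:9997")   # default local supervisor endpoint
model_uid = client.launch_model(
    model_name="qwen-chat",
    model_format="gptq",
    model_size_in_billions=7,              # placeholder value
    quantization="Int4",                   # placeholder value
)
print(model_uid)
```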
-
# Description
Current challenges in using Neural Operators include irregular meshes, multiple inputs, multiple inputs on different meshes, and multi-scale problems. [1] The Attention mechanism is promi…
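
As a rough illustration of why attention is attractive here: treating each mesh node as a token makes the layer independent of any grid structure. The shapes and the use of `nn.MultiheadAttention` below are illustrative assumptions, not the project's actual design.

```python
# Sketch: treating mesh nodes as tokens so attention works on irregular geometries.
# Shapes and the choice of nn.MultiheadAttention are illustrative assumptions only.
import torch
import torch.nn as nn

n_nodes, d_model = 2048, 64                  # irregular mesh: no grid structure assumed
features = torch.randn(1, n_nodes, d_model)  # (batch, nodes, channels)

attn = nn.MultiheadAttention(embed_dim=d_model, num_heads=4, batch_first=True)
out, _ = attn(features, features, features)  # self-attention over mesh nodes
print(out.shape)                             # torch.Size([1, 2048, 64])
```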
-
By saving the model and reloading it, I managed to get the model working with both quantized and full precision (it still uses at most 10 GB of GPU RAM).
However, the model generates random characters. He…
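
A minimal sketch of that save-and-reload workaround using the standard `transformers` calls; the checkpoint id, local path, and dtype are placeholder assumptions, not the reporter's actual setup.

```python
# Sketch of the save-then-reload workaround; checkpoint id and paths are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

src = "facebook/opt-1.3b"          # placeholder: whichever model the report refers to
save_dir = "./reloaded-model"      # hypothetical local path

tokenizer = AutoTokenizer.from_pretrained(src)
model = AutoModelForCausalLM.from_pretrained(src, torch_dtype=torch.float16)

tokenizer.save_pretrained(save_dir)
model.save_pretrained(save_dir)

# Reload from the local copy, as described above.
reloaded = AutoModelForCausalLM.from_pretrained(
    save_dir,
    torch_dtype=torch.float16,     # or a quantized config, as in the report
    device_map="auto",
)
```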
-
**What API design would you like to have changed or added to the library? Why?**
Most people expect `diffusers` and `transformers` models to be "unloaded" so that they can just run a big p…
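
For comparison, the offloading hooks `diffusers` already exposes look roughly like this; the checkpoint id below is a placeholder.

```python
# Sketch of the existing offloading hooks in diffusers; the checkpoint id is a placeholder.
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",  # placeholder checkpoint
    torch_dtype=torch.float16,
)

# Move whole sub-models to the GPU only while they run, then back to CPU.
pipe.enable_model_cpu_offload()

# More aggressive alternative: offload layer by layer (slower, lowest VRAM).
# pipe.enable_sequential_cpu_offload()

image = pipe("a photo of an astronaut riding a horse").images[0]
```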
-
### Feature request
Hi! I’ve been researching LLM quantization recently ([this paper](https://arxiv.org/abs/2405.14852)), and noticed a potentially important issue that arises when using LLMs with 1-…
-
First, congrats on the repo - it looks great.
I discovered that switching between `torch.no_grad` and `torch.inference_mode` leads to a switch to `aten.linear.default`. Feel free to use this feedback …
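
A small way to observe which aten ops get dispatched under each context (a sketch, not the exact repro behind this report):

```python
# Sketch: log which aten ops a linear layer dispatches under no_grad vs inference_mode.
import torch
from torch.utils._python_dispatch import TorchDispatchMode

class OpLogger(TorchDispatchMode):
    def __torch_dispatch__(self, func, types, args=(), kwargs=None):
        print(func)  # the dispatched op may differ between the two contexts
        return func(*args, **(kwargs or {}))

layer = torch.nn.Linear(8, 8)
x = torch.randn(2, 8)

with torch.no_grad(), OpLogger():
    layer(x)

with torch.inference_mode(), OpLogger():
    layer(x)
```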
-
We are trying to fine-tune ChatGLM-6B using LoRA on Arc A770 with 1 card and 2 cards, using the following commands:
1 card:
```
python ./alpaca_lora_finetuning.py \
--base_model "/home/intel/models/chat…
```
-
Hello,
I have an enormous number of `nan` and `inf` values in the outputs of quantized models for sequence classification. This is not the case with non-quantized models, which never output NaNs whatever the s…
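
A minimal sketch of how the NaN/inf values can be detected in the logits, assuming an 8-bit `bitsandbytes` load; the checkpoint id is a placeholder.

```python
# Sketch: check a quantized sequence-classification model's logits for nan/inf.
# The checkpoint id and 8-bit config are placeholder assumptions.
import torch
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    BitsAndBytesConfig,
)

name = "distilbert-base-uncased-finetuned-sst-2-english"   # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForSequenceClassification.from_pretrained(
    name,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)

inputs = tokenizer("example sentence", return_tensors="pt").to(model.device)
with torch.no_grad():
    logits = model(**inputs).logits

print(torch.isnan(logits).any(), torch.isinf(logits).any())
```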
-
Description
- This project is intended to explore a couple of papers from the literature on Quantum Transformer models [self-attention model: https://arxiv.org/abs/2205.05625, quantum vision transformers: htt…
-
### ⚠️ Please check that this feature request hasn't been suggested before.
- [X] I searched previous [Ideas in Discussions](https://github.com/OpenAccess-AI-Collective/axolotl/discussions/categories…