-
## 🐛 Bug
I am trying to optimise the `Qwen/Qwen1.5-4B-Chat` model. As I have only 8GB RAM on my Mac M1, I use 3-bit quantisation and a really small prefill chunk size of 2048. I get the following err…
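The combination of 3-bit quantisation and a prefill chunk size is consistent with an MLC-LLM-style pipeline. Assuming that (the `q3f16_1` quant mode and the output path below are my assumptions, not from the report), a minimal sketch of lowering the prefill chunk in the generated `mlc-chat-config.json`:

```python
import json
from pathlib import Path

# Hypothetical output directory from MLC LLM's convert_weight/gen_config steps,
# assuming the 3-bit q3f16_1 quantisation mode; adjust to your local layout.
config_path = Path("dist/Qwen1.5-4B-Chat-q3f16_1-MLC/mlc-chat-config.json")

config = json.loads(config_path.read_text())
# Smaller prefill chunks lower peak activation memory during prompt
# processing, at the cost of more prefill iterations.
config["prefill_chunk_size"] = 2048
config_path.write_text(json.dumps(config, indent=2))
```

On an 8 GB machine, shrinking the prefill chunk trades prompt-processing speed for a lower memory peak.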
-
Hi, thanks for your wonderful work. I was wondering: if I want to run inference on a model with multiple GPUs, what should I do? I have tried the code below when loading the model with the `device_map` parameter:
```
model…
```
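For reference, a minimal multi-GPU loading sketch with Transformers: `device_map="auto"` lets Accelerate shard the weights across all visible GPUs (the model ID here is just an example):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-4B-Chat"  # example model ID

# device_map="auto" lets Accelerate place layers across all visible GPUs.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

inputs = tokenizer("Hello!", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```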
-
```
root@dsw-541920-5fd5c64bc4-m25b4:/mnt/workspace/modelscope# xtuner train llama2_7b_chat_qlora_custom_sft_e1_copy.py --deepspeed deepspeed_zero1
[2024-07-01 21:43:15,368] [INFO] [real_accelerator.p…
```
-
Running DPO with Qwen, I run into a flattening problem. My FSDP config is as follows:
```yaml
compute_environment: LOCAL_MACHINE
debug: false
distributed_type: FSDP
downcast_bf16: 'no'
fsdp_config:
fsdp_auto_w…
```
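For context, a minimal DPO sketch with `trl` (versions around 0.9; the model ID and toy dataset are placeholders), to be launched under the FSDP config above with `accelerate launch`:

```python
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "Qwen/Qwen1.5-0.5B-Chat"  # placeholder; a small model for illustration
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Toy preference dataset in the prompt/chosen/rejected schema DPOTrainer expects.
train_dataset = Dataset.from_dict({
    "prompt": ["What is 2 + 2?"],
    "chosen": ["4"],
    "rejected": ["5"],
})

args = DPOConfig(output_dir="dpo-out", per_device_train_batch_size=1, max_steps=1)
trainer = DPOTrainer(model=model, args=args, train_dataset=train_dataset, tokenizer=tokenizer)
trainer.train()
```

If the flattening error comes from FSDP's flat parameters, setting `fsdp_use_orig_params: true` in the Accelerate FSDP config is one commonly suggested thing to try.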
-
### System Info
bitsandbytes==0.43.1
peft==0.11.0
accelerate==0.31.0
transformers==4.38.2
trl==0.9.4
### Who can help?
@BenjaminBossan @sayakpaul
### Information
- [X] The official …
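The pinned bitsandbytes + peft + trl combination above typically corresponds to a QLoRA-style setup; for anyone cross-checking those versions, a minimal sketch (the model ID is a placeholder):

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 4-bit NF4 quantisation via bitsandbytes, the usual QLoRA base configuration.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder model ID
    quantization_config=bnb_config,
    device_map="auto",
)

# Attach LoRA adapters on the attention projections.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```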
-
Hello everyone. I used this code before for LLaMA 2 7B. But now, it doesn't work with any model, even Phi 3!

`!pip install -q accelerate bitsandbytes peft==0.4.0 transformers==4.38.2 trl==0.4.7`
`!pip inst…`
-
### System Info
```
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.54.15 Driver Version: 550.54.15 CUDA Version: 12.4…
```
-
## Goal
- We should have a model folder that can handle different kinds of models (see the sketch after this list):
  - Built-in models (e.g. `janhq/llama3:7b-tensorrt-llm`)
  - Hugging Face GGUF repos with multiple quants (e.g.…
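As a purely hypothetical sketch of what a model-folder entry could look like (all names and fields here are illustrative, not the project's actual schema):

```python
from dataclasses import dataclass, field

@dataclass
class ModelEntry:
    """One entry in the model folder: a built-in model or a Hugging Face GGUF repo."""
    model_id: str   # e.g. "janhq/llama3:7b-tensorrt-llm" for a built-in model
    source: str     # "builtin" or "huggingface-gguf"
    quants: list[str] = field(default_factory=list)  # e.g. ["Q4_K_M", "Q8_0"]

# Illustrative registry covering the two source kinds named in the goal.
registry = [
    ModelEntry("janhq/llama3:7b-tensorrt-llm", source="builtin"),
    ModelEntry("TheBloke/Llama-2-7B-GGUF", source="huggingface-gguf",
               quants=["Q4_K_M", "Q5_K_M"]),
]
```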
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this question already answered in the FAQ? …
-
### 🐛 Describe the bug
Hello,
I am running llama3-70b and mixtral with vLLM on a bunch of different kinds of machines. I encountered wildly different output quality on A10 GPUs vs A100/H…
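When comparing output quality across GPU generations, pinning the dtype and using greedy decoding removes two common sources of variation; a minimal vLLM sketch (the model ID and parallelism degree are placeholders):

```python
from vllm import LLM, SamplingParams

# Fixing dtype avoids comparing float16 kernels on one machine against
# bfloat16 kernels on another; temperature=0 makes decoding greedy.
llm = LLM(
    model="meta-llama/Meta-Llama-3-70B-Instruct",  # placeholder model ID
    dtype="float16",
    tensor_parallel_size=4,  # adjust to the machine under test
)
params = SamplingParams(temperature=0.0, max_tokens=64)
out = llm.generate(["What is the capital of France?"], params)
print(out[0].outputs[0].text)
```

Even with identical settings, different kernel selections across architectures can legitimately change greedy outputs slightly; wildly different quality usually points at something else, such as a dtype, tokenizer, or checkpoint mismatch.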