-
I'm using the `NNsight` class to wrap a 4-bit quantized LLaVA model and encountered this error:
```
...
File ~/miniconda3/envs/llava/lib/python3.10/site-packages/bitsandbytes/nn/modules.py:429, i…
-
Hi guys,
I had a report earlier today from a user who tried one of my new AWQ models and got an error indicating that only float16 is supported with AWQ.
I tested it myself with t…
-
I'm building the CPU version of Paddle inside Docker and hit the following error:
```
paddle/fluid/jit/CMakeFiles/jit_download_program.dir/build.make:57: recipe for target 'paddle/fluid/jit/CMakeFiles/jit_download_program' failed
make[2]: *** [padd…
-
I have searched in several places and was unable to pin down this issue.
I wanted to open a ticket before I make any more changes or experiments to my setup.
That way I can come back here and say wh…
-
Hi, I ran into the following error when trying to load a Llama model:
```
Loading checkpoint shards: 100%|████████████████████████████████████████████████████████████████████████████████████████████████…
-
Hi.
I ran into this error when trying to fine-tune **Phi3 small**:
```triton.runtime.autotuner.OutOfResources: out of resource: shared memory, Required: 180224, Hardware limit: 101376. Reducing bloc…
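The numbers in the traceback can be sanity-checked directly: Triton's shared-memory footprint scales roughly linearly with the autotuned block dimensions, so shrinking a block size shrinks the requirement proportionally. A minimal sketch of that arithmetic (the linear-scaling assumption is mine, not stated in the traceback):

```python
required = 180224  # bytes of shared memory the chosen kernel config needs (from the error)
limit = 101376     # bytes of shared memory the GPU allows per block (from the error)

# The kernel asks for ~1.78x what the hardware offers, so a block
# dimension must shrink by at least that factor.
factor = required / limit
print(f"over budget by {factor:.2f}x")  # → over budget by 1.78x

# Halving one block dimension (assuming linear scaling) already fits:
print(required // 2 <= limit)  # → True
```

This is why the truncated message ends with advice about "Reducing bloc…" — one halving of a block size is enough on this GPU.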
-
### System Info
transformers==4.31.0
accelerate==0.21.0
deepspeed==0.13.2
bitsandbytes==0.42.0
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [X] My own m…
-
### System Info
bitsandbytes 0.43.1
Python 3.10.12
"CUDA" library: rocm-libs Version: 6.0.0.60000-91~22.04
Ubuntu 22.04.1
Getting the following error after Mistral safetensors are …
-
I am trying to fine-tune a bitsandbytes-quantized model for summarization using LoRA.
base_model - `cognitivecomputations/dolphin-2.2.1-mistral-7b`
I am training it for 1 epoch; weirdly, the loss is at 0 from…
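One common cause of a constant-zero loss in this kind of setup (an assumption on my part, not confirmed by the truncated report) is that every label position is masked with the ignore index, so nothing contributes to the cross-entropy. A pure-Python sketch of that failure mode, using the Hugging Face convention of `-100` for ignored positions:

```python
import math

IGNORE_INDEX = -100  # Hugging Face convention: positions with this label are excluded from the loss

def masked_cross_entropy(logprobs, labels):
    """Average negative log-likelihood over positions whose label != IGNORE_INDEX.

    logprobs: list of dicts mapping token id -> log-probability at that position
    labels:   list of int token ids, or IGNORE_INDEX to skip a position
    """
    total, count = 0.0, 0
    for lp, y in zip(logprobs, labels):
        if y == IGNORE_INDEX:
            continue
        total += -lp[y]
        count += 1
    # If every label is masked, there is nothing to average and many
    # training loops report a loss of exactly 0.0.
    return total / count if count else 0.0

# A real prediction contributes a positive loss...
print(masked_cross_entropy([{7: math.log(0.5)}], [7]))     # → 0.6931...
# ...but if the data collator masked everything, the loss is flat zero:
print(masked_cross_entropy([{7: math.log(0.5)}], [-100]))  # → 0.0
```

So a first thing to check is whether the tokenizer/collator is assigning `-100` to the entire target sequence (e.g. a prompt-masking template that never leaves the prompt).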
-
### System Info
ubuntu22
conda
python3.11
nvidia-cudnn-cu12
torch 2.3.0
vllm 0.5.0.post1
vllm-flash-attn 2.5.9…