-
### Your current environment
```text
The output of `python env.py`
```
### How did you install Aphrodite?
When using pre-conversion, I ran into the following error:
ValueError: 17 is not a valid GGM…
-
Hello,
I shared a folder with another user, but when I tried to stop that share using "regen the share link", the share link still remains for that user. How…
ilood updated 8 years ago
-
## Please check out [Announcing Llama 3.1 Support in vLLM](https://blog.vllm.ai/2024/07/23/llama31.html) ##
* Chunked prefill is turned on for all Llama 3.1 models. However, it is currently incompat…
-
Hello, following your blog I compiled the ffmpeg.so library, but even though the ffmpeg command line runs successfully, it still crashes with an error afterwards. I looked through the logs but couldn't see where the problem is; could you please advise?
**The log is as follows:**
```text
ffmpeg version 3.3 Copyright (c) 2000-2017 the FFmpeg developers
built with gcc 4.9 (GCC) 20140827 (prerelease)
c…
```
-
### Describe the Bug
### Problem description
Building from the develop branch, launching auto-compression on multiple GPUs for several inference models (EfficientNetB0, GhostNet_x1_0, MobileNetV1, etc.) fails while producing the quantized model with: IndexError: list index out of range
```
Traceback (most recent call last)…
-
I use AWQ to quantize Llama 2 70B-chat with:
```
CUDA_VISIBLE_DEVICES="1,2,3,4,5,6,7" python quantize_llama.py
```
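As a side note, `CUDA_VISIBLE_DEVICES` only controls which physical devices the process can see, and it must be set before any CUDA context is created. A minimal sketch of how the masking behaves (pure Python, no GPU required):

```python
import os

# Must be set before torch/CUDA initializes; here we mirror the launch command above.
os.environ["CUDA_VISIBLE_DEVICES"] = "1,2,3,4,5,6,7"

# Inside the process, frameworks renumber the visible devices from 0,
# so physical GPU 1 becomes cuda:0, GPU 2 becomes cuda:1, and so on.
visible = os.environ["CUDA_VISIBLE_DEVICES"].split(",")
print(len(visible))  # 7 devices are exposed to the process
```

Setting the variable on the command line (as above) or at the very top of the script are equivalent, as long as it happens before the first CUDA call.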
The code of quantize_llama.py:
```
from awq import AutoAWQForCausalLM
from tr…
-
### System Info
Hi,
I tried using PPO with a Gemma model, but I get this error.
I think the issue is here [is_encoder_decoder](https://github.com/huggingface/trl/blob/e90e8d91d2265e484f229c45a5eb8982f…
-
Running quantize and save_quantized succeeds, but loading the model to generate raises: AssertionError: Marlin kernels are not installed. Please install AWQ compatible Marlin kernels from AutoAWQ_kernels. The lo…
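If the assertion message is taken at face value, installing the optional kernels package usually resolves it. A hedged sketch; the PyPI package name `autoawq-kernels` is my assumption based on the AutoAWQ_kernels repository the error refers to, so verify it against that repo's install instructions:

```shell
# Assumed package name; check the AutoAWQ_kernels repository for the exact wheel
# matching your CUDA and PyTorch versions.
pip install autoawq-kernels
```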
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
I am trying to fine-tune Llama 2 7B with QLoRA on 2 GPUs. From what I've read, SFTTrainer should support multiple GPUs just fine, but when I run this I see one GPU with high utilization and one with al…
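One common cause is launching with plain `python`: without a distributed launcher, no `LOCAL_RANK`/`WORLD_SIZE` environment is set, so the trainer falls back to a single process and the second GPU sits mostly idle. A minimal sketch of the check, assuming the usual torchrun/accelerate environment variables:

```python
import os

# torchrun / accelerate set these per process; a bare `python train.py` run does not.
local_rank = int(os.environ.get("LOCAL_RANK", -1))
world_size = int(os.environ.get("WORLD_SIZE", 1))
print(local_rank, world_size)  # -1 1 when launched without a distributed launcher
```

Launching with something like `accelerate launch --num_processes 2 train.py` (or `torchrun --nproc_per_node 2 train.py`) starts one process per GPU so data parallelism actually engages; both are sketches, so adapt to your setup.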