-
# Todo
https://github.com/OCA/maintainer-tools/wiki/Migration-to-version-15.0
# Modules to migrate
- [x] account_move_line_product - By @JuanyDForgeflow - #1428
- [ ] product_route_profile -…
-
v0.6.1
```bash
python quantize.py --model_dir ./hg_weight_3999/ --dtype float16 --qformat int4_awq --export_path ./quantized_int4-awq --calib_size 32
```
```log
Using pad_token, but it is not se…
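```

The truncated warning looks like the standard transformers notice for a tokenizer with no pad token configured. Below is a minimal sketch of the usual workaround, reusing the checkpoint directory from the command above; whether quantize.py picks up a pre-set pad token is an assumption, so treat this as an illustration rather than a verified fix:

```python
from transformers import AutoTokenizer

# Tokenizer from the same checkpoint passed to quantize.py above.
tokenizer = AutoTokenizer.from_pretrained("./hg_weight_3999/")

# Reuse the EOS token as PAD so padded calibration batches don't trigger
# the pad-token warning (assumption: the script loads this config back).
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
    tokenizer.save_pretrained("./hg_weight_3999/")  # persist for the script
```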
-
Hi,
I have fine-tuned Qwen2-VL using Llama-Factory.
I successfully quantized the fine-tuned model with the following code:
```python
from transformers import Qwen2VLProcessor
from auto_gptq import BaseQuantizeC…
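```

Since the snippet above is cut off, here is a minimal sketch of the plain auto_gptq quantization flow for comparison. The paths and calibration text are placeholders, and the text-only `AutoGPTQForCausalLM` path shown here does not cover Qwen2-VL's vision tower (presumably why the original code also imports `Qwen2VLProcessor`):

```python
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig
from transformers import AutoTokenizer

model_path = "path/to/finetuned-qwen2-vl"  # placeholder

# 4-bit GPTQ with group-wise quantization.
quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoGPTQForCausalLM.from_pretrained(model_path, quantize_config)

# Calibration samples: GPTQ uses them to estimate per-layer quantization error.
examples = [tokenizer("Placeholder calibration sentence.", return_tensors="pt")]

model.quantize(examples)
model.save_quantized("path/to/quantized-output")  # placeholder
```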
-
Hi @Qubitium. Since the CPU path is already in gptqmodel, when do you plan to replace auto_gptq with gptqmodel in HuggingFace/optimum? I think we can open an issue in Optimum to let the maintainer kno…
-
### Your current environment
Hello everyone, I need some help here, please. I tried to quantize the JAIS model using GPTQ. Here is my code:
```python
from auto_gptq.modeling._base import BaseGPTQForC…
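```

The truncated import is presumably `BaseGPTQForCausalLM`, the class auto_gptq expects you to subclass for an architecture it does not ship support for. A minimal sketch of such a subclass follows; every module name below is hypothetical and must be replaced with the actual submodule paths of the JAIS checkpoint:

```python
from auto_gptq.modeling._base import BaseGPTQForCausalLM

class JAISGPTQForCausalLM(BaseGPTQForCausalLM):
    # Class name of the repeated transformer block (hypothetical).
    layer_type = "JAISBlock"
    # Attribute path to the list of transformer blocks (hypothetical).
    layers_block_name = "transformer.h"
    # Modules outside the blocks that stay unquantized (hypothetical).
    outside_layer_modules = ["transformer.wte", "transformer.ln_f"]
    # Linear layers inside each block, grouped in the order GPTQ
    # should quantize them (hypothetical names).
    inside_layer_modules = [
        ["attn.c_attn"],
        ["attn.c_proj"],
        ["mlp.c_fc", "mlp.c_fc2"],
        ["mlp.c_proj"],
    ]
```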
-
### System Info
```shell
Name: optimum
Version: 1.18.0.dev0
Name: transformers
Version: 4.36.0
Name: auto-gptq
Version: 0.6.0.dev0+cu118
CUDA Version: 11.8
Python 3.8.17
```
### Who can help…
-
# Todo
https://github.com/OCA/maintainer-tools/wiki/Migration-to-version-16.0
# Modules to migrate
- [ ] delivery_procurement_group_carrier - By @rousseldenis - #1158
- [x] delivery_total_we…
-
### System Info
CPU architecture: x86_64
GPU name: NVIDIA V100 32GB
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
…
-
Loading the official 4-bit quantized model ([Llama2-Chinese-13b-Chat-4bit](https://huggingface.co/FlagAlpha/Llama2-Chinese-13b-Chat-4bit/tree/main)) with AutoGPTQForCausalLM.from_quantized fails with: NameError: name 'autogptq_cuda_256' is not de…
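This NameError typically surfaces when auto-gptq was installed without its CUDA extension being compiled, so the kernel module never imports and the name is left undefined. A quick diagnostic sketch, assuming the standard auto-gptq extension name:

```python
# autogptq_cuda_256 is the compiled CUDA kernel extension shipped with
# auto-gptq; if the package was built without CUDA it is absent, and the
# CUDA code path later fails with the NameError shown above.
try:
    import autogptq_cuda_256  # noqa: F401
    print("auto-gptq CUDA kernels are available")
except ImportError:
    print("CUDA extension missing; reinstall auto-gptq against your CUDA toolkit")
```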
-
**Describe the bug**
I am trying to fine-tune DeepSeek-Coder-V2-Lite-Instruct (16B) on a system with 8 MI300X GPUs. Running on fewer than 8 GPUs works as expected and runs to completion. …