-
### System Info
```
Transformers 4.41.2
peft 0.11.1
```
Single `T4` GPU.
I am implementing QLoRA to fine-tune `mistral-7b` on a `T4` GPU. I loaded the model with the quantized confi…
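Since the quantization config is truncated above, here is a minimal sketch of the 4-bit settings typically used for QLoRA on a T4 (the values are assumptions, not the poster's actual config; note a T4 does not support `bfloat16`, so compute is done in `float16`):

```python
# Hypothetical QLoRA quantization settings, kept as a plain dict so the
# sketch is self-contained; a T4 lacks bfloat16, so compute in float16.
bnb_kwargs = {
    "load_in_4bit": True,                 # 4-bit base weights
    "bnb_4bit_quant_type": "nf4",         # NF4 quantization
    "bnb_4bit_use_double_quant": True,    # quantize the quantization constants
    "bnb_4bit_compute_dtype": "float16",  # not bfloat16 on a T4
}

# With transformers/peft installed, this would be used roughly as:
#   import torch
#   from transformers import AutoModelForCausalLM, BitsAndBytesConfig
#   cfg = BitsAndBytesConfig(**{**bnb_kwargs,
#                               "bnb_4bit_compute_dtype": torch.float16})
#   model = AutoModelForCausalLM.from_pretrained(
#       "mistralai/Mistral-7B-v0.1",
#       quantization_config=cfg, device_map="auto")

print(bnb_kwargs["bnb_4bit_quant_type"])
```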
-
Feel free to post useful resources and suggestions, or show off your projects here. Older comments of this nature can be found in #21 .
-
### Checklist
- [x] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
- [ ] 3. Please note that if the bug-related issue y…
-
## 🐛 Bug
I ran `mlc_llm chat HF://mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC` and it failed with `ValueError: Cannot find global var "multinomial_from_uniform1" in the Module`
## To Reproduce
S…
-
When fine-tuning llama2 with DeepSpeed and QLoRA on one node with multiple GPUs, I used ZeRO-3 to partition the model parameters, but it always loads the full parameters on each GPU first and only then partitions params…
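One common cause (an assumption here, since the full script is not shown) is that the model is constructed before DeepSpeed's ZeRO-3 context is active. In `transformers`, creating an `HfDeepSpeedConfig` *before* `from_pretrained` activates `deepspeed.zero.Init`, so weights are partitioned as they are loaded rather than replicated first. A sketch, with the library calls shown in comments:

```python
# Hypothetical ZeRO-3 config as a plain dict; stage 3 partitions
# parameters, gradients, and optimizer state across GPUs.
ds_config = {
    "zero_optimization": {"stage": 3},
    "train_micro_batch_size_per_gpu": 1,
}

# With transformers/deepspeed installed, the key ordering is roughly:
#   from transformers.integrations import HfDeepSpeedConfig
#   from transformers import AutoModelForCausalLM
#   dschf = HfDeepSpeedConfig(ds_config)  # must exist BEFORE from_pretrained
#   model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
# If HfDeepSpeedConfig is created after the model, each rank first
# materializes the full weights, which matches the behavior described above.

print(ds_config["zero_optimization"]["stage"])
```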
-
Hi team,
I am trying to deploy my model on an AMD NPU device using the VitisAIExecutionProvider. I thought that all supported operators would be computed on the NPU, but I often encounter this notice:
`I202405…
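For context on why some operators end up off the NPU: ONNX Runtime assigns each graph node to the first execution provider in the session's provider list that supports it, and falls back to the CPU provider for the rest. A minimal sketch (provider names are real; the model path is a placeholder, and the session call is shown as a comment so the sketch stays self-contained):

```python
# ONNX Runtime tries providers in order for each node; nodes the
# VitisAIExecutionProvider cannot handle fall back to the CPU provider,
# which produces notices like the one quoted above.
providers = ["VitisAIExecutionProvider", "CPUExecutionProvider"]

# With onnxruntime installed, roughly:
#   import onnxruntime as ort
#   sess = ort.InferenceSession("model.onnx", providers=providers)
#   print(sess.get_providers())  # effective provider order for this session

print(providers)
```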
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
### Describe the bug
Both w4a16 quantization and w8a8 quantization of InternVL-Chat-V…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [x] 3. Please note that if the bug-related iss…
-
```
root@dsw-541920-5fd5c64bc4-m25b4:/mnt/workspace/modelscope# xtuner train llama2_7b_chat_qlora_custom_sft_e1_copy.py --deepspeed deepspeed_zero1
[2024-07-01 21:43:15,368] [INFO] [real_accelerator.p…
```
-
## Overview
Recently, the mlc-llm team has been working on migrating to a new model compilation workflow, which we refer to as SLM. SLM is the new approach that brings modularized, Python-first compila…