-
There have been quite a few changes since 0.1. We should document them for people updating their applications.
-
### What happened?
Hello, I'm having a problem quantizing a `safetensors` model to BF16 using `convert-hf-to-gguf.py`.
I can quantize any model to `f16` or `q8_0`, but I can't convert them to `bf1…
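For context on what a BF16 conversion actually does: bfloat16 keeps float32's sign and 8-bit exponent and truncates the mantissa to 7 bits, so a tensor can be converted by rounding away the low 16 bits of each float32 word. A minimal NumPy sketch of that bit-level operation (the helper names here are illustrative, not part of `convert-hf-to-gguf.py`):

```python
import numpy as np

def f32_to_bf16_bits(x):
    """Convert a float32 array to bfloat16 bit patterns (round to nearest even)."""
    bits = np.asarray(x, dtype=np.float32).view(np.uint32)
    # add a rounding bias before truncating the low 16 mantissa bits
    rounded = bits + np.uint32(0x7FFF) + ((bits >> 16) & 1)
    return (rounded >> 16).astype(np.uint16)

def bf16_bits_to_f32(h):
    """Widen bfloat16 bit patterns back to float32 (exact, no rounding)."""
    return (h.astype(np.uint32) << np.uint32(16)).view(np.float32)

x = np.array([1.0, 3.14159265], dtype=np.float32)
roundtrip = bf16_bits_to_f32(f32_to_bf16_bits(x))
# BF16 keeps full float32 range but only ~3 significant decimal digits
assert np.all(np.abs(roundtrip - x) <= np.abs(x) * 2**-8)
```

Because BF16 preserves the float32 exponent range, conversion never overflows the way a float16 conversion can; only mantissa precision is lost, which is why it is a popular storage type for model weights.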
-
### 🐛 Describe the bug
## Description
The outputs of the fully quantized and fake-quantized models do not match, and the fully quantized model does not match the expected analytical results for a minima…
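As a sanity baseline for this kind of mismatch: per-tensor fake quantization (quantize–dequantize in float) should agree bit-for-bit with the real integer path when both use the same scale and rounding. A small NumPy sketch of that check (function names are illustrative, not this repository's API):

```python
import numpy as np

def quantize(x, scale):
    # symmetric int8 quantization: round to nearest and clamp to [-127, 127]
    return np.clip(np.round(x / scale), -127, 127).astype(np.int8)

def fake_quantize(x, scale):
    # quantize-dequantize in float: simulates only the quantization error
    return quantize(x, scale).astype(np.float32) * scale

rng = np.random.default_rng(0)
x = rng.standard_normal(16).astype(np.float32)
scale = np.abs(x).max() / 127.0

# the integer path followed by dequantization must equal the fake-quant output
int_path = quantize(x, scale).astype(np.float32) * scale
assert np.array_equal(int_path, fake_quantize(x, scale))
```

If a check like this fails for a single layer, the usual suspects are a scale computed differently in the two paths, a different rounding mode (round-half-up vs round-to-nearest-even), or asymmetric clamping bounds.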
-
Following the earlier discussion, I first set export_quantization_bit=None and exported the unquantized model to export_dir.
Then, keeping export_dir as the export path, I selected export_quantization_bit = 4bit,
but I still get the error:
Please merge adapters before quantizing the model.
Could you please clarify? Than…
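The error means the LoRA adapter weights are still held separately from the base weights; merging folds the low-rank update back into each dense matrix before quantization sees it. A NumPy sketch of what the merge computes, using the standard LoRA formulation W' = W + (α/r)·BA (this illustrates the math, not LLaMA-Factory's code):

```python
import numpy as np

def merge_lora(W, A, B, alpha, r):
    """Fold a LoRA update into the base weight: W' = W + (alpha / r) * B @ A."""
    return W + (alpha / r) * (B @ A)

rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 8, 8, 2, 16
W = rng.standard_normal((d_out, d_in))
A = rng.standard_normal((r, d_in))   # down-projection, shape (r, d_in)
B = rng.standard_normal((d_out, r))  # up-projection, shape (d_out, r)
x = rng.standard_normal(d_in)

merged = merge_lora(W, A, B, alpha, r)
# adapter-on-the-side forward pass equals the merged forward pass
side = W @ x + (alpha / r) * (B @ (A @ x))
assert np.allclose(merged @ x, side)
```

This is also why quantization-first does not work: once W is quantized to 4-bit, the float low-rank update can no longer be folded in exactly, so the tooling insists on merge-then-quantize.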
-
- [x] I am using the latest TensorFlow Model Garden release and TensorFlow 2.
- [x] I am reporting the issue to the correct repository. (Model Garden official or research directory)
- [x] I checke…
-
```
Traceback (most recent call last):
  File "/home/admin/workspace/aop_lab/app_source/run_gptq.py", line 89, in <module>
    model = AutoGPTQForCausalLM.from_pretrained(args.model_name_or_path, quantize_confi…
```
-
As noted in the README, the results on ImageNet are not as good as those reported in the paper.
Can you tell me how large the accuracy difference is?
-
Good to see you. I'm a newbie.
I am using an Apple M2 laptop and want to try training a model with lora.py. If I run the following:
```bash
python lora.py --train --model Qwen/Qwen2-0.5B-…
-
Hi there,
According to the documentation
https://github.com/analogdevicesinc/ai8x-training#quantization-aware-training-qat
we can use either QAT or post-training quantization, but can I use both of them? If …
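Conceptually the two can compose: QAT leaves you with float weights that already sit on (or very near) the quantization grid, so applying post-training quantization afterwards just snaps them to that grid and, with matching scales, changes nothing. A small NumPy sketch of that idempotence property (illustrative only, not ai8x-training's implementation):

```python
import numpy as np

def ptq(x, scale):
    # post-training quantization: snap values to the int8 grid, keep float form
    return np.clip(np.round(x / scale), -127, 127) * scale

rng = np.random.default_rng(1)
w = rng.standard_normal(32)
scale = np.abs(w).max() / 127.0

w_qat = ptq(w, scale)        # stand-in for weights produced by QAT's fake-quant
w_final = ptq(w_qat, scale)  # applying PTQ afterwards changes nothing
assert np.array_equal(w_qat, w_final)
```

The caveat is scale mismatch: if the PTQ pass recomputes scales differently from the ones used during QAT, the second rounding does move weights, and you lose part of the benefit of training with quantization in the loop.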
-
Hi,
Could you please give more precise details on how many GPUs and how much memory are needed to run inference? For training, I'm assuming based on the README that it's one A100-SXM-80…