-
I want to quantize the CodeQwen model using a custom dataset, but all of my sample lengths exceed 512. Why doesn't AWQ support samples longer than 512 tokens? Are there any alternative methods for quan…
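One workaround, assuming the calibration pipeline simply drops over-length samples, is to split each long sample into chunks of at most 512 tokens before passing them in as calibration data. A minimal sketch with a generic tokenizer interface (the `tokenize`/`detokenize` callables are assumptions about your tokenizer's API, not part of AutoAWQ):

```python
def chunk_sample(token_ids, max_len=512):
    """Split one tokenized sample into consecutive chunks of at most max_len tokens."""
    return [token_ids[i:i + max_len] for i in range(0, len(token_ids), max_len)]

def build_calib_chunks(samples, tokenize, detokenize, max_len=512):
    """Tokenize each text sample, chunk it, and decode chunks back to text."""
    chunks = []
    for text in samples:
        for ids in chunk_sample(tokenize(text), max_len):
            chunks.append(detokenize(ids))
    return chunks
```

The resulting list of short texts could then be passed as the calibration dataset to the quantizer; whether any accuracy is lost by cutting samples at chunk boundaries would need to be checked empirically.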
-
Not sure if this is feasible, but I would love to see a modified save-checkpoint node for this somehow, so we can save GGUF or EXL2 checkpoints for merging LoRAs into a GGUF or EXL2 checkpoint directly.
Check o…
-
Hi everyone,
I'm trying to quantize the YOLOv5n model from [here](https://github.com/ultralytics/yolov5). I'm using the Vitis-AI v3.0 Docker image with the following code:
```
import pytorch_nndct
i…
-
I have fine-tuned Llama 3.1 using Unsloth. Then I merged and unloaded the LoRA model and pushed it to the Hub.
Now, when I tried quantizing it using:
```
from awq import AutoAWQForCausalLM
qua…
-
I am having trouble running the latest Llama 3.1 on OpenVINO. I am trying to use optimum-intel to convert the new model, but I always fail with an error. It would be great to have 3.1 already quanti…
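For reference, optimum-intel ships a CLI that exports a Hugging Face model straight to OpenVINO IR with weight compression; a sketch of the invocation (the model ID, weight format, and output directory here are assumptions about your setup, not taken from the report):

```shell
# Requires: pip install optimum[openvino]
# Exports the model to OpenVINO IR with int4 weight compression.
optimum-cli export openvino \
  --model meta-llama/Llama-3.1-8B-Instruct \
  --weight-format int4 \
  llama-3.1-8b-ov-int4
```

If the export itself fails, the full traceback from this command is usually what the maintainers need to diagnose the issue.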
-
In Colab, I ran some tests quantizing some AI models.
Once quantized, the files are usually between 4 and 7 GB.
The only thing that seems to work is to move them temporarily to Google Drive and th…
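For the Drive step, a buffered streaming copy keeps memory flat even for multi-GB files; a minimal stdlib sketch (the paths in the comment are placeholders for a mounted Drive folder):

```python
import shutil

def copy_large_file(src, dst, buf_mb=16):
    """Stream-copy a large file in fixed-size buffers so memory use stays flat."""
    with open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout, length=buf_mb * 1024 * 1024)

# e.g. copy a quantized model into a mounted Drive folder:
# copy_large_file("model-q4.gguf", "/content/drive/MyDrive/model-q4.gguf")
```

An alternative worth considering is uploading straight to the Hugging Face Hub with `huggingface_hub`'s upload helpers, which stream from disk and skip the Drive detour entirely.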
-
Hello,
Is it possible to load a LoRA model, i.e. a PEFT model with an adapter such as Alpaca-LoRA (https://github.com/tloen/alpaca-lora)?
There is a script there to add the PEFT weights to the model, but it d…
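Conceptually, folding a LoRA adapter into the base weights is just W' = W + (alpha/r)·(B @ A), after which the adapter matrices can be discarded and the model saved as a plain checkpoint. A minimal NumPy sketch of that update (shapes and scaling follow the standard LoRA convention; this is an illustration, not the alpaca-lora script's code):

```python
import numpy as np

def merge_lora(W, A, B, alpha, r):
    """Fold a LoRA adapter into a base weight matrix.

    W: (out, in) base weights; A: (r, in) and B: (out, r) low-rank factors.
    The merged matrix computes W @ x + (alpha / r) * B @ A @ x in one matmul.
    """
    return W + (alpha / r) * (B @ A)
```

In the PEFT library, this merge-then-discard step is what `merge_and_unload()` performs on a loaded adapter.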
-
When quantizing DeepSeek Coder models, the tokenizer.json file seems to be throwing an error. This wasn't an issue previously.
Cross posting from [here](https://github.com/ggerganov/llama.cpp/issue…
-
The goal of this ticket is to track support for unknown scales and zero-points. This is required to represent, in a StableHLO graph, scales and zero-points that are calculated on the fly by the training pr…
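For context, the scale and zero-point in question are the usual affine-quantization parameters, derived from a tensor's observed min/max rather than fixed ahead of time. A minimal sketch of that on-the-fly calculation (plain Python for illustration, not StableHLO; the 8-bit asymmetric scheme is an assumption):

```python
def affine_qparams(xmin, xmax, num_bits=8):
    """Compute (scale, zero_point) for asymmetric quantization to [0, 2^bits - 1]."""
    qmin, qmax = 0, (1 << num_bits) - 1
    xmin, xmax = min(xmin, 0.0), max(xmax, 0.0)  # range must include zero
    scale = (xmax - xmin) / (qmax - qmin)
    zero_point = round(qmin - xmin / scale) if scale else 0
    return scale, int(min(max(zero_point, qmin), qmax))

def quantize(x, scale, zero_point, num_bits=8):
    """Map a real value to its integer code, clamping to the representable range."""
    q = round(x / scale) + zero_point
    return min(max(q, 0), (1 << num_bits) - 1)

def dequantize(q, scale, zero_point):
    """Recover the real value approximated by integer code q."""
    return scale * (q - zero_point)
```

Since these parameters are only known once the data has been seen, the graph has to carry them as runtime values rather than compile-time constants, which is what "unknown scales and zero-points" refers to here.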