-
Hi! I'm trying to reproduce the benchmark [results](https://github.com/pytorch/ao/tree/main/torchao/quantization#benchmarks) using torchao/_models/llama/generate.py. However, I cannot benchmark the quanti…
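For context, outside of generate.py I can apply the torchao quantization API directly. Here is a minimal sketch of what I mean (the `quantize_` / `int8_weight_only` names are from the torchao quantization README; the toy model is only for illustration, not the Llama benchmark setup):
```
import torch
from torchao.quantization import quantize_, int8_weight_only

# Toy stand-in model; the benchmark script applies quantization to the Llama
# checkpoint it loads. bfloat16 is what the torchao examples use.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 1024),
).to(dtype=torch.bfloat16).eval()

quantize_(model, int8_weight_only())  # swaps Linear weights for int8 tensors in place

x = torch.randn(2, 1024, dtype=torch.bfloat16)
with torch.no_grad():
    print(model(x).shape)  # torch.Size([2, 1024])
```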
-
Error while quantizing pretrained_model_dir = "tiiuae/falcon-7b":
2023-07-18 10:48:21 INFO [auto_gptq.modeling._base] Quantizing mlp.dense_4h_to_h in layer 2/32...
Traceback (most recent call la…
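For reference, this is roughly the flow I'm following, a minimal sketch based on the AutoGPTQ README example (the calibration text and output directory are placeholders, and I'm not certain `trust_remote_code=True` is the right setting for this Falcon checkpoint):
```
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

pretrained_model_dir = "tiiuae/falcon-7b"
quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)

tokenizer = AutoTokenizer.from_pretrained(pretrained_model_dir, use_fast=True)
model = AutoGPTQForCausalLM.from_pretrained(
    pretrained_model_dir, quantize_config, trust_remote_code=True
)

# Placeholder calibration example; a real run would use a proper calibration set.
examples = [tokenizer("The quick brown fox jumps over the lazy dog.", return_tensors="pt")]

model.quantize(examples)
model.save_quantized("falcon-7b-4bit-gptq")
```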
-
This tool is amazing. Having tried scripting with the coreml library by hand and running into all kinds of fun issues, then trying this and having it all orchestrated/abstracted for you, this is excellen…
-
Hi! I'm trying to run the Q4_K_M quantization of Meta-Llama-3-8B-Instruct on my Mac (M2 Pro, 16GB VRAM) using llama-cpp-python, with the following test code:
```
from llama_cpp import Llama
llm4 …
```
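For reference, here is a minimal, self-contained sketch of loading a Q4_K_M GGUF with llama-cpp-python (the model path is a placeholder, and the `n_ctx` / `n_gpu_layers` values are only illustrative, not tuned for an M2 Pro):
```
from llama_cpp import Llama

# Placeholder path to the downloaded GGUF file.
llm = Llama(
    model_path="./Meta-Llama-3-8B-Instruct.Q4_K_M.gguf",
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload all layers to Metal on Apple Silicon
    verbose=False,
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```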
-
I am working on quantizing a resnet50 model. I tried to use the following command:
```
quantized_model = torch.quantization.quantize_dynamic(
    resnet18, {torch.nn.Conv2d, torch.nn.Linear}, dtype=…
```
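For comparison, here is a minimal sketch of dynamic quantization that runs end to end, using resnet18 from torchvision as in the snippet above (note that, as far as I know, `quantize_dynamic` only has default support for module types such as `nn.Linear` and `nn.LSTM`, so `Conv2d` layers are left in float):
```
import torch
from torchvision.models import resnet18  # assumes a recent torchvision

model = resnet18(weights=None).eval()

# Only module types with a default dynamic qconfig are converted; for a ResNet
# that is effectively just the final fc layer, while the Conv2d layers stay float.
quantized_model = torch.quantization.quantize_dynamic(
    model,
    {torch.nn.Linear},   # module types to dynamically quantize
    dtype=torch.qint8,   # int8 weights, activations quantized on the fly
)

print(quantized_model.fc)  # DynamicQuantizedLinear(...)
```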
-
I noticed you're using quantizing/hashing to determine when to weld nearly coincident vertices in the Combine method.
Am I right in thinking that this would incorrectly overlook vertices that were …
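For illustration only (this is not the Combine implementation, just a sketch of the concern): two vertices can be within the weld tolerance yet quantize to different grid cells, so a single-cell hash lookup misses them unless neighboring cells are also checked.
```
from collections import defaultdict
from itertools import product

def weld(vertices, tol=1e-5):
    """Weld vertices closer than tol by quantizing positions to a grid."""
    cell = defaultdict(list)  # grid cell -> indices of kept vertices
    kept, remap = [], []
    for v in vertices:
        key = tuple(int(round(c / tol)) for c in v)
        match = None
        # Search this cell and all 26 neighbours; without the neighbour search,
        # near-coincident vertices straddling a cell boundary are never welded.
        for offset in product((-1, 0, 1), repeat=3):
            nkey = tuple(k + o for k, o in zip(key, offset))
            for idx in cell[nkey]:
                if all(abs(a - b) <= tol for a, b in zip(kept[idx], v)):
                    match = idx
                    break
            if match is not None:
                break
        if match is None:
            match = len(kept)
            kept.append(v)
            cell[key].append(match)
        remap.append(match)
    return kept, remap

# These two points differ by 2e-7 (well under tol) but quantize to cells 0 and 1,
# so a naive single-cell lookup would keep both instead of welding them.
verts = [(0.0000049, 0.0, 0.0), (0.0000051, 0.0, 0.0)]
print(weld(verts))  # one kept vertex, remap == [0, 0]
```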
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
Expo…
-
Hi @wondervictor, I changed the associated config, checkpoint, and img-size in export_onnx.py.
![image](https://github.com/AILab-CVC/YOLO-World/assets/59815166/a9320cc6-19dc-469b-9136-211031244de2)
…
-
Hi,
My original trained file (.bin) is 318 MB.
After quantizing, it was reduced to 250 MB.
Is this much reduction in file size expected, or is there a possibility of further reduction in size? …
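For a rough sanity check, here is the back-of-envelope arithmetic I would use (this assumes the original .bin stores float32 weights and that weights dominate the file size; both are assumptions on my part):
```
def estimate_size_mb(n_params: float, bytes_per_weight: float, overhead_mb: float = 0.0) -> float:
    """Very rough file-size estimate: parameters * bytes per weight + fixed overhead."""
    return n_params * bytes_per_weight / 1e6 + overhead_mb

# A 318 MB float32 file implies roughly 318e6 / 4 ≈ 80M parameters.
n_params = 318e6 / 4
print(estimate_size_mb(n_params, 4))  # ~318 MB, original float32
print(estimate_size_mb(n_params, 1))  # ~80 MB if every weight were stored as int8
```
Under those assumptions, a drop to only 250 MB would suggest that a large share of the tensors (embeddings, certain layer types, or metadata) were left unquantized.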
-
I'm very confused about the merging step. In Appendix B, the proof is solid; however, there is no guarantee that the new matrix B is in integer format. In standard linear quantization, zeros are represente…
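For reference, this is the standard linear (asymmetric/affine) scheme I have in mind, where the zero-point is stored as an integer so that real zero is representable exactly; the scale and zero-point values below are made up for illustration:
```
import numpy as np

def quantize(x, s, z, qmin=0, qmax=255):
    """Affine quantization: q = clamp(round(x / s) + z, qmin, qmax), with integer z."""
    q = np.clip(np.round(x / s) + z, qmin, qmax)
    return q.astype(np.uint8)

def dequantize(q, s, z):
    """Dequantization: x_hat = s * (q - z)."""
    return s * (q.astype(np.float32) - z)

x = np.array([-0.5, 0.0, 0.25, 1.0], dtype=np.float32)
s, z = 1.5 / 255, 85               # scale and integer zero-point for the range [-0.5, 1.0]
q = quantize(x, s, z)
print(q)                           # [  0  85 127 255]
print(dequantize(q, s, z))         # real 0.0 maps exactly to q == z == 85 and back to 0.0
```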