quantizing Search Results

1000+ results
for quantizing

NVIDIA/TensorRT-LLM #1636

INT4 AWQ quantization fails for Llama 2 7B & 13B with higher…

### System Info - TensorRT-LLM v0.9.0 - Nvidia A10G ### Who can help? @Tracin ### Information - [X] The official example scripts - [ ] My own modified scripts ### Reproduction …

ethnzhng updated 4 months ago
1
QwenLM/Qwen2.5 #375

Will Qwen1.5-110B-GPTQ-Int8 be released?

Hello, is there any plan to release a GPTQ Int8 quantized version of the 110B model? Thanks for the great job open-sourcing Qwen1.5!

baiyongrui updated 4 months ago
2
QwenLM/Qwen #1090

[BUG] <title>When quantifying the trained Qwen Chat-7B model…

### Is there an existing issue / discussion for this? - [X] I have searched the existing issues / discussions ### Is there an existing ans…

anyiz updated 6 months ago
2
ocrmypdf/OCRmyPDF #293

file size increase for pdf/a

OCRmyPDF is really marvelous! Thanks! I have one question regarding output file size: Unless explicitly selecting pdf as output type, I have quite large file sizes (~4x) after "ocrmypdf in.pdf out.…

femifrak updated 3 months ago
11
huggingface/optimum-quanto #180

Potential readme issue - falls back to original dtype, not f…

In the docs, it says that when quantizing to anything other than int8, many operations will fall back to fp32. However, looking through the code (and inserting some print lines) it seems like it ac…

calmitchell617 updated 5 months ago
3
mobiusml/hqq #113

Question about Quantization

Hey, it's me again! 😆 I've done testing on the HQQ pre-trained model inside the Linux system, and it is working well with the custom transformer code you gave me. Now, I want to test the quantization …

NEWbie0709 updated 2 months ago
4
Xilinx/finn #1174

Preprocessing Quant: InferThresholdingLayer Not inferring Th…

## Quick summary ![_tmp_finn_dev_rootmin_video_streamlined_merged_and_ready onnx](https://github.com/user-attachments/assets/d510f4be-978c-4849-ad82-c47019d28737) running this code ```python…

0BAB1 updated 1 month ago
9
drChungus/VCSequentialSwitch #10

DG412 version substitution

Hello, it seems there's a [

extremenerble updated 4 months ago
2
ultralytics/ultralytics #4120

YOLO-NAS predict fail

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and f…

huynhhoanghuy updated 2 weeks ago
7
ibm-granite-community/pm #23

Make Granite Code available via Ollama

adampingel updated 2 months ago
9
