-
**Describe the solution you'd like**
I found that the latest release of TensorRT 8.0 supports int8 quantization on GPU, which greatly accelerates inference speed.
And now onnxruntime is …
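For reference, here is a minimal sketch of int8 post-training quantization with onnxruntime's own quantization utilities; the model paths are placeholders, not taken from this issue.

```python
# Minimal sketch: dynamic int8 quantization with onnxruntime's quantization tooling.
# "model_fp32.onnx" / "model_int8.onnx" are placeholder paths, not from the issue.
from onnxruntime.quantization import quantize_dynamic, QuantType

quantize_dynamic(
    model_input="model_fp32.onnx",   # original float32 ONNX model
    model_output="model_int8.onnx",  # quantized model written here
    weight_type=QuantType.QInt8,     # quantize weights to signed int8
)
```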
-
When trying dipoorlet PTQ quantization on an ONNX model exported from torch, I got the error: ValueError: cannot reshape array of size 172800 into shape (0,0,3,180,320).
1. dynamic_axes was specified when exporting with torch.onnx.export(), as follows:
```
torch.onnx.export(
…
```
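For context, a minimal sketch of an export that uses dynamic_axes; the model, tensor names, and shapes below are placeholders rather than the original code. Axes marked dynamic are stored symbolically in the ONNX graph, and tools that read them back as fixed sizes get 0, which would be consistent with the leading zeros in the reshape error above.

```python
# Minimal sketch of an export with dynamic_axes; the model, names, and shapes
# are placeholders, not the original code from this issue.
import torch

model = torch.nn.Conv2d(3, 8, 3)        # stand-in model
dummy = torch.randn(1, 3, 180, 320)     # stand-in example input

torch.onnx.export(
    model,
    dummy,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    # Axes marked dynamic here are stored symbolically in the ONNX graph;
    # tools that treat them as concrete integer sizes may read them as 0.
    dynamic_axes={"input": {0: "batch", 2: "height", 3: "width"},
                  "output": {0: "batch"}},
    opset_version=13,
)
```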
-
Can it be used with SD 1.5, and can it be combined with other acceleration methods such as ByteDance/Hyper-SD?
-
First of all, thanks for generously open-sourcing this. The toolchain is very complete, covering everything from training to deployment, with broad support. But I noticed there is no 8-bit quantization. Have you tried it, and does quantization cause a loss of accuracy?
-
I tried to run the exported ONNX file on both an RTX 3070 and an RTX 4090, but I cannot see any speed improvement (it is even slower than the unquantized model). Here is the warning from onnxruntime:
`2024-09-20 19:58:0…
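A minimal timing sketch with an explicit provider list, assuming onnxruntime-gpu is installed; the model path, input shape, and iteration counts are placeholders. Checking `sess.get_providers()` shows whether the session actually runs on CUDA or has fallen back to CPU.

```python
# Minimal timing sketch, assuming onnxruntime-gpu is installed.
# "model_int8.onnx" and the input shape are placeholders, not from the issue.
import time
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession(
    "model_int8.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
print(sess.get_providers())  # confirms whether CUDA is actually being used

x = np.random.rand(1, 3, 224, 224).astype(np.float32)
inputs = {sess.get_inputs()[0].name: x}

for _ in range(10):          # warm-up runs
    sess.run(None, inputs)

start = time.perf_counter()
for _ in range(100):
    sess.run(None, inputs)
print((time.perf_counter() - start) / 100, "seconds per run")
```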
-
### 1. System information
- Occurs in Google Colab w/ TF 2.14
- Have also verified with TF 2.7 (Anaconda) on Windows 10
### 2. Code
[Colab to reproduce issue](https://colab.research.google.com…
-
It was announced that there will be a Team Trios Pro Tour in 2018, as well as an increase in team Grand Prix and team PTQs. It seems like it would be a great option to have on xmage.
Team Construct…
-
## Motivation
1. To design and implement a better quantization component of MMRazor together with the community.
1. To collect more requirements and suggestions before releasing quantization, by way of this RFC (Request for C…
-
## **Summary**
This is a design discussion RFC for contributing some device-agnostic compression algorithms, such as post-training quantization (QDQ quant format) and structural sparsity supported …
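For reference, a minimal sketch of QDQ-format post-training quantization as exposed by onnxruntime's quantize_static; the calibration reader, paths, and shapes are placeholders and not part of this RFC.

```python
# Minimal QDQ-format PTQ sketch; the data reader, paths, and shapes are
# placeholders and not part of the RFC itself.
import numpy as np
from onnxruntime.quantization import (
    CalibrationDataReader, QuantFormat, QuantType, quantize_static,
)

class RandomCalibrationReader(CalibrationDataReader):
    """Feeds a few random batches as calibration data (stand-in only)."""
    def __init__(self, input_name="input", n_batches=8):
        self.data = iter(
            [{input_name: np.random.rand(1, 3, 224, 224).astype(np.float32)}
             for _ in range(n_batches)]
        )

    def get_next(self):
        return next(self.data, None)

quantize_static(
    model_input="model_fp32.onnx",
    model_output="model_int8_qdq.onnx",
    calibration_data_reader=RandomCalibrationReader(),
    quant_format=QuantFormat.QDQ,      # insert QuantizeLinear/DequantizeLinear pairs
    activation_type=QuantType.QUInt8,
    weight_type=QuantType.QInt8,
)
```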
-
Hi, maintainer of the `swi-prolog` Arch Linux package here.
When upgrading the package from 9.2.4 to 9.2.7, one of the tests stopped working:
```
ctest --test-dir build --output-on-failure
Int…
```