-
## ❓ Question
When I'm not using TensorRT, I run my model through an FX interpreter that times each `call` op (by inserting CUDA events before and after it and measuring the elapsed time). I'd like to do so…
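The per-op timing pattern described above can be sketched as follows. This is a minimal, hypothetical stand-in for `torch.fx.Interpreter` using a CPU wall-clock timer; on GPU you would instead bracket each op with `torch.cuda.Event(enable_timing=True)` pairs and synchronize before reading the elapsed time, since CUDA kernel launches are asynchronous. The class and op names here are illustrative, not from the original post.

```python
import time

class TimingInterpreter:
    """Sketch of per-op timing: run each op in a traced graph in order,
    recording the elapsed time per op. On CUDA you would record a start
    event before the op, an end event after it, synchronize, and read
    start_event.elapsed_time(end_event) instead of perf_counter()."""

    def __init__(self, ops):
        # ops: list of (name, callable) pairs standing in for graph nodes
        self.ops = ops
        self.timings = {}

    def run(self, x):
        for name, op in self.ops:
            start = time.perf_counter()  # CPU analogue of a CUDA start event
            x = op(x)
            self.timings[name] = time.perf_counter() - start
        return x

# Usage: time two toy "ops"
interp = TimingInterpreter([("double", lambda v: v * 2), ("inc", lambda v: v + 1)])
result = interp.run(20)
print(result)  # 41
```

In the real FX version, the timing insertion would live in an overridden `run_node`, so every node is timed without modifying the graph itself.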
-
## Bug Description
I performed int8 quantization on resnet50 following the reference test demo ( https://github.com/pytorch/TensorRT/tree/master/tests/py/ptq ) and compared the inference result with the origin…
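A common way to make the comparison above concrete is to measure how far the quantized model's outputs drift from the fp32 reference, e.g. via max absolute error and cosine similarity. The sketch below is a pure-stdlib illustration under my own assumptions (the function name and toy logits are hypothetical); with real models you would flatten the two output tensors first.

```python
import math

def compare_outputs(ref, quant):
    """Compare a reference (fp32) output vector against a quantized model's
    output: returns (max absolute error, cosine similarity)."""
    max_abs_err = max(abs(r - q) for r, q in zip(ref, quant))
    dot = sum(r * q for r, q in zip(ref, quant))
    norm = (math.sqrt(sum(r * r for r in ref))
            * math.sqrt(sum(q * q for q in quant)))
    cosine = dot / norm if norm else 0.0
    return max_abs_err, cosine

# Toy example: int8 outputs should track the fp32 logits closely
fp32_logits = [0.1, 2.0, -1.5, 0.7]
int8_logits = [0.12, 1.95, -1.48, 0.69]
err, cos = compare_outputs(fp32_logits, int8_logits)
print(round(err, 3))  # 0.05
```

A large max-error or a cosine similarity well below 1.0 usually points at a calibration problem rather than expected int8 rounding noise.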
-
**Describe the bug**
With timm 0.6.7, using timm's resnet50, I first convert the model to ONNX and then use TensorRT's model PTQ quantization function; the quantized model is verified on the val (50…
-
I followed this [qat-ptq-workflow notebook](https://github.com/NVIDIA/TensorRT/blob/release/8.6/quickstart/quantization_tutorial/qat-ptq-workflow.ipynb) and converted q_model() to ONNX format. I want to use the Python API to convert the ONNX …
-
I am trying to use the [coral.ai USB Accelerator](https://coral.ai/products/accelerator) within a [Proxmox](https://www.proxmox.com/proxmox-virtual-environment) VM using the docker command:
`sudo docker …
-
I'm trying to quantize the yolox-l model and convert it to int8. However, after I convert to the int8 ONNX version and then to an engine, fp16 is faster than the int8 version. Can you take a look at my ONNX? This onnx…
-
### Describe the Bug
### Error information
PR that introduced the error: https://github.com/PaddlePaddle/Paddle/pull/50915
Case location: https://github.com/PaddlePaddle/PaddleTest/tree/develop/inference/python_api_test/test_nlp_…
-
I have tried [this official example of SmoothQuant alpha auto-tuning](https://github.com/intel/neural-compressor/tree/master/examples/pytorch/nlp/huggingface_models/language-modeling/quantization/ptq…
-
Question: PTQ_static.sh in PaddleSpeech/examples/csmsc/voc5 fails to run.
While running run.sh in PaddleSpeech/examples/csmsc/voc5, the program errors out at stage=3.
Corresponding command:
![image](https://user-images.githubusercontent.com/68834517/223012996-b…
-
## Bug Description
When attempting to use TRT to PTQ-quantize a 2B-parameter GPT-Neo model, I keep encountering the following error message:
`[executionContext.cpp::commonEmitDebugTensor::1269…