-
## Description
After pytorch_quantization QAT, the model's accuracy drops relative to before pytorch_quantization QAT.
## Environment
**TensorRT Version**: 8.5.3.1
**NVIDIA GPU**: TIT…
-
### Description of the Feature:
Torch QAT supports three modes:
* Eager Mode Quantization
* FX Graph Mode Quantization
* PyTorch 2 Export Quantization
Details in https://pytorch.org/docs/stabl…
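At the core of all three modes is the same fake-quantization step: tensors are rounded onto an integer grid and immediately dequantized inside the forward pass, so the network trains against its own quantization error. A minimal pure-Python sketch of symmetric per-tensor int8 fake quantization (illustrative only, no torch dependency; torch's observers compute the scale similarly from a calibrated absolute max):

```python
def fake_quantize(x, num_bits=8):
    """Symmetric per-tensor fake quantization: round values onto a signed
    integer grid, then immediately dequantize. This is the simulated
    rounding that QAT inserts into the forward pass."""
    qmax = 2 ** (num_bits - 1) - 1            # 127 for int8
    amax = max(abs(v) for v in x) or 1.0      # "calibration": absolute max
    scale = amax / qmax
    q = [max(-qmax - 1, min(qmax, round(v / scale))) for v in x]
    return [v * scale for v in q]

x = [0.1, -0.53, 1.27, 0.0]
xq = fake_quantize(x)
# every element of xq lies on a grid of step amax/127,
# so the per-element error is bounded by scale/2
```

The point of training with this in the graph is that the weights adapt to the grid, which is why a QAT model should (but does not always) recover the accuracy lost by post-training quantization.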
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
_No …
-
### System Info
torch 2.4.1 / diffusers 0.30.2 / Ubuntu 22.04.4 LTS / CUDA driver 12.6
### Information
- [X] The official example scripts
- [ ] My own modified scripts …
-
I tested the `torchao.quantize_` and `torchao.autoquant` for the first time on a toy model and wanted to summarize my usability/readability comments, in case that is helpful.
# torchao.quantize_ fe…
vkuzo updated 2 weeks ago
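For context, `torchao.quantize_` with an int8 weight-only config conceptually replaces each linear weight with a per-channel int8 approximation plus one scale per output channel. A hedged pure-Python sketch of that round-trip (the real implementation uses tensor subclasses and fused kernels; this shows only the arithmetic):

```python
def quantize_weight_int8(rows):
    """Per-channel (per-output-row) symmetric int8 quantization of a
    weight matrix, returning integer values and one scale per row."""
    q_rows, scales = [], []
    for row in rows:
        amax = max(abs(v) for v in row) or 1.0
        scale = amax / 127
        q_rows.append([max(-128, min(127, round(v / scale))) for v in row])
        scales.append(scale)
    return q_rows, scales

def dequantize(q_rows, scales):
    # reconstruct the float weights the quantized matmul effectively sees
    return [[q * s for q in row] for row, s in zip(q_rows, scales)]

w = [[0.5, -0.25, 0.75], [1.0, 0.1, -0.9]]
qw, s = quantize_weight_int8(w)
w_hat = dequantize(qw, s)
```

Per-channel scales keep the reconstruction error of each row bounded by half that row's scale, which is why weight-only int8 is usually close to lossless.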
-
https://github.com/intel/neural-compressor/blob/master/docs/source/quantization_weight_only.md#examples
How do I set `eval_func`?
https://github.com/intel/neural-compressor/blob/master/examples/3…
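As far as I can tell from the linked docs, `eval_func` is simply a callable that receives the model under evaluation and returns one scalar metric (higher is better), which the tuner compares against the accuracy criterion. A toy sketch of that contract (the model and data below are illustrative stand-ins, not from the INC examples):

```python
# Toy validation set: (input, label) pairs. In practice this would be
# your real validation loader.
VAL_DATA = [(0.9, 1), (-0.4, 0), (0.2, 0), (-0.7, 0)]

def toy_model(x):
    # stand-in for model inference: predict class 1 if x > 0
    return 1 if x > 0 else 0

def eval_func(model):
    """Contract expected for eval_func: take the (quantized) model,
    run evaluation, and return a single higher-is-better float."""
    correct = sum(1 for x, y in VAL_DATA if model(x) == y)
    return correct / len(VAL_DATA)

acc = eval_func(toy_model)  # 0.75: three of the four toy predictions match
```

The tuner calls this function on each candidate quantized model, so anything deterministic that returns a comparable scalar should work.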
-
I have tried to quantize a model by following the guide ([PyTorch Quantization — Model Optimizer 0.15.0](https://nvidia.github.io/TensorRT-Model-Optimizer/guides/_pytorch_quantization.html)), and I ca…
-
I found the statement **3. Better support for vision transformers.** at https://nvidia.github.io/TensorRT-Model-Optimizer/guides/_onnx_quantization.html.
I'm working on quantizing ViT n…
-
A lot of the code for tensor subclasses can likely be consolidated into a base class that other classes can utilize.
_get_to_kwargs:
https://github.com/pytorch/ao/blob/main/torchao/dtypes/affin…
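As a sketch of the consolidation, a shared base class could own the `.to(...)` argument plumbing that each subclass currently duplicates. The class name and defaults below are hypothetical, and the logic is a plain-Python approximation of normalizing positional/keyword device/dtype arguments:

```python
class AQTBase:
    """Hypothetical shared base for tensor subclasses: owns the common
    plumbing (here, .to(...) argument normalization) so each subclass
    doesn't reimplement it."""
    device = "cpu"
    dtype = "float32"

    def _get_to_kwargs(self, *args, **kwargs):
        # Normalize .to(...) positional/keyword args into one kwargs
        # dict, defaulting to the tensor's current device/dtype.
        out = {"device": self.device, "dtype": self.dtype}
        for a in args:
            if isinstance(a, str) and a in ("cpu", "cuda"):
                out["device"] = a
            else:
                out["dtype"] = a
        out.update({k: v for k, v in kwargs.items() if v is not None})
        return out

t = AQTBase()
t._get_to_kwargs("cuda")  # -> {'device': 'cuda', 'dtype': 'float32'}
```

With this in place, subclasses would only override the parts that actually differ (layout, packing), inheriting the argument handling unchanged.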
-
I am accelerating a custom PyTorch network using Vitis-AI. After following the steps below, the model is quantized and the .xmodel is compiled; however, the model's accuracy takes a huge hit going …