network-quantization Search Results

1000+ results for network-quantization

Best match

pytorch/ao #658

Self compressing neural networks

Self-Compressing Neural Networks is dynamic quantization-aware training that puts the size of the model in the loss Paper: https://arxiv.org/pdf/2301.13142 Code: https://github.com/geohot/ai-noteb…

msaroufim updated 2 weeks ago
1
openvinotoolkit/openvino #26380

[Performance]: The quantized full-connected network has no s…

### OpenVINO Version 2024.2.0-15519-5c0f38f83f6-releases/2024/2 ### Operating System Ubuntu 22.04 (LTS) ### Device used for inference CPU ### OpenVINO installation PyPi ### Programming Languag…

eekarot updated 2 months ago
1
hustvl/PD-Quant #9

Why are you disabling network output quantization?

Hi, could you please give more details on why you are disabling network output quantization? https://github.com/hustvl/PD-Quant/blob/main/main_imagenet.py#L212 The other question is about S…

padeirocarlos updated 10 months ago
1
NVIDIA/TensorRT-Model-Optimizer #64

conver to trt error

I use modelopt QAT on my model:
```
import modelopt.torch.quantization as mtq

# Select quantization config
config = mtq.INT8_DEFAULT_CFG

# Define forward loop for calibration
def forward_loop(model): …
```

steven-spec updated 2 months ago
3
tensorflow/tensorflow #61410

TfLite ResizeInputTensor does not resize Transposed Convolut…

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? No ### Source source ### TensorFlow version 2.12 ### Custom code Yes ### OS platform and distr…

jackprescott updated 5 hours ago
2
opensearch-project/neural-search #991

[FEATURE] Quantization processor in ingest pipeline

### Is your feature request related to a problem? After documents are ingested by **text_embedding** processor, an array of float32 type per **knn_vector** field is stored in segments.(hnsw or ivf) …

YeonghyeonKO updated 1 day ago
9
NVIDIA/TensorRT #4023

The KL divergence calculation is very slow and is not optimi…

## Description I tried to quote the following documents directly, tools/pytorch-quantization/pytorch_quantization/calib/histogram.py, and use HistogramCalibrator.compute_amax() to calculate the max…

yychen2000 updated 4 months ago
3
NVIDIA/TensorRT-LLM #2392

Qwen2-72B w4a8 empty output

### System Info GPU: 4090 Tensorrt: 10.3 tensorrt-llm: 0.13.0.dev2024081300 ### Who can help? @Tracin May you please have a look, thank you very much ### Information - [ ] The official example sc…

lishicheng1996 updated 2 weeks ago
4
Xilinx/Vitis-AI #1450

Resnet-50 test accuracy and loss of accuracy in DPU

Hi there, I am trying to run the Resnet-50 trained on imagenet that you give as an [example](https://github.com/Xilinx/Vitis-AI/tree/3.0/examples/vai_runtime/resnet50). I am using vitis3.0. T…

rattokiller updated 4 months ago
6
NVIDIA/TensorRT #4024

Quantization flow using TensorRT (what is recommended for CN…

I have commented the following in the ModelOpt issues, but since there is more activity here, I would like to get feedback on this subject from more people. First of all, if someone here has positi…

korkland updated 2 months ago
11
