-
Self-Compressing Neural Networks is dynamic quantization-aware training that puts the size of the model in the loss
Paper: https://arxiv.org/pdf/2301.13142
Code: https://github.com/geohot/ai-noteb…
-
### OpenVINO Version
2024.2.0-15519-5c0f38f83f6-releases/2024/2
### Operating System
Ubuntu 22.04 (LTS)
### Device used for inference
CPU
### OpenVINO installation
PyPi
### Programming Languag…
-
Hi, could you please give more details why are you disabling network output quantization?
[https://github.com/hustvl/PD-Quant/blob/main/main_imagenet.py#L212](url)
The other question is about S…
-
I use modelopt QAT my model:
```
import modelopt.torch.quantization as mtq
# Select quantization config
config = mtq.INT8_DEFAULT_CFG
# Define forward loop for calibration
def forward_loop(model):
…
-
### Issue type
Bug
### Have you reproduced the bug with TensorFlow Nightly?
No
### Source
source
### TensorFlow version
2.12
### Custom code
Yes
### OS platform and distr…
-
### Is your feature request related to a problem?
After documents are ingested by **text_embedding** processor, an array of float32 type per **knn_vector** field is stored in segments.(hnsw or ivf)
…
-
## Description
I tried to quote the following documents directly,tools/pytorch-quantization/pytorch_quantization/calib/histogram.py,and Use HistogramCalibrator.compute_amax() to calculate the max…
-
### System Info
GPU: 4090
Tensorrt: 10.3
tensorrt-llm: 0.13.0.dev2024081300
### Who can help?
@Tracin May you please have a look, thank you very much
### Information
- [ ] The official example sc…
-
Hi there,
I am trying to run the Resnet-50 adestrated on imagenet that you give as an [example](https://github.com/Xilinx/Vitis-AI/tree/3.0/examples/vai_runtime/resnet50), I am using vitis3.0.
T…
-
I have commented the following in the ModelOpt issues, but since there is more activity here, I would like to get feedback on this subject from more people.
First of all, if someone here has positi…