-
Hello @dusty-nv
I downloaded peoplnet directly from : https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/models/peoplenet. These are the contents of the downloaded folder :
labels.txt nvinfer_c…
-
## Description
I generated calibration cache for Vision Transformer onnx model using EntropyCalibration2 method. When trying to generate engine file using cache file for INT8 precision using trte…
-
### Description of the bug:
I downloaded the `microsoft/Phi-3.5-mini-instruct` from Hugging Face and ran the [convert_phi3_to_tflite.py](https://github.com/google-ai-edge/ai-edge-torch/blob/main/ai_…
-
The [current implementation of the SVDF layer](https://github.com/google-research/google-research/blob/master/kws_streaming/layers/svdf.py) doesn’t get fused as an SVDF operator when converted to TFLi…
-
python3 detectnet.py --model=peoplenet pedestrians.mp4 pedestrians_peoplenet.mp4
[gstreamer] initialized gstreamer, version 1.14.5.0
[gstreamer] gstDecoder -- creating decoder for pedestrians.mp4
O…
-
Hi everyone,
I faced the problem during the launching the YOLOv4 inference with INT8 precision on _RTX 3090_ GPU:
the _buildEngineWithConfig()_ method is very slow (had been running for 1.5 hours…
-
I am trying to quantize a [Wav2Lip](https://github.com/Rudrabha/Wav2Lip) PyTorch model. When I run the code using fbgemm backend. I run into the following error.
`AssertionError: Per channel weight…
-
### Description
Some inference API providers now support embedding models with each dimension defined as a single bit. For example, the v3 models from Cohere offer this capability. Since we already h…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
**Describe the bug**
Unable to optimize a model with device- cpu and precision int8. Ending up with KeyError: 'input_model' error
**To Reproduce**
Start with this example: https://github.com/micr…