int8-inference Search Results

1000+ results for int8-inference

Best match


dusty-nv/jetson-inference #1882

Running peoplenet with detectNet on JetPack 6

Hello @dusty-nv, I downloaded peoplenet directly from https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/models/peoplenet. These are the contents of the downloaded folder: labels.txt nvinfer_c…

AkshatJain-TerraFirma updated 3 months ago
3
NVIDIA/TensorRT #4068

Missing scale and zeropoint for lot of layers on calibrating…

## Description I generated a calibration cache for a Vision Transformer ONNX model using the EntropyCalibration2 method. When trying to generate an engine file from the cache file for INT8 precision using trte…

Shalini194 updated 2 months ago
14
google-ai-edge/ai-edge-torch #293

Error Using Converted Phi-3.5-mini TFLite in Android App

### Description of the bug: I downloaded the `microsoft/Phi-3.5-mini-instruct` from Hugging Face and ran the [convert_phi3_to_tflite.py](https://github.com/google-ai-edge/ai-edge-torch/blob/main/ai_…

chienhuikuo updated 1 week ago
13
google-research/google-research #1950

SVDF layer implementation incompatible with SVDF operator fr…

The [current implementation of the SVDF layer](https://github.com/google-research/google-research/blob/master/kws_streaming/layers/svdf.py) doesn’t get fused as an SVDF operator when converted to TFLi…

VictorDominguite updated 2 weeks ago
10
dusty-nv/jetson-inference #1759

Unable to run peoplenet model with detectnet program

python3 detectnet.py --model=peoplenet pedestrians.mp4 pedestrians_peoplenet.mp4 [gstreamer] initialized gstreamer, version 1.14.5.0 [gstreamer] gstDecoder -- creating decoder for pedestrians.mp4 O…

sai-ssauto updated 3 months ago
1
enazoe/yolo-tensorrt #159

INT8 engine building is too slow

Hi everyone, I ran into a problem when launching YOLOv4 inference with INT8 precision on an _RTX 3090_ GPU: the _buildEngineWithConfig()_ method is very slow (had been running for 1.5 hours…

victor-yudin updated 2 years ago
2
pytorch/FBGEMM #1576

AssertionError: Per channel weight observer is not supported…

I am trying to quantize a [Wav2Lip](https://github.com/Rudrabha/Wav2Lip) PyTorch model. When I run the code using the fbgemm backend, I run into the following error: `AssertionError: Per channel weight…

qaixerabbas updated 1 month ago
3
elastic/elasticsearch #111747

Support for bit precision in the Inference API text_embeddin…

### Description Some inference API providers now support embedding models with each dimension defined as a single bit. For example, the v3 models from Cohere offer this capability. Since we already h…

jimczi updated 2 months ago
2
ultralytics/ultralytics #14127

tensorRT export type question

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…

hongin13 updated 2 months ago
7
microsoft/Olive #1283

Getting KeyError: 'input_model' when trying to optimize whis…

**Describe the bug** Unable to optimize a model with device cpu and precision int8. It ends with a KeyError: 'input_model' error. **To Reproduce** Start with this example: https://github.com/micr…

mram0509 updated 3 months ago
1

