-
I am encountering a size mismatch error while loading a BigDL-LLM INT8 model (PyTorch) in IPEX. The sample inference code is provided below. How can I correctly load the model in IPEX?
![image](https:…
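For context, here is a minimal sketch of how a low-bit checkpoint is usually reloaded, assuming the bigdl-llm transformers wrapper (`save_low_bit`/`load_low_bit`) and `intel_extension_for_pytorch`; the model path and prompt are placeholders, not taken from the report.
```python
# Sketch: reload a BigDL-LLM low-bit (e.g. sym_int8) checkpoint and optionally
# optimize it with IPEX. Paths are placeholders; assumes the model was saved
# earlier with model.save_low_bit(...) and tokenizer files were saved alongside.
import torch
import intel_extension_for_pytorch as ipex
from bigdl.llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

low_bit_path = "./llama2-7b-sym-int8"  # hypothetical path to the saved low-bit model

# load_low_bit restores the quantized weights with their low-bit shapes, which
# avoids the size-mismatch errors seen when reloading such a checkpoint with
# the stock transformers from_pretrained.
model = AutoModelForCausalLM.load_low_bit(low_bit_path)
tokenizer = AutoTokenizer.from_pretrained(low_bit_path)

# Optional: apply IPEX inference optimizations on top of the loaded model.
model = ipex.optimize(model.eval(), dtype=torch.float32)

with torch.inference_mode():
    inputs = tokenizer("Hello, world", return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=32)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```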
-
### Issue confirmation: Search before asking
- [X] I have searched the issues and found no related answer.
### Please ask your question
Using Paddle Inference on a Jetson Xavier NX to deploy PaddleSli…
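For reference, a minimal sketch of loading a PaddleSlim-quantized model with the Paddle Inference Python API and running it through the GPU + TensorRT path on Jetson; the file names, input shape, and TensorRT settings are placeholders, not details from the original question.
```python
# Sketch: load an offline-quantized (PaddleSlim) model with Paddle Inference and
# run it via the GPU + TensorRT INT8 path. File names and shapes are placeholders.
import numpy as np
import paddle.inference as paddle_infer

config = paddle_infer.Config("model.pdmodel", "model.pdiparams")
config.enable_use_gpu(100, 0)              # 100 MB initial GPU memory pool, device 0
config.enable_tensorrt_engine(
    workspace_size=1 << 30,
    max_batch_size=1,
    min_subgraph_size=3,
    precision_mode=paddle_infer.PrecisionType.Int8,
    use_static=False,
    use_calib_mode=False,                  # model is already quantized offline
)

predictor = paddle_infer.create_predictor(config)

# Feed a dummy input; the input name and shape depend on the exported model.
input_name = predictor.get_input_names()[0]
input_handle = predictor.get_input_handle(input_name)
input_handle.copy_from_cpu(np.random.rand(1, 3, 224, 224).astype("float32"))

predictor.run()
output_handle = predictor.get_output_handle(predictor.get_output_names()[0])
print(output_handle.copy_to_cpu().shape)
```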
-
```python
import paddle.fluid as fluid
from pyramidbox_test import PyramidBox
from paddle.fluid.framework import IrGraph
from paddle.fluid import core
from paddle.fluid.contrib.slim.quantization.quanti…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
Expo…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar feature requests.
### Description
Non-specialized …
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
…
-
**Describe the solution you'd like**
I found that the latest release, TensorRT 8.0, supports INT8 quantization on GPU, which greatly accelerates inference speed.
And now onnxruntime is …
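For reference, a minimal sketch of turning on the INT8 path of ONNX Runtime's TensorRT execution provider through its documented environment variables; the model path and calibration table name are placeholders.
```python
# Sketch: enable INT8 on the ONNX Runtime TensorRT execution provider via its
# environment variables. Model path and calibration table name are placeholders.
import os
import onnxruntime as ort

os.environ["ORT_TENSORRT_INT8_ENABLE"] = "1"
os.environ["ORT_TENSORRT_INT8_CALIBRATION_TABLE_NAME"] = "calibration.flatbuffers"

sess = ort.InferenceSession(
    "model.onnx",
    providers=["TensorrtExecutionProvider", "CUDAExecutionProvider"],
)
print(sess.get_providers())
```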
-
**Describe the bug**
I am trying to get started with INT8 inference on DeepSpeed, but I am running into `RuntimeError: CUDA error: an illegal memory access was encountered`.
**To Re…
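For context, a minimal sketch of the kind of DeepSpeed inference setup the report refers to, using a Hugging Face GPT-2 model as a stand-in and the standard `deepspeed.init_inference` arguments; the model choice and single-GPU assumption are placeholders.
```python
# Sketch: wrap a Hugging Face model with DeepSpeed inference kernels in INT8.
# GPT-2 is a placeholder model; mp_size=1 assumes a single GPU.
import torch
import deepspeed
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")

ds_engine = deepspeed.init_inference(
    model,
    mp_size=1,
    dtype=torch.int8,
    replace_with_kernel_inject=True,
)

inputs = tokenizer("DeepSpeed INT8 inference test", return_tensors="pt").to("cuda")
with torch.no_grad():
    out = ds_engine.module.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```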
-
For the various environment variables of the TensorRT EP, ONNX Runtime needs to provide an API to override these settings on a per-model basis.
The most critical environment variables are FP16 (ORT_TENSORRT_FP16_ENABLE) and …
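A sketch of the kind of per-session override meant here, using the TensorRT EP provider-options dictionary that ONNX Runtime exposes in Python; the option values are placeholders.
```python
# Sketch: pass TensorRT EP settings per session instead of via process-wide
# environment variables such as ORT_TENSORRT_FP16_ENABLE. Values are placeholders.
import onnxruntime as ort

trt_options = {
    "trt_fp16_enable": True,
    "trt_max_workspace_size": 2 * 1024 * 1024 * 1024,
    "trt_engine_cache_enable": True,
    "trt_engine_cache_path": "./trt_cache",
}

sess = ort.InferenceSession(
    "model.onnx",
    providers=[
        ("TensorrtExecutionProvider", trt_options),
        "CUDAExecutionProvider",
    ],
)
print(sess.get_providers())
```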
-
I have an INT8 model quantized for the TensorRT EP. I want to run inference directly via TensorRT rather than through onnxruntime. Is that possible?
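Running a serialized TensorRT engine directly is possible in principle; below is a minimal sketch using the TensorRT Python API (TRT 8.x binding interface) and pycuda, assuming a single-input, single-output engine and a placeholder engine file name.
```python
# Sketch: run a serialized INT8 TensorRT engine directly, without onnxruntime.
# Assumes a single-input, single-output engine; file name and shapes are placeholders.
import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context)
import pycuda.driver as cuda
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
with open("model_int8.engine", "rb") as f, trt.Runtime(logger) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

context = engine.create_execution_context()

# Allocate host/device buffers for every binding (TRT 8.x binding API).
bindings, host_bufs, dev_bufs = [], [], []
for i in range(engine.num_bindings):
    shape = engine.get_binding_shape(i)
    dtype = trt.nptype(engine.get_binding_dtype(i))
    host = np.zeros(trt.volume(shape), dtype=dtype)
    dev = cuda.mem_alloc(host.nbytes)
    host_bufs.append(host)
    dev_bufs.append(dev)
    bindings.append(int(dev))

# Copy input to the device, execute, and copy the output back.
host_bufs[0][:] = np.random.rand(host_bufs[0].size).astype(host_bufs[0].dtype)
cuda.memcpy_htod(dev_bufs[0], host_bufs[0])
context.execute_v2(bindings)
cuda.memcpy_dtoh(host_bufs[-1], dev_bufs[-1])
print(host_bufs[-1][:10])
```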