int8-inference Search Results

1000+ results for int8-inference


openvinotoolkit/openvino.genai #934

issue with chatglm2-6b

I saw an issue with chatglm2-6b. It runs successfully with numactl -m 0 -C 0-23. It fails with numactl -m 0 -C 0-31, 0-47, or 0-55. It can be reproduced with INT8_ASYM or 4BIT_…

QuPengfei updated 2 weeks ago
1
ultralytics/ultralytics #17203

YOLOV8 Pose on Google Coral USB accelerator

### Search before asking - [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussi…

NguyenTheAn updated 16 hours ago
13
robertknight/rten #382

VALID auto_pad value

I found an interesting [model](https://github.com/Picsart-AI-Research/MI-GAN/tree/main) for removing objects from images. I'm going to add it to the comparisons-rten repo; I have already prepared the Python code. B…

igor-yusupov updated 1 week ago
8
apple/coremltools #2227

need help about both model weight and activation quantizatio…

From the issue "https://developer.apple.com/forums/thread/740518 how do we use the computational power of A17 Pro Neural Engine?" I learned that if I want to run inference with my mlmodel on my iPad Pro with …

AndreaChiChengdu updated 5 months ago
1
surendramaran/YOLOv8-TfLite-Object-Detector #24

How to adapt the demo code for running a YOLOv8 model with i…

Hi, I have quantized a YOLOv8 model to int8 parameters. Could you please guide me on how to modify the demo code to make it compatible for running with the int8 quantized model? ![screenshot-202409…

sid-022 updated 1 month ago
3
nod-ai/SHARK-ModelDev #566

[tracking] E2EShark Model Tests Onnx Mode

Below is the list of issues we are hitting when running [vision int8 models](https://github.com/nod-ai/SHARK-TestSuite/blob/merge-reports/e2eshark/ci_model_lists/shark-test-suite.txt) end to end usin…

saienduri updated 1 month ago
11
microsoft/onnxruntime #13168

Onnxruntime fails on GPU loading inference with int8 models

### Describe the issue I'm using onnx-runtime to run inference on GPU. I have installed CUDA 10.1, onnxruntime-gpu 1.4.0 and onnx 1.10.2. The inference is with resnet50-v1-12-int8.onnx mo…

IzanCatalan updated 1 year ago
3
DerryHub/BEVFormer_tensorrt #75

benchmark on orin

Has anyone successfully deployed on Orin? What's the inference time like?

shuyuan-wang updated 6 days ago
6
pytorch/ao #554

Quantized Training

Inspired by a recent back and forth with @gau-nernst, we should add some quantized training recipes in AO for small models (600M param range). Character.ai recently shared that they're working on qua…

msaroufim updated 2 months ago
4
intel/intel-xpu-backend-for-triton #2279

What's version of Intel triton? or "No module named 'triton.…

With: * https://github.com/pytorch/benchmark/commit/7e1ba8d5983e4ff31cbf79d0f5dec071d11370cd * https://github.com/pytorch/pytorch/commit/0aa41eb52f7e577cf88e0f1b0adb34167a9ae94b * https://github.co…

dvrogozh updated 2 weeks ago
1

