int8-inference Search Results

1000+ results
for int8-inference

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

AI-Hypercomputer/maxtext #595

Cannot do inference in float32

If we try to perform inference in float32, we get the error: ``` AssertionError: Key and Value Dtypes should match ``` This error comes from [this line](https://github.com/google/maxtext/blob/eb…

borisdayma updated 2 weeks ago
5
nod-ai/SHARK-Studio #1800

Hires fix does not work.

I tried using the hires fix, but it does not work. Here is the error that I get: Traceback (most recent call last): File "gradio\routes.py", line 488, in run_predict File "gradio\blocks.py", …

Arcadia245 updated 10 months ago
3
NVIDIA/nvtx-plugins #11

Register Bypass CPU OPs

The NVTX Ops should default to identity when no GPU Device is registered. Use-case, being able to run example scripts on a CPU machine and making sure the project compiled properly ```python …

DEKHTIARJonathan updated 3 years ago
1
veronicatorcolacci/keras-network--Google-coral-USB-accelerator #1

accuracy loss when running inference on edge TPU with keras …

Hi, I'm a student at the University of Bologna ( Italy) and I'm using the Google Coral USB accelerator for my thesis. I realized a keras neural network that classifies my data in four classes and the …

veronicatorcolacci updated 4 years ago
2
tensorflow/models #10006

CenterNet MobileNetV2 - inference is too slow

Hi, I am able to run SSD MobileNetV2 and CenterNet MobileNetV2 (boxes prediction) on my android device. When I compare inference speed of the models on my android device I get below results: inf…

Paliking updated 1 year ago
12
openvinotoolkit/openvino #25393

[BUG] [GPU] Phi3 Medium int4 Runtime Error: probability tens…

### 🐛 Describe the bug Hi, Running Phi3 Medium on LocalAI with OpenVINO backend I found that while the int8 quantization is working correctly, the int4 quant gives the following error after few to…

fakezeta updated 3 months ago
9
TexasInstruments/edgeai-torchvision #7

Quantized Checkpoints have Floating-Point Weights

### 🐛 Describe the bug Hello, I'm using the QuantTrainModule to train a MobileNetV2 model (using the MobileNetV2 class in this repo), and the quantized checkpoints have 32-bit floating-point weigh…

IsidoraR updated 2 years ago
20
NVIDIA-AI-IOT/yolo_deepstream #53

[YOLOV7-QAT] Cannot convert onnx to trt engine

Hi @wanghr323 Thank for your Yolov7 QAT. I follow your [tutorial](https://github.com/NVIDIA-AI-IOT/yolo_deepstream/tree/main/yolov7_qat) and successful on QAT training. ``` Loading and preparing …

HoangTienDuc updated 9 months ago
4
PaddlePaddle/Paddle-Lite #10501

使用经过paddle-lite-opt优化后的模型，在压测环境下报错(fread(dst, 1, size, file_…

为使您的问题得到快速解决，在建立 Issue 前，请您先通过如下方式搜索是否有相似问题: [历史 issue](https://github.com/PaddlePaddle/Paddle-Lite/issues), [FAQ 文档](https://www.paddlepaddle.org.cn/lite/develop/quick_start/faq.html), [官方文档](https:/…

L-Inkink updated 6 months ago
4
microsoft/T-MAC #40

Slow performance compared to llama.cpp origin

I have try on two platforms, 12490f with 64G 6400GHz DDR5, EPYC 7302 16C 3.0GHz 128G 3200 DDR4 (memory read 118GB/s) there is log on 7302, firstly t-mac and secondly for llamacpp latest …

idreamerhx updated 1 month ago
7

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for int8-inference

1000+ results
for int8-inference