-
If we try to perform inference in float32, we get the error:
```
AssertionError: Key and Value Dtypes should match
```
This error comes from [this line](https://github.com/google/maxtext/blob/eb…
-
I tried using the hires fix, but it does not work. Here is the error that I get:
Traceback (most recent call last):
File "gradio\routes.py", line 488, in run_predict
File "gradio\blocks.py", …
-
The NVTX Ops should default to identity when no GPU Device is registered.
Use-case, being able to run example scripts on a CPU machine and making sure the project compiled properly
```python
…
-
Hi, I'm a student at the University of Bologna ( Italy) and I'm using the Google Coral USB accelerator for my thesis. I realized a keras neural network that classifies my data in four classes and the …
-
Hi,
I am able to run SSD MobileNetV2 and CenterNet MobileNetV2 (boxes prediction) on my android device. When I compare inference speed of the models on my android device I get below results:
inf…
-
### 🐛 Describe the bug
Hi,
Running Phi3 Medium on LocalAI with OpenVINO backend I found that while the int8 quantization is working correctly, the int4 quant gives the following error after few to…
-
### 🐛 Describe the bug
Hello,
I'm using the QuantTrainModule to train a MobileNetV2 model (using the MobileNetV2 class in this repo), and the quantized checkpoints have 32-bit floating-point weigh…
-
Hi @wanghr323 Thank for your Yolov7 QAT.
I follow your [tutorial](https://github.com/NVIDIA-AI-IOT/yolo_deepstream/tree/main/yolov7_qat) and successful on QAT training.
```
Loading and preparing …
-
为使您的问题得到快速解决,在建立 Issue 前,请您先通过如下方式搜索是否有相似问题: [历史 issue](https://github.com/PaddlePaddle/Paddle-Lite/issues), [FAQ 文档](https://www.paddlepaddle.org.cn/lite/develop/quick_start/faq.html), [官方文档](https:/…
-
I have try on two platforms, 12490f with 64G 6400GHz DDR5, EPYC 7302 16C 3.0GHz 128G 3200 DDR4 (memory read 118GB/s)
there is log on 7302, firstly t-mac and secondly for llamacpp latest …