-
-
New paper just dropped on Arxiv describing a way to train models in 1.58 bits (with ternary values: 1,0,-1). Paper shows performance increases from equivalently-sized fp16 models, and perplexity nearl…
-
-
Will there be any plans to support INT8 GEMM? In the [SmoothQuant paper](https://arxiv.org/pdf/2211.10438.pdf) it seems like one of the main benefits is that by quantizing both weights and activations…
-
Hi!
Are there plans for making a low precision inference mode like many other neural network frameworks out there?
Would be really helpful for embedded applications where we have very limited memory…
-
Hi, I'm a student at the University of Bologna ( Italy) and I'm using the Google Coral USB accelerator for my thesis. I realized a keras neural network that classifies my data in four classes and the …
-
Hi,
This is more of a question than an issue, but I couldn't find the documentation or source code examples that address this. We have a backend that only supports fixed point operators and I am tr…
-
I've trained the model for 50 total episodes. However, when I run the last code cell, the action is always the same. I've printed Qs and the action, and the action is always [0 0 0 0 0 0 1 0]. The age…
-
I tried using the hires fix, but it does not work. Here is the error that I get:
Traceback (most recent call last):
File "gradio\routes.py", line 488, in run_predict
File "gradio\blocks.py", …
-
你好,由于没地方问,所以只好在这个下面问一下,希望你不要介意。我想在windows上部署yolov8,请问你修改的项目[YoloV8 TensorRT CPP](https://github.com/xunzixunzi/YOLOv8-TensorRT-CPP)是可以部署在windows上的把?请问有详细的在windows上的操作么?这个项目中的lib/tensorrt-cpp-api文件夹是下载…