int8-inference Search Results

1000+ results
for int8-inference

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

PaddlePaddle/PaddleDetection #2983

如果想使用paddle-inference是不是的安装paddle-inference，只是安装了paddlepaddl…

dengxinlong updated 3 years ago
12
ggerganov/llama.cpp #5761

Support BitNet b1.58 ternary models

New paper just dropped on Arxiv describing a way to train models in 1.58 bits (with ternary values: 1,0,-1). Paper shows performance increases from equivalently-sized fp16 models, and perplexity nearl…

igorbarshteyn updated 1 week ago
89
meta-introspector/ctuning-submodules #1

Owl

jmikedupont2 updated 10 months ago
23
NVIDIA/TensorRT-LLM #682

INT8 GEMM Support?

Will there be any plans to support INT8 GEMM? In the [SmoothQuant paper](https://arxiv.org/pdf/2211.10438.pdf) it seems like one of the main benefits is that by quantizing both weights and activations…

eycheung updated 4 weeks ago
6
pjreddie/darknet #81

Low precision inference?

Hi! Are there plans for making a low precision inference mode like many other neural network frameworks out there? Would be really helpful for embedded applications where we have very limited memory…

akshatd updated 5 years ago
18
veronicatorcolacci/keras-network--Google-coral-USB-accelerator #1

accuracy loss when running inference on edge TPU with keras …

Hi, I'm a student at the University of Bologna ( Italy) and I'm using the Google Coral USB accelerator for my thesis. I realized a keras neural network that classifies my data in four classes and the …

veronicatorcolacci updated 4 years ago
2
pytorch/executorch #1141

Questions on deploying Quantized models ...

Hi, This is more of a question than an issue, but I couldn't find the documentation or source code examples that address this. We have a backend that only supports fixed point operators and I am tr…

rvijayc updated 9 months ago
7
simoninithomas/Deep_reinforcement_learning_Course #33

Deep Q Learning Spaceinvaders

I've trained the model for 50 total episodes. However, when I run the last code cell, the action is always the same. I've printed Qs and the action, and the action is always [0 0 0 0 0 0 1 0]. The age…

noobmaster29 updated 5 years ago
15
nod-ai/SHARK #1800

Hires fix does not work.

I tried using the hires fix, but it does not work. Here is the error that I get: Traceback (most recent call last): File "gradio\routes.py", line 488, in run_predict File "gradio\blocks.py", …

Arcadia245 updated 9 months ago
3
xunzixunzi/ImagePlayer #1

关于YOLOv8-TensorRT-CPP

你好，由于没地方问，所以只好在这个下面问一下，希望你不要介意。我想在windows上部署yolov8,请问你修改的项目[YoloV8 TensorRT CPP](https://github.com/xunzixunzi/YOLOv8-TensorRT-CPP)是可以部署在windows上的把？请问有详细的在windows上的操作么？这个项目中的lib/tensorrt-cpp-api文件夹是下载…

Lishumuzixin updated 9 months ago
20

上一页 1...91 92 93 94 95 96 97...100 下一页

1000+ results for int8-inference

1000+ results
for int8-inference