-
### 🐛 Describe the bug
Hello,
I'm using the QuantTrainModule to train a MobileNetV2 model (using the MobileNetV2 class in this repo), and the quantized checkpoints have 32-bit floating-point weigh…
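Seeing float32 weights in a QAT checkpoint is usually expected behavior: quantization-aware training frameworks commonly store a quantize-dequantize ("fake quant") view of each weight, where values are snapped onto the int8 grid but serialized in float storage. A minimal sketch of that idea (not the actual QuantTrainModule code, which may differ):

```python
# Hedged sketch: why a "quantized" checkpoint from quantization-aware
# training can still contain float32 tensors. Fake quantization snaps each
# weight onto a symmetric int8 grid but keeps float storage.
def fake_quantize(weights, num_bits=8):
    """Quantize-dequantize a list of float weights on a symmetric grid."""
    qmax = 2 ** (num_bits - 1) - 1          # 127 for 8 bits
    scale = max(abs(x) for x in weights) / qmax or 1.0
    # round to the integer grid, then map back to float
    return [round(x / scale) * scale for x in weights]

w = [0.30, -0.71, 0.055]
fq = fake_quantize(w)
# Every value in `fq` is an integer multiple of `scale`, yet a checkpoint
# would still serialize them as 32-bit floats; a separate export step
# converts them to true int8 tensors.
```

An actual int8 export would store `round(x / scale)` as integers plus the scale, which is typically a separate conversion step after training.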
-
Is CUDA 12.1 support coming or in the works? Just curious, since faster-whisper keeps looking for cublas11.dll... and although I don't use cuDNN, I'm assuming that would be another aspect to consider? …
-
Discussion: https://www.reddit.com/r/MachineLearning/comments/hu7lyt/p_yolov4tiny_speed_1770_fps_tensorrtbatch4/
Full structure: [structure of yolov4-tiny.cfg model](https://netron.app/?url=https:/…
-
The inference speed of the int8-quantized version of SDXL is much slower than that of fp16. I am running the TRT 9.3 SDXL demo and here are the results. (I changed the shape to 768x1344 manually.)
fp16 : pyt…
-
I have a spiking convolutional neural network. It uses the Leaky (Leaky Integrate-and-Fire) neuron from the [SNNTorch](https://snntorch.readthedocs.io/en/latest/snntorch.html) library as activation functio…
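For context on what the Leaky neuron computes, here is a hedged, dependency-free sketch of the discrete-time Leaky Integrate-and-Fire update (this is a simplification, not snnTorch's actual implementation; `beta`, `threshold`, and reset-by-subtraction are assumptions based on the usual LIF formulation):

```python
# Hedged sketch of one Leaky Integrate-and-Fire time step: the membrane
# potential decays by `beta`, integrates the input current, emits a spike
# when it crosses `threshold`, and is reset by subtraction.
def lif_step(input_current, mem, beta=0.9, threshold=1.0):
    """Return (spike, new_membrane) for one discrete time step."""
    mem = beta * mem + input_current        # leaky integration
    spike = 1.0 if mem >= threshold else 0.0
    mem = mem - spike * threshold           # reset-by-subtraction
    return spike, mem

# Drive the neuron with a constant sub-threshold current: it charges up
# over several steps, fires, resets, and repeats.
mem, spikes = 0.0, []
for _ in range(6):
    s, mem = lif_step(0.4, mem)
    spikes.append(s)
```

The non-differentiable spike function is why SNN libraries substitute a surrogate gradient during backpropagation.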
-
When I run run.sh, the program hits a segmentation fault. Could you give some hints?
GDB information:
```
(gdb) bt
#0 0x000000000041323d in nvdla::TensorDescListParser::buildList (this=0x682320)…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
Hi,
I am able to run SSD MobileNetV2 and CenterNet MobileNetV2 (box prediction) on my Android device. When I compare the inference speed of the models on the device, I get the results below:
inf…
-
### 🚀 The feature, motivation and pitch
VLLM has announced support for running llama3.1-405b-fp8 on 8xA100. This is the [blog](https://blog.vllm.ai/2024/07/23/llama31.html)
Does vllm support run…
-
Given the existing support for GPT-J and its rotary embeddings, is LLaMA supported as well? Hugging Face just shipped their implementation: https://github.com/huggingface/transformers/commit/464d4207756538…
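Since the question hinges on rotary embeddings, a hedged sketch of the basic RoPE operation that GPT-J and LLaMA share (pairing conventions and bases vary between models; this shows only the core rotation):

```python
import math

# Hedged sketch of rotary position embeddings (RoPE): each pair of feature
# dimensions is rotated by a position-dependent angle, so dot products
# between query and key vectors depend on their relative positions.
def rope(vec, pos, base=10000.0):
    """Apply a rotary embedding to a flat vector at sequence position `pos`."""
    d = len(vec)
    out = []
    for i in range(0, d, 2):
        theta = pos * base ** (-i / d)      # per-pair rotation frequency
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out.extend([x * c - y * s, x * s + y * c])   # 2-D rotation
    return out
```

Because each step is a pure rotation, the vector's norm is preserved and position 0 is the identity, which is what makes the scheme compatible with standard attention.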