-
Is it possible to add the code copy widget that you already have on https://nvidia.github.io/TensorRT-Model-Optimizer/ to https://nvidia.github.io/TensorRT-LLM/?
For example if you go to https://nvidi…
-
When I run inference with the Llama 7B model after int8 quantization, my throughput is only around 42 tokens/s, far lower than the 155 tokens/s stated in the documentation. Below is my executi…
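Throughput gaps like 42 vs. 155 tokens/s often come down to how tokens/s is measured (warm-up, batch size, what counts as a generated token). A minimal wall-clock sketch, where `generate` is a hypothetical stand-in for the engine's generate call, not the TensorRT-LLM API:

```python
import time

def tokens_per_second(generate, prompt, max_new_tokens=128):
    """Wall-clock tokens/s for a single generation call.

    `generate` is a placeholder callable (assumption) that takes a prompt
    and a token budget and returns the list of generated token ids.
    """
    start = time.perf_counter()
    out = generate(prompt, max_new_tokens)
    elapsed = time.perf_counter() - start
    return len(out) / elapsed
```

Measuring a single cold call this way will understate steady-state throughput; averaging over several warmed-up runs gives numbers closer to what documentation benchmarks report.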
-
I'm looking to do a lot of image quantization and have been searching for fast alternatives to K-means etc. Then I saw that there is a CUDA implementation of NeuQuant, although it is from 2011.
[Pap…
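For context, the K-means baseline that NeuQuant is usually compared against can be sketched in a few lines (a toy pure-Python color quantizer for illustration, not the CUDA NeuQuant implementation):

```python
import random

def kmeans_palette(pixels, k, iters=10, seed=0):
    """Toy K-means color quantizer.

    pixels: list of (r, g, b) tuples; returns k palette colors.
    Illustrates the baseline that faster quantizers compete with.
    """
    rng = random.Random(seed)
    centers = rng.sample(pixels, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in pixels:
            # assign each pixel to its nearest center (squared Euclidean distance)
            j = min(range(k),
                    key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centers[j])))
            clusters[j].append(p)
        for j, c in enumerate(clusters):
            if c:  # recompute the center as the per-channel mean of its cluster
                centers[j] = tuple(sum(ch) / len(c) for ch in zip(*c))
    return centers
```

The per-iteration cost is O(pixels × k), which is exactly what makes K-means slow on large images and why GPU-side alternatives are attractive.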
-
Error occurred when executing Joy_caption_load:
No package metadata was found for bitsandbytes
File "E:\ComfyUI-aki-v1.3\execution.py", line 317, in execute
output_data, output_ui, has_subgraph…
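The error above means the loader could not find pip metadata for bitsandbytes, which usually indicates a missing install or a different Python environment than the one ComfyUI is running in. A minimal diagnostic sketch using only the standard library:

```python
# Check whether package metadata is visible to the running interpreter.
# "No package metadata was found for <pkg>" comes from exactly this kind
# of lookup failing.
from importlib import metadata

def check_package(name):
    """Return the installed version string, or None if metadata is missing."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None
```

Running `check_package("bitsandbytes")` inside the same environment that launches ComfyUI quickly shows whether the package is installed where the node expects it.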
-
I would like to call your attention to the fact that this patent for optimizing [image-specific] quantization tables has expired:
https://www.google.com/patents/US5724453
The paper can be found here:
http://w…
-
## Goal
- `cortex model pull` should have clear APIs that support different model repo sources
- e.g. Huggingface, Cortex Hub
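One way the repo-source dispatch described above could be sketched (hypothetical naming scheme and function names, not the actual Cortex API):

```python
# Hypothetical sketch: route `cortex model pull` by repo source based on a
# prefix in the model id. Ids without a prefix default to the Cortex Hub
# (an assumption for illustration, not documented behavior).
def parse_source(model_id):
    """Map "huggingface:org/name" to ("huggingface", "org/name")."""
    if ":" in model_id:
        source, path = model_id.split(":", 1)
        return source, path
    return "cortexhub", model_id
```

A prefix scheme like this keeps the CLI surface stable while new sources are added behind the parser.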
## Tasklist
- [x] #1393
- [x] #1394
- [x] #1395
- [ ] #1398
## CLI
…
-
### Is there an existing issue for this problem?
- [X] I have searched the existing issues
### Operating system
macOS
### GPU vendor
Apple Silicon (MPS)
### GPU model
_No response_
### GPU VRA…
-
### Describe the issue
I did QAT quantization on a CNN model; when I export it to an ONNX model, inference is slower than with the TorchScript QAT model.
The result is:
torchscript: 4.798517942428589 …
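Latency comparisons like this are sensitive to warm-up and averaging. A minimal timing-harness sketch, where `run_fn` is a placeholder for a session or model forward call (not the ONNX Runtime or TorchScript API):

```python
import time

def bench(run_fn, warmup=5, iters=50):
    """Average wall-clock latency of run_fn() after warm-up runs.

    `run_fn` is an assumed zero-argument callable wrapping one inference.
    Warm-up runs exclude one-time costs (lazy init, kernel selection)
    from the measured average.
    """
    for _ in range(warmup):
        run_fn()
    start = time.perf_counter()
    for _ in range(iters):
        run_fn()
    return (time.perf_counter() - start) / iters
```

Comparing the two backends with identical warm-up and iteration counts rules out one-time initialization cost as the source of the gap.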
-
I use `python==3.10.3`, `unstructured==0.15.12`
```
from unstructured.partition.pdf import partition_pdf
```
```
PS C:\Users\ProjectName\test.py"
Traceback (most recent call last):
File "…
-
## 🐛 Bug
I tried this on both the 23 ultra and the 24
## To Reproduce
1. Using any model, such as Qwen2_1_5B_q4f16_1, try to send a prompt.
I've tested many models and it seems to be model…