product-quantization Search Results

1000+ results for product-quantization

Best match

pytorch/benchmark #278

Benchmark quantization

Add benchmarks for quantized models. This might be implemented as a new 'flavor' of test_eval, where most models raise NotImplemented and it is strictly opt-in to add quantization for particular mo…

wconstab updated 3 years ago
18 comments

Xilinx/finn #174

Caching support for long-running transformations

To speed up the compilation process for large models or large layers, it would make sense to have a caching mechanism for long-running transformations. The cached outputs would be persistent and get r…

maltanar updated 4 years ago
1 comment

xai-org/grok-1 #62

Hardware requirements

What are minimum and recommended hardware requirements to run the model and to do training? 1. How much GPU Memory (VRAM) is required? 2. How much RAM is required? 3. What GPUs are recommended? …

Konard updated 4 months ago
18 comments

NVIDIA/TensorRT-LLM #396

Llama7b Int4 on Nvidia T4. Output from Triton is incorrect.

Hello folks, I am looking to build the llama7b int4 weight and serve via Triton. I attempted constructing it and verifying whether the int4 output is correct. However, when I built it with ```u…

matichon-vultureprime updated 9 months ago
3 comments

jhj0517/Whisper-WebUI #205

Generated subtitles are too long

**Which OS are you using?** - OS: macOS Sonoma 14.3.1 --- I am trying to translate Korean audio files and the generation works, but I often find that the generated subtitles are too long. For …

joshuachough updated 1 month ago
3 comments

huggingface/transformers #29704

Grok-1 MoE support

### Model description X-AI recently released [grok-1](https://huggingface.co/xai-org/grok-1), a massive MoE model, with a total parameter count of 314B across 8 experts, 2 active at a time. Would be …

AlpinDale updated 5 months ago
3 comments

lsp-plugins/lsp-plugins #267

DSP Precision Question

I have recently tested the precision of the lsp parametric equaliser. I enabled a couple of filters and set their gain to 0 dB. The test signal was white noise. Then I noticed that the output after th…

i-LOVE-cplusplus updated 2 years ago
4 comments

plaidml/plaidml #185

Custom gradients

Is it possible to override the gradient for a TILE function?

kazimuth updated 5 years ago
4 comments

SysCV/sam-hq #112

Will a fast version be released like segment-anything-fast?

Hello, I'm a contributor to the project [ISAT](). Your project sam-hq has helped me a lot; it's great work. pytorch-labs has recently released a new project [segment-anything-fast](https://github…

yatengLG updated 6 months ago
3 comments

apache/lucene #12615

Should we explore DiskANN for aKNN vector search?

### Description I came across this compelling sounding [JVector project](https://foojay.io/today/jvector-1-0/) which looks to have awesome QPS performance. It uses [DiskANN](https://www.microsoft.…

mikemccand updated 1 month ago
41 comments
