quantization-efficient-network Search Results

335 results for quantization-efficient-network

Best match


vllm-project/vllm #2526

[RFC] Speedup vLLM inference with Intel® Extension for PyTor…

## Motivation In the current technological landscape, Generative AI (GenAI) workloads and models have gained widespread attention and popularity. Large Language Models (LLMs) have emerged as the dom…

liangan1 updated 1 month ago
6
AlexeyAB/darknet #5069

merge bn and conv in training

ArtyZe updated 4 years ago
6
zhangjun/zhangjun.github.io #30

Papers

# accelerator [Modeling Deep Learning Accelerator Enabled GPUs](https://deepai.org/publication/modeling-deep-learning-accelerator-enabled-gpus)

zhangjun updated 9 months ago
3
pjreddie/darknet #1427

What is the best way to create a tiny network for YOLO

Currently trying to create a tinier version of the [v3 tiny](https://github.com/pjreddie/darknet/blob/master/cfg/yolov3-tiny.cfg). I was messing around with the cfg file and have come up with [this](h…

sd12832 updated 4 years ago
6
LAION-AI/laion-dedup #1

Computing hashes from embeddings

https://github.com/facebookresearch/faiss/issues/2531#issuecomment-1280695975 some thoughts here https://docs.google.com/document/d/1AryWpV0dD_r9x82I_quUzBuRyzDotL_HHnKuNB9H3Zc/edit?usp=drivesdk mo…

rom1504 updated 1 year ago
5
irthomasthomas/undecidability #628

LLaVA/README.md at main · haotian-liu/LLaVA

- [ ] [LLaVA/README.md at main · haotian-liu/LLaVA](https://github.com/haotian-liu/LLaVA/blob/main/README.md?plain=1) # LLaVA/README.md at main · haotian-liu/LLaVA ## 🌋 LLaVA: Large Language and Vi…

irthomasthomas updated 4 months ago
1
mathmanu/caffe-jacinto #1

Quantization

Hello, I am working on both image classification examples (CIFAR/IMAGENET) and am struggling to understand where the quantization appears in your examples. Actually, I looked in the prototxt files …

Wronskia updated 5 years ago
21
OpenTalker/video-retalking #202

Is there ANYWAY, to make this FAST, FASTER? And to use GPU, …

Thanks. I really need to make it faster please.

AIhasArrived updated 4 months ago
6
huggingface/accelerate #2813

"Only Tensors of floating point and complex dtype can requir…

### System Info ```Shell Python 3.11.5 torch 2.3.0 transformers 4.41.1 accelerate 0.30.1 +-----------------------------…

artkpv updated 1 day ago
2
tiny-dnn/tiny-dnn #202

Quantization method for conv, deconv and fc layers.

## Quantization Method for conv, deconv and fc Layers. Here I want to implement quantization of the operations in conv, deconv and fc layers. Many quantization methods are included in this paper: Ristr…

wangyida updated 6 years ago
49
