-
## Motivation
In the current technological landscape, Generative AI (GenAI) workloads and models have gained widespread attention and popularity. Large Language Models (LLMs) have emerged as the dom…
-
-
# accelerator
[Modeling Deep Learning Accelerator Enabled GPUs](https://deepai.org/publication/modeling-deep-learning-accelerator-enabled-gpus)
-
Currently trying to create a tinier version of the [v3 tiny](https://github.com/pjreddie/darknet/blob/master/cfg/yolov3-tiny.cfg). I was messing around with the cfg file and have come up with [this](h…
-
https://github.com/facebookresearch/faiss/issues/2531#issuecomment-1280695975 some thoughts here
https://docs.google.com/document/d/1AryWpV0dD_r9x82I_quUzBuRyzDotL_HHnKuNB9H3Zc/edit?usp=drivesdk mo…
-
- [ ] [LLaVA/README.md at main · haotian-liu/LLaVA](https://github.com/haotian-liu/LLaVA/blob/main/README.md?plain=1)
## 🌋 LLaVA: Large Language and Vi…
-
Hello,
I am working on both image classification examples (CIFAR/IMAGENET) and am struggling to understand where the quantization appears in your examples. Actually, I looked in the prototxt files …
-
Thanks.
I really need to make it faster please.
-
### System Info
```Shell
Python 3.11.5
torch 2.3.0
transformers 4.41.1
accelerate 0.30.1
+-----------------------------…
```
-
## Quantization Method for conv, deconv and fc Layers.
Here I want to implement quantization of the operations in conv, deconv and fc layers. Many quantization methods are included in this paper: Ristr…
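
As a rough illustration of what quantizing an fc layer can look like, here is a minimal sketch of uniform symmetric per-tensor quantization in plain Python. This is a generic textbook scheme chosen for illustration, not necessarily one of the methods from the paper referenced above. The function names (`symmetric_scale`, `quantize`, `quantized_fc`) and the 8-bit default are assumptions made for this example. A conv or deconv layer would apply the same quantize, integer multiply-accumulate, rescale pattern to each output element.

```python
def symmetric_scale(values, num_bits=8):
    """Return the float step size that maps max |value| onto the largest integer level."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for 8 bits
    max_abs = max((abs(v) for v in values), default=0.0)
    # Fall back to 1.0 for empty or all-zero tensors to avoid dividing by zero.
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize(values, scale, num_bits=8):
    """Round each value to the nearest integer level and clamp to [-qmax, qmax]."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def quantized_fc(inputs, weights, bias, num_bits=8):
    """Fully connected layer with quantized inputs and weights (hypothetical helper).

    `weights` is a list of rows, one row per output neuron. The bias is kept
    in floating point here for simplicity.
    """
    s_in = symmetric_scale(inputs, num_bits)
    s_w = symmetric_scale([w for row in weights for w in row], num_bits)
    q_in = quantize(inputs, s_in, num_bits)

    outputs = []
    for row, b in zip(weights, bias):
        q_row = quantize(row, s_w, num_bits)
        # Integer multiply-accumulate, as fixed-point hardware would do it.
        acc = sum(qi * qw for qi, qw in zip(q_in, q_row))
        # Rescale the integer accumulator back to a real value, then add bias.
        outputs.append(acc * s_in * s_w + b)
    return outputs


if __name__ == "__main__":
    x = [1.0, -0.5]
    w = [[0.5, 0.25], [-1.0, 1.0]]
    b = [0.0, 0.1]
    print(quantized_fc(x, w, b))  # close to the float result [0.375, -1.4]
```

The per-tensor scale keeps the sketch short. A per-channel scale (one step size per weight row) is a common refinement when the rows have very different ranges.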