-
Hi all,
I really appreciate your efforts to provide this excellent tool to convert pytorch model to HLS.
I am trying to convert a resnet18 model to HLS, and I found that the example-models insid…
-
**System information**
- TensorFlow version (you are using): 2.6.0 (TFMOT 0.7.2)
- Are you willing to contribute it (Yes/No): Potentially, with some advice on how to implement it
**Motivation**…
hunse updated
2 years ago
-
### System Info
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.35.03 Driver Version: 560.35.03 CUDA Version: 12.6 |…
-
Hi there
I have some question about your paper “ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks”
1. why do you use more then one codebook,in…
-
The notebook tests in the CI seem to randomly fail every so often. Rerunning them seems to work. Any idea what's going on? Heres's a recent traceback:
```
[PosixPath('Adjoint.ipynb'), PosixPath(…
-
When I run the full quantization model of mobilenet, the current CPU platform is mtk8163. At present, I find a very strange phenomenon. When I limit the CPU number to 2 cores, I run the image …
-
### System Info
- `transformers` version: 4.41.2
- Platform: Linux-5.15.0-1044-nvidia-x86_64-with-glibc2.35
- Python version: 3.10.0
- Huggingface_hub version: 0.23.0
- Safetensors version: 0.4.2…
-
Thx for your work again, i have tried your default config for w4a4g2 quantization. it works well for resnet-18 on imagenet(top1 acc ~71%). So i want to try if it can work for w1a1(a.k.a BNN). I use th…
-
Hi there, I am trying to quantize my input feature `sparse_feat` with the following codes in my network.
```
class MyModel(nn.Module):
def __init__ (self):
super(MyModel, self).__init_…
-
# Learn by Blogging - The Mental Model for Leveraging LLMs in Cloud
In this blog post, we are exploring the intersection of different sized LLMs and their optimal compute environments for deployment
…