network-quantization Search Results

1000+ results
for network-quantization


fastmachinelearning/hls4ml #123

Convert pytorch resnet18 model to HLS

Hi all, I really appreciate your efforts to provide this excellent tool to convert pytorch model to HLS. I am trying to convert a resnet18 model to HLS, and I found that the example-models insid…

garyhujingyao updated 1 month ago
5
tensorflow/model-optimization #994

Allow quantization of tied weights

**System information** - TensorFlow version (you are using): 2.6.0 (TFMOT 0.7.2) - Are you willing to contribute it (Yes/No): Potentially, with some advice on how to implement it **Motivation**…

hunse updated 2 years ago
2
NVIDIA/TensorRT-LLM #2428

trt_build for Llama 3.1 70B w4a8 fails with CUDA error

### System Info +-----------------------------------------------------------------------------------------+ | NVIDIA-SMI 560.35.03 Driver Version: 560.35.03 CUDA Version: 12.6 |…

chrisreese-if updated 1 week ago
1
gudovskiy/ShiftCNN #4

Question about you paper [ShiftCNN]

Hi there, I have some questions about your paper “ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks”. 1. Why do you use more than one codebook, in…

KangolHsu updated 6 years ago
1
quantumlib/Qualtran #911

Notebook tests are flaky?

The notebook tests in the CI seem to randomly fail every so often. Rerunning them seems to work. Any idea what's going on? Here's a recent traceback: ``` [PosixPath('Adjoint.ipynb'), PosixPath(…

fdmalone updated 3 months ago
1
pytorch/android-demo-app #43

Run the ImageClassification example have some question

When I run the full quantization model of mobilenet, the current CPU platform is mtk8163. At present, I find a very strange phenomenon. When I limit the CPU number to 2 cores, I run the image …

linfeng886 updated 4 years ago
4
huggingface/transformers #31293

`merge_and_unload` for a quantized model ruins its quality

### System Info - `transformers` version: 4.41.2 - Platform: Linux-5.15.0-1044-nvidia-x86_64-with-glibc2.35 - Python version: 3.10.0 - Huggingface_hub version: 0.23.0 - Safetensors version: 0.4.2…

Aktsvigun updated 1 week ago
11
huqinghao/PalQuant #3

Loss nan for w1a1g3

Thx for your work again, i have tried your default config for w4a4g2 quantization. it works well for resnet-18 on imagenet(top1 acc ~71%). So i want to try if it can work for w1a1(a.k.a BNN). I use th…

ChuanjunLAN updated 2 years ago
5
lucidrains/vector-quantize-pytorch #131

Why do I get almost the same codes after the 1st batch?

Hi there, I am trying to quantize my input feature `sparse_feat` with the following codes in my network. ``` class MyModel(nn.Module): def __init__ (self): super(MyModel, self).__init_…

tanyz0208 updated 4 months ago
1
senthilkumarm1901/QuartoBlogComments #2

learn-by-blogging/posts/2024-06-17-how-to-host-open-source-l…

# Learn by Blogging - The Mental Model for Leveraging LLMs in Cloud In this blog post, we are exploring the intersection of different sized LLMs and their optimal compute environments for deployment …

utterances-bot updated 2 months ago
1

