-
Why are there no extended experiments on LLMs or large vision transformers?
-
The model was downloaded from https://github.com/fatihcakirs/mobile_models/blob/main/v0_7/tflite/mobilebert_int8_384_20200602.tflite
Some fully-connected weights have a non-zero zero point (e.g. weight `b…
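For context, a non-zero zero point is expected whenever a weight tensor's float range is not symmetric around zero and asymmetric quantization is used. A minimal NumPy sketch of per-tensor asymmetric int8 quantization (the weight values are made up for illustration):

```python
import numpy as np

def asymmetric_quantize(w, num_bits=8):
    """Per-tensor asymmetric quantization to signed int8."""
    qmin, qmax = -(2 ** (num_bits - 1)), 2 ** (num_bits - 1) - 1  # -128, 127
    w_min, w_max = float(w.min()), float(w.max())
    scale = (w_max - w_min) / (qmax - qmin)
    # The zero point maps float 0.0 onto the integer grid; it is only 0
    # when the float range is symmetric around zero.
    zero_point = int(round(qmin - w_min / scale))
    q = np.clip(np.round(w / scale) + zero_point, qmin, qmax).astype(np.int8)
    return q, scale, zero_point

# A weight tensor whose range [-0.1, 0.5] is not centered on zero.
w = np.array([-0.1, 0.0, 0.25, 0.5], dtype=np.float32)
q, scale, zp = asymmetric_quantize(w)
print(zp)  # non-zero, because the float range is asymmetric
```

Dequantizing with `(q - zero_point) * scale` recovers the weights to within one quantization step, which is why a non-zero zero point is not by itself a sign of a broken model.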
-
Hello all,
I am working on adding some logic on top of QAT where I make a few of the layers non-trainable during QAT, based on some criterion. I currently see that there is no such support in QAT (a…
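One workaround, sketched here in PyTorch rather than the TF QAT API the question is about (and with an arbitrary toy model and layer choice): after preparing a model for QAT, individual layers can still be frozen by turning off gradients on their parameters. The fake-quant observers keep running on those layers, but the weights stop updating.

```python
import torch
import torch.nn as nn
from torch.ao.quantization import get_default_qat_qconfig, prepare_qat

# Toy model standing in for the real network.
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
model.train()  # prepare_qat expects training mode
model.qconfig = get_default_qat_qconfig("fbgemm")
qat_model = prepare_qat(model)

# Freeze the first Linear layer: its weights no longer receive gradients,
# while the rest of the network continues QAT fine-tuning as usual.
for p in qat_model[0].parameters():
    p.requires_grad = False

trainable = [n for n, p in qat_model.named_parameters() if p.requires_grad]
print(trainable)  # only parameters of the unfrozen layers remain
```

The same idea applies in Keras by setting `layer.trainable = False` on the quantize-wrapped layers before compiling, though whether the quantize wrappers honor it may depend on the tfmot version.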
-
# PaddleSlim Quantization
![image](https://user-images.githubusercontent.com/1312389/170643197-8a42af2b-b696-4363-ac3a-29a582642162.png)
PaddleSlim mainly provides three quantization methods: quantization-aware training (Quant Aware Training, QAT), dynamic post-training quantization (Post Train…
-
When I run quantization_speedup.py in /examples/tutorials, I get errors like this:
```
Traceback (most recent call last):
  File "quantization_speedup.py", line 114, in <module>
    engine.compress()
  Fi…
-
Originally I posted this bug [#54753](https://github.com/tensorflow/tensorflow/issues/54753) on [tensorflow/tensorflow](https://github.com/tensorflow/tensorflow/issues) and was advised to repost it he…
-
I only have P100 and V100 GPUs, which don't support INT8. So what should I do to quantize BERT to FP16?
Thanks in advance!
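Since FP16 needs no calibration, one common route (a sketch, assuming PyTorch; a small linear stack stands in for BERT, but the same `half()` call works on e.g. a Hugging Face `BertModel`) is simply to cast the weights to half precision and run inference on the GPU, which both P100 and V100 support natively:

```python
import torch
import torch.nn as nn

# Stand-in for a BERT encoder; with transformers you would similarly call
# model.half() on BertModel.from_pretrained("bert-base-uncased").
model = nn.Sequential(nn.Linear(768, 768), nn.GELU(), nn.Linear(768, 768))

fp32_bytes = sum(p.numel() * p.element_size() for p in model.parameters())
model.half()  # cast all parameters and buffers to float16 in place
fp16_bytes = sum(p.numel() * p.element_size() for p in model.parameters())

print(fp16_bytes * 2 == fp32_bytes)  # True: FP16 halves the weight storage
```

For inference, move the model to the GPU and cast the inputs to `torch.float16` as well; on V100 the tensor cores give an additional speedup over the pure memory saving.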
-
**Describe the bug**
Hi, I use the OpenVINO EP to test QDQ model performance, but I find that the QDQ model's performance is worse than the original FP32 model's.\
**System information**
- ONNX Runtime installed fro…
-
Excellent work!
Can it run inference on the CPU?
And how much faster is it than the baseline?
-
**Describe the bug**
I use the Hugging Face Transformers ALBERT model albert-base-v2 to classify text; meanwhile, I use ONNX Runtime to optimize and quantize it:
`opt_model = optimizer.optimize_model(
…