-
Hi everyone,
I built an NN with a BatchNormalization layer and have tried to quantize the whole model for an EdgeTPU application. I have read that I can use this layer after a Dense or Conv2D layer in t…
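For reference, a minimal post-training full-integer quantization sketch is below; the toy model and random calibration data are placeholders for the real network and dataset. The TFLite converter folds BatchNormalization into the preceding Conv2D/Dense during conversion, so the folded model can run fully in INT8 on the EdgeTPU:

```python
import numpy as np
import tensorflow as tf

# Toy model standing in for the real network: Conv2D followed by BatchNormalization.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(32, 32, 3)),
    tf.keras.layers.Conv2D(8, 3, padding="same"),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.ReLU(),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10),
])

def representative_data_gen():
    # Calibration batches; replace the random data with real samples.
    for _ in range(10):
        yield [np.random.rand(1, 32, 32, 3).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_data_gen
# Force full-integer ops so the EdgeTPU can map them; BatchNormalization is
# folded into the preceding Conv2D/Dense as part of the conversion.
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8

open("model_int8.tflite", "wb").write(converter.convert())
```

The resulting file can then be passed to the EdgeTPU compiler (`edgetpu_compiler model_int8.tflite`).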
-
**What is your question?**
Does cuvs support building an index when the dataset is larger than GPU memory?
Also, does cuvs support multi-GPU index building?
-
### Summary
Last year, we released [pytorch-labs/torchao](https://github.com/pytorch-labs/ao) to provide acceleration of Generative AI models using native PyTorch techniques. Torchao added support …
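As an illustration only, a minimal weight-only INT8 sketch using torchao's `quantize_` API is below; the toy model is a placeholder and the exact entry points may differ between torchao releases:

```python
import torch
from torchao.quantization import quantize_, int8_weight_only

# Toy stand-in for a generative model's linear layers (torchao generally targets bf16/fp16 weights).
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.GELU(),
    torch.nn.Linear(4096, 1024),
).to(torch.bfloat16).eval()

# Swap every nn.Linear weight for an int8 weight-only quantized tensor, in place.
quantize_(model, int8_weight_only())

print(type(model[0].weight))  # a quantized tensor subclass instead of a plain bf16 tensor
```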
-
Hello,
Is it possible to obtain a quantized .tflite version of YOLO v3 / YOLO Tiny v3 to do INT8 inference with the tools in this repository? I've tried using TensorFlow Lite's official tool, `toco`,…
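For context, a sketch of the flow with `tf.lite.TFLiteConverter` (toco's successor) is below, assuming the YOLOv3 graph has already been exported as a SavedModel at a hypothetical path `./yolov3_saved_model`:

```python
import numpy as np
import tensorflow as tf

# Hypothetical path: a YOLOv3 (or Tiny v3) model exported beforehand as a SavedModel.
converter = tf.lite.TFLiteConverter.from_saved_model("./yolov3_saved_model")

def representative_data_gen():
    # Calibration batches at the model's input resolution; replace with real images.
    for _ in range(100):
        yield [np.random.rand(1, 416, 416, 3).astype(np.float32)]

converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_data_gen
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.uint8
converter.inference_output_type = tf.uint8

open("yolov3_int8.tflite", "wb").write(converter.convert())
```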
-
Hi,
I've been working with hls4ml to synthesize a model for the ZCU104. After quantization-aware training with QKeras (2 bits), I convert the model to hls4ml and run synthesis. However, when I…
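For context, a minimal QKeras-to-hls4ml flow of the kind described above (the toy model, quantizer settings, and output directory are placeholders; `xczu7ev-ffvc1156-2-e` is the ZCU104 part):

```python
import hls4ml
from qkeras import QActivation, QDense, quantized_bits, quantized_relu
from tensorflow.keras.layers import Input
from tensorflow.keras.models import Sequential

# Toy 2-bit QKeras model standing in for the real network.
model = Sequential([
    Input(shape=(16,)),
    QDense(32, kernel_quantizer=quantized_bits(2, 0, alpha=1),
           bias_quantizer=quantized_bits(2, 0, alpha=1)),
    QActivation(quantized_relu(2)),
    QDense(5, kernel_quantizer=quantized_bits(2, 0, alpha=1),
           bias_quantizer=quantized_bits(2, 0, alpha=1)),
])

# Per-layer precision configuration derived from the QKeras quantizers.
config = hls4ml.utils.config_from_keras_model(model, granularity='name')

hls_model = hls4ml.converters.convert_from_keras_model(
    model,
    hls_config=config,
    output_dir='hls4ml_prj',
    part='xczu7ev-ffvc1156-2-e',  # ZCU104
)
hls_model.compile()
hls_model.build(csim=False, synth=True)
```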
-
Hi, thanks for your excellent work.
After training a model with LSQplus, how can I export the scale/zero-point (S/Z) values, similar to how AIMET exports a .encodings file, so they can then be used in SNPE?
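One possible approach, sketched below, is to walk the trained model and dump each quantizer's learned step size and offset to JSON. The attribute names `s` and `beta`, and the output layout, are assumptions: match them to the actual LSQplus code and to whatever format SNPE expects.

```python
import json
import torch

def export_scale_offset(model: torch.nn.Module, path: str, bitwidth: int = 8):
    """Write learned quantization parameters to a JSON file.

    Assumption: each LSQ+-style quantizer stores its learned step size in `s`
    and its learned offset in `beta`; rename these to match the real modules.
    """
    encodings = {}
    q_min, q_max = -(2 ** (bitwidth - 1)), 2 ** (bitwidth - 1) - 1
    for name, module in model.named_modules():
        if hasattr(module, "s") and hasattr(module, "beta"):
            s = float(module.s.detach().abs().mean())
            beta = float(module.beta.detach().mean())
            encodings[name] = {
                "scale": s,
                "offset": int(round(-beta / s)),  # integer zero-point
                "min": s * q_min + beta,          # representable float range
                "max": s * q_max + beta,
                "bitwidth": bitwidth,
            }
    with open(path, "w") as f:
        json.dump(encodings, f, indent=2)
```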
-
The current version of MXNet only provides a register_forward_hook() function. However, a register_backward_hook() function would also be useful (e.g., for logging gradient information w.r.t. the block, or overwri…
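In the meantime, one workaround is to read (or overwrite) parameter gradients right after `backward()` returns, as in the sketch below; this is not a true backward hook, and the toy network is only a placeholder:

```python
import mxnet as mx
from mxnet import autograd, gluon

# Toy block standing in for the real network.
net = gluon.nn.Dense(4)
net.initialize()

x = mx.nd.random.uniform(shape=(8, 16))
y = mx.nd.random.uniform(shape=(8, 4))
loss_fn = gluon.loss.L2Loss()

with autograd.record():
    loss = loss_fn(net(x), y)
loss.backward()

# In place of a backward hook: inspect or modify gradients once backward() is done.
for name, param in net.collect_params().items():
    if param.grad_req != 'null':
        grad = param.grad()
        print(name, 'grad mean:', grad.mean().asscalar())
        # Overwriting is also possible, e.g. simple gradient clipping:
        # param.grad()[:] = mx.nd.clip(grad, -1.0, 1.0)
```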
-
Are there plans to add flash attention and also flash decoding, to improve performance for long contexts?
-
Hi!
Thank you for the paper! It is inspiring that you can compress weights to about 1 bit and the model still works better than random.
A practical sub-2-bit quantization algorithm would be a grea…
-
Hi, when I try quantization-aware training on my model, I get the following error for my 'CustomLayerMaxPooling1D' layer:
---------------------------------------------------------------------------
…
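If this is the usual "layer is not supported" error that tfmot raises for custom layers, one possible fix is sketched below: annotate the custom layer with a `QuantizeConfig` and apply quantization inside `quantize_scope`. The stand-in layer and the no-op config (pooling has no weights to quantize) are assumptions about the original setup:

```python
import tensorflow as tf
import tensorflow_model_optimization as tfmot

quantize_layer = tfmot.quantization.keras.quantize_annotate_layer


class CustomLayerMaxPooling1D(tf.keras.layers.MaxPooling1D):
    """Stand-in for the custom pooling layer from the question."""


class NoOpQuantizeConfig(tfmot.quantization.keras.QuantizeConfig):
    """Pooling has no weights, so nothing inside the layer needs quantizing."""

    def get_weights_and_quantizers(self, layer):
        return []

    def get_activations_and_quantizers(self, layer):
        return []

    def set_quantize_weights(self, layer, quantize_weights):
        pass

    def set_quantize_activations(self, layer, quantize_activations):
        pass

    def get_output_quantizers(self, layer):
        return []

    def get_config(self):
        return {}


annotated_model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(64, 8)),
    quantize_layer(tf.keras.layers.Conv1D(16, 3, padding="same")),
    quantize_layer(CustomLayerMaxPooling1D(pool_size=2),
                   quantize_config=NoOpQuantizeConfig()),
    tf.keras.layers.Flatten(),
    quantize_layer(tf.keras.layers.Dense(10)),
])

# Custom classes must be visible to quantize_apply via quantize_scope.
with tfmot.quantization.keras.quantize_scope(
        {"CustomLayerMaxPooling1D": CustomLayerMaxPooling1D,
         "NoOpQuantizeConfig": NoOpQuantizeConfig}):
    qat_model = tfmot.quantization.keras.quantize_apply(annotated_model)
```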