-
The [Quantization aware training in Keras example](https://www.tensorflow.org/model_optimization/guide/quantization/training_example?hl=en) mentions the following after performing quantization aware t…
-
1. Is it possible to add quantization? With ai-toolkit I get 22.1 GB of VRAM usage doing 1024 training, and 1400 steps finish in 45 minutes;
in kohya, 1024 training takes me nearly 2 hours with the same number of steps.
…
-
Hi,
I am trying to use PyTorch's native QAT instead of pytorch_nndct, and then use Vitis AI's quantization and compilation for a VCK190. Is there a way to do this? If not, would the new ONNX compati…
-
### 🚀 The feature, motivation and pitch
I'm trying to use float8 to run Mistral NeMo, which was trained in a quantization-aware way, for inference in FP8.
Ref:
- https://mistral.ai/news/mistral-…
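For context, here is a quick pure-Python sketch (an illustration only, not vLLM's or Mistral's implementation) that enumerates the representable values of the OCP E4M3 format, which is the format usually meant by "FP8 inference": 1 sign bit, 4 exponent bits (bias 7), 3 mantissa bits, with the all-ones exponent/mantissa code reserved for NaN.

```python
def e4m3_values():
    """Enumerate all finite values representable in OCP E4M3 FP8."""
    vals = {0.0}
    for s in (1, -1):
        # Exponent field 0: subnormals, value = sign * m/8 * 2^-6
        for m in range(1, 8):
            vals.add(s * m / 8 * 2**-6)
        # Exponent fields 1..15: normals, value = sign * (1 + m/8) * 2^(e-7)
        for e in range(1, 16):
            for m in range(8):
                if e == 15 and m == 7:
                    continue  # this code point encodes NaN in E4M3
                vals.add(s * (1 + m / 8) * 2 ** (e - 7))
    return sorted(vals)


def nearest_e4m3(x):
    """Round a float to the nearest representable E4M3 value."""
    return min(e4m3_values(), key=lambda v: abs(v - x))


vals = e4m3_values()
print(max(vals))  # 448.0, the largest finite E4M3 value
```

The small dynamic range (max 448, smallest positive subnormal 2^-9) is why FP8 inference pipelines pair the format with per-tensor or per-channel scaling factors.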
-
Thank you for the amazing work. I was able to set up the BEVFusion inference using the model files given in the README.
I want to use this pipeline for BEVFusion trained on my dataset, so as per the […
-
When I run the following command to fine-tune Quantized BERT on MRPC,
```
nlp-train transformer_glue \
  --task_name mrpc \
  --model_name_or_path bert-base-uncased \
  --model_type quant_bert \…
```
-
Hi, I am training a model using quantization-aware training, and I have a couple of questions:
1.
It seems to actually increase the size of the model (from ~87 MB to ~137 MB). I have come across th…
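One way to see why a QAT checkpoint can grow rather than shrink: during training the weights stay in float, and the framework only simulates quantization ("fake quantization") in the forward pass, storing the quantization parameters alongside the original float weights; the real size reduction appears only after export to a genuinely integer format. A minimal pure-Python sketch of the quantize-dequantize step (illustrative only, not any particular framework's implementation):

```python
def quantize(x, scale, zero_point, qmin=-128, qmax=127):
    """Map a float value to a clamped int8 code."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def dequantize(q, scale, zero_point):
    """Map an int8 code back to an approximation of the float value."""
    return (q - zero_point) * scale


def fake_quant(x, scale, zero_point):
    """Quantize-dequantize round trip: what QAT inserts into the forward
    pass so the float weights learn to tolerate quantization error."""
    return dequantize(quantize(x, scale, zero_point), scale, zero_point)


# With scale 0.1, representable values form a grid of multiples of 0.1,
# clamped to the int8 range [-12.8, 12.7].
print(fake_quant(0.337, 0.1, 0))  # ≈ 0.3, the nearest grid point
```

Since both the float weights and the per-layer scale/zero-point (plus any observer statistics) are saved, the QAT checkpoint carries more data than the original float model, which matches the ~87 MB → ~137 MB growth described above.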
-
When I try to convert a custom model to a TFLite model, I get all-NaN outputs:
```
[{'name': 'input_1', 'index': 0, 'shape': array([ 1, 416, 416, 3], dtype=int32), 'shape_signature': array([ -1, 41…
-
### Report of performance regression
Hi I use this:
```
server_vllm.py \
--model "/data/models_temp/functionary-small-v2.4/" \
--served-model-name "functionary" \
--dtype=bfloat16 \
-…
-
## ❓ Questions
Hi, when I try to reproduce the training code based on your released part, I run into a problem when training with multiple GPUs: I find that [https://github.com/facebo…