-
Had this idea and discussed briefly with @andrewor14.
Conceptually, the current QAT + FSDP flow looks like this:
- sharded FP32 weight -> all-gather in BF16 -> fake quantize
However, we can do low-…
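The fake-quantize step in that flow can be sketched minimally as follows (a generic symmetric int8, per-tensor scheme in plain PyTorch; this is an illustration, not torchao's actual implementation, and the bit width is an assumption):

```python
import torch

def fake_quantize(w: torch.Tensor, bits: int = 8) -> torch.Tensor:
    """Quantize to a symmetric integer grid, then dequantize immediately,
    so the forward pass sees quantization error while staying in float."""
    qmax = 2 ** (bits - 1) - 1                      # 127 for int8
    scale = w.abs().max().clamp(min=1e-8) / qmax    # one per-tensor scale
    q = torch.clamp(torch.round(w / scale), -qmax - 1, qmax)
    return q * scale

# Mimicking the flow above: the all-gathered BF16 weight is
# fake-quantized in FP32, then cast back.
w_bf16 = torch.randn(16, 32, dtype=torch.bfloat16)
w_fq = fake_quantize(w_bf16.float()).to(torch.bfloat16)
```

In actual QAT the `round` would be wrapped in a straight-through estimator so gradients pass through unchanged.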
-
I'm currently using AIMET 1.29. When I do QAT with the per-channel config, training takes almost 50x longer than with the default config. Is there a solution?
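For context on where the extra work comes from: per-channel quantization keeps one scale per output channel instead of a single per-tensor scale, so range tracking and scale updates multiply with the channel count. A generic PyTorch sketch of the two scale computations (hypothetical helper names, not AIMET code):

```python
import torch

def per_tensor_scale(w: torch.Tensor, bits: int = 8) -> torch.Tensor:
    # A single scalar scale for the whole tensor.
    qmax = 2 ** (bits - 1) - 1
    return w.abs().max().clamp(min=1e-8) / qmax

def per_channel_scale(w: torch.Tensor, bits: int = 8) -> torch.Tensor:
    # One scale per output channel (dim 0): many more statistics
    # to track and update on every training step.
    qmax = 2 ** (bits - 1) - 1
    amax = w.abs().amax(dim=tuple(range(1, w.dim())), keepdim=True)
    return amax.clamp(min=1e-8) / qmax

w = torch.randn(64, 128)
s = per_channel_scale(w)                       # shape (64, 1)
w_fq = torch.clamp(torch.round(w / s), -128, 127) * s
```

Whether that fully explains a 50x slowdown depends on the framework's implementation, but it shows why per-channel is inherently more expensive.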
-
**Problem description**
Following PaddleDetection's [model compression docs](https://github.com/PaddlePaddle/PaddleDetection/blob/8377e846439a709f5ab3ac6948d768221b5cf1e6/configs/slim/README.md), I ran quantization-aware training on my trained PP-YOLOE+ s model. The quantized model, and the one exported as…
-
### 💡 Your Question
I have followed exactly the same steps for model training followed by PTQ and QAT as described in the official SuperGradients notebook:
https://github.com/Deci-AI/super-gradients/blob…
-
I am trying to use QAT to quantize the Qwen2 1.5B model.
The error is raised from the function `training.load_from_full_model_state_dict(
model, model_state_dict, self._device, self._is_rank_zero, strict=T…
-
### 🚀 The feature, motivation and pitch
Currently the qnn quantizer only supports PTQ (post-training quantization), and we'd like to enable QAT (quantization-aware training) for better quantization supp…
-
Thanks for your contributions. I see you implemented and evaluated QAT for YOLOv9 sizes c and e. Can I run QAT on YOLOv9 sizes m, s, and n, and what performance would they achieve?
-
Currently torchao QAT has two APIs, [tensor subclasses](https://github.com/pytorch/ao/blob/a4221df5e10ff8c33854f964fe6b4e00abfbe542/torchao/quantization/prototype/qat/api.py#L41) and [module swap](htt…
-
In our CI setup for HW testing, we want QAT Engine to return an error code if the HW fails. Currently, we build a custom engine with:
```
--disable-qat_sw \
--with-qat_engine_id=qathwtest && \…
-
I tried the original QAT code.
```
model = llama3(
vocab_size=4096,
num_layers=16,
num_heads=16,
num_kv_heads=4,
embed_dim=2048,
max_seq_len=2048,…