Open chuck-ma opened 1 week ago
Do share update if it did work,
Well, it will take 70 hours to quantize. But I have no money. Any idea about how to speed up ? @synxlin @bobboli
Also encountering this taking 12 hours on an H100
Hi, @chuck-ma @dome272 ,
We are working on improving our codebase to support fast calibration without online activation generation. We'll keep this issue updated.
I'm currently using H800 to do Smooth Quantization for my custom flux transformer. I'm wondering how long it would take to finish quantization. I have been quantizing for 20 minutes, but the progress bar is still empty.
python -m deepcompressor.app.diffusion.ptq configs/model/flux.1-custom.yaml configs/svdquant/int4.yaml --save-model /root/autodl-tmp/flux.1-custom-svdquant-int4