hi, thanks for your nice work, I tried set param quantize_output as True when quantize model,like follow code
def from_float(module, weight_quant='per_channel', act_quant='per_token', quantize_output=True),
but the acc drop to 0 , is there anything wrong or smoothquant method unsupport quantize output?
hi, thanks for your nice work, I tried set param quantize_output as True when quantize model,like follow code
def from_float(module, weight_quant='per_channel', act_quant='per_token', quantize_output=True)
, but the acc drop to 0 , is there anything wrong or smoothquant method unsupport quantize output?