Open HuDi2018 opened 4 years ago
It seems that you compute a new set of activation scales on every batch and then use those scales to quantize and dequantize the activations. Is that correct?
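To make the question concrete, here is a minimal sketch of what I mean by per-batch activation scales. It is only my own illustration and not taken from the repository: the helper names `batch_scale` and `quantize_dequantize`, the symmetric 8-bit scheme, and the max-absolute-value rule for the scale are all assumptions.

```python
def batch_scale(batch, num_bits=8):
    """Scale derived from this batch only (assumed: symmetric, max-abs based)."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for 8 bits
    max_abs = max(abs(x) for x in batch)
    # Fall back to 1.0 for an all-zero batch to avoid dividing by zero.
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize_dequantize(batch, num_bits=8):
    """Quantize to integers with the batch's own scale, then map back to floats."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = batch_scale(batch, num_bits)
    quantized = [max(-qmax, min(qmax, round(x / scale))) for x in batch]
    dequantized = [q * scale for q in quantized]
    return dequantized, scale


# Two hypothetical activation batches with different value ranges.
batch_a = [0.5, -1.27, 0.02]
batch_b = [10.0, -3.0, 0.02]

deq_a, scale_a = quantize_dequantize(batch_a)
deq_b, scale_b = quantize_dequantize(batch_b)
```

In this sketch the two batches end up with different scales, so the same activation value (0.02 here) is rounded differently depending on which batch it appears in. That per-batch dependence is what I am asking about.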