Open beitong95 opened 1 month ago
I was wondering whether you have tried quantizing the model to int8, since you mention efficiency in the paper. I would like to run the model on an edge device where only an int8 accelerator is available.