Closed copybara-service[bot] closed 7 months ago
Set the QTensor's dequant_dtype during SERVE mode to scale_t's dtype.
Set the QTensor's dequant_dtype during SERVE mode to scale_t's dtype.