Open juberti opened 3 weeks ago
Quantize Ultravox to fp8 and determine how this affects the model's inference performance as well as speed. This would entail
Quantize Ultravox to fp8 and determine how this affects the model's inference performance as well as speed. This would entail