Closed yuekaizhang closed 1 year ago
Support whisper via onnx fp16 using triton.
Some perf results attached here:
Decoding on a single V100 GPU, audios are padding to 30s, using aishell1 test set files
@csukuangfj Would you mind checking this PR when you are free, many thanks!
Thanks! Left some minor comments.
Thanks, done!
Support whisper via onnx fp16 using triton.
Some perf results attached here:
Decoding on a single V100 GPU, audios are padding to 30s, using aishell1 test set files