livepeer / ai-worker

https://livepeer.ai
MIT License

Enable new models in audio-to-text #163

Closed eliteprox closed 1 week ago

eliteprox commented 3 months ago

This change adds support for two new Whisper models: distil-whisper/distil-large-v3 and openai/whisper-medium.

It also optimizes those models by loading each one in the appropriate precision: bfloat16, float16, or float32.
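
As a rough illustration of the precision selection described above, here is a minimal sketch. The policy shown (bfloat16 where the GPU supports it, float16 on other GPUs, float32 on CPU) is an assumption for illustration and is not taken from this PR's diff; `DeviceInfo` and `select_dtype_name` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeviceInfo:
    """Hypothetical description of the inference device."""
    has_cuda: bool
    supports_bf16: bool


def select_dtype_name(device: DeviceInfo) -> str:
    """Return the name of the precision to load the model in.

    Assumed policy (not confirmed by the PR): prefer bfloat16 when the
    GPU supports it, fall back to float16 on other GPUs, and use
    float32 when no GPU is available.
    """
    if not device.has_cuda:
        return "float32"
    if device.supports_bf16:
        return "bfloat16"
    return "float16"


if __name__ == "__main__":
    # Example: a GPU without bfloat16 support falls back to float16.
    print(select_dtype_name(DeviceInfo(has_cuda=True, supports_bf16=False)))
```

With PyTorch, the two flags could be filled in from `torch.cuda.is_available()` and `torch.cuda.is_bf16_supported()`, and the returned name resolved to a dtype with `getattr(torch, name)`. Keeping the policy in a plain function makes it easy to test without a GPU.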

Credit to @ad-astra-video for initially exploring these models and optimizations.

eliteprox commented 1 month ago

@rickstaa I made several changes since you last reviewed this PR, so I held off on merging. Could you or @ad-astra-video re-review the latest changes?