Closed: YHSI5358 closed this issue 1 month ago
At this point, it's not possible. The turbo model is only available through HF Transformers, and I removed Transformers support due to its erratic performance and out-of-memory errors. Once the model is released outside of Transformers, it can be added.
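For anyone following along, this is roughly what the Transformers path looks like; a minimal sketch assuming the `openai/whisper-large-v3-turbo` checkpoint and illustrative settings, not subgen's actual code:

```python
# Illustrative sketch only: loading the turbo checkpoint through the
# HF Transformers ASR pipeline. Model id and settings are assumptions.
import torch
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-large-v3-turbo",  # turbo ships as an HF checkpoint
    torch_dtype=torch.float16,
    device="cuda:0",
)

# Long-form audio is handled by chunking; batch_size drives VRAM use,
# which is where the OOM errors mentioned above tend to come from.
result = asr("audio.wav", chunk_length_s=30, batch_size=8)
print(result["text"])
```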
I tried the transformers branch again and am still seeing similar issues. Transformers requires a large amount of memory to batch correctly, and too much user configuration to tune beam_size so it doesn't throw OOM errors. My limited testing on my 8 GB card again shows that it performs worse than the current setup with subgen. If you want to squeeze more out of your card, you can try running multiple transcriptions by bumping up your concurrent transcriptions variable; see the sketch below.
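Here's a rough sketch of what concurrent transcriptions look like. The `CONCURRENT_TRANSCRIPTIONS` variable name, model size, and file names are assumptions for illustration, not subgen's exact implementation:

```python
# Rough sketch of concurrent transcriptions with faster-whisper.
# Env var name and settings are illustrative assumptions.
import os
from concurrent.futures import ThreadPoolExecutor

from faster_whisper import WhisperModel

# Assumed env var controlling how many transcriptions run at once.
workers = int(os.getenv("CONCURRENT_TRANSCRIPTIONS", "2"))

# One shared model; num_workers lets multiple threads call transcribe
# in parallel, and int8 keeps VRAM use low on an 8 GB card.
model = WhisperModel("medium", device="cuda", compute_type="int8",
                     num_workers=workers)

def transcribe(path: str) -> str:
    # A lower beam_size reduces memory pressure at a small accuracy cost.
    segments, _info = model.transcribe(path, beam_size=3)
    return " ".join(seg.text for seg in segments)

# Hypothetical media files, just to show the fan-out.
files = ["ep1.mkv", "ep2.mkv", "ep3.mkv"]
with ThreadPoolExecutor(max_workers=workers) as pool:
    for text in pool.map(transcribe, files):
        print(text[:80])
```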
This is the latest and reportedly most capable model, and it was released a while ago. I hope it can be added so everyone can get more out of it.