Open 0xT3chN0 opened 7 months ago
This seems very interesting if it's easy to implement
@nicholaskoerfer How is the accuracy of it? It's not worth gaining speed if the subtitles are worse
HuggingFace has a webversion up and holy crap it's fast. 21 min youtube video in about 8 seconds
Yes, it's incredibly fast. In my opinion, the subtitles are accurate.
Is this going to be worked on? @ahmetoner
Hmm, this is pretty interesting great find! Would be cool to have a near-instant service even if the quality is slightly worse.
Hi, thanks for this Software. It works perfect with Bazarr, for subtitle Translation.
However, it is, even with Faster-Whisper and a Tesla T4 GPU a bit slow. There is a new Whisper implementation, that can Transcribe 1 Hour of Audio in approx. 15 seconds.
The new Whisper implementation is called "Whisper JAX" (https://github.com/sanchit-gandhi/whisper-jax). It has support for CPU, GPU and even TPU, though there is already a big speed gain just by using a GPU.
Is it possible for one to add this whisper implementation to this ASR Webservice, that you can select between OpenAI Whisper, Faster Whisper and Whisper JAX?
Thanks!