SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2
MIT License
12.2k stars 1.02k forks source link

Speculative Decoding #771

Closed RohitMidha23 closed 7 months ago

RohitMidha23 commented 7 months ago
  1. Is speculative decoding faster than faster-whisper?

  2. Is there going to be support anytime soon for speculative decoding in faster-whisper?

Both of these questions are asked with a purely realtime, streaming audio view in mind!

trungkienbkhn commented 7 months ago

@RohitMidha23 , hello.

  1. I tested speculative decoding for whisper from here, then compared with fw large-v2. I used an audio file 192s and RTX 3090 GPU card. Below are the transcription times:
  1. Yes we will look into this
RohitMidha23 commented 7 months ago

Thanks for those results! Looking forward to seeing speculative decoding in Faster Whisper!

S-Cardenas commented 4 months ago

Is there any update on adding speculative decoding to Faster Whisper? If there is an open ticket or pull request I'd be willing to help and contribute to it.