KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
MIT License
2.09k stars 190 forks source link

New Function: Use transcription as prompts #112

Open fireindark707 opened 2 months ago

fireindark707 commented 2 months ago

I believe an effective way to improve overall stability is to use the previously generated transcription results as the next prompt.

KoljaB commented 2 months ago

I'm currently not aware of any research or official documentation from OpenAI (Whisper's creators) or Guillaume Klein (faster_whisper's creator) showing improvements with this approach. I think using previous real-time transcriptions as prompts could potentially introduce biases / errors if the previous transcription was inaccurate. If you have specific examples or URLs showing improvements with these kind of prompts in faster_whisper, I'd be very interested in seeing them.