egorsmkv / speech-recognition-uk

Speech Recognition for Ukrainian
https://t.me/speech_recognition_uk

STT Example? #36

Closed (hellcezar closed this issue 2 months ago)

hellcezar commented 2 months ago

Greetings, how can I use the model for speech-to-text, for example to transcribe a wav/mp3/mp4 file and write all the recognized speech to a file? Is it also possible to produce timestamps like those in SRT files?

egorsmkv commented 2 months ago

https://github.com/egorsmkv/speech-recognition-uk/tree/master/speech-to-text

Here are the Python scripts you can use.

egorsmkv commented 2 months ago

"to make timestamps like in SRT files"

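For this you can use a VAD (voice activity detection) tool such as PyAnnote to find where speech starts and ends, or a Whisper model, which returns timestamped segments along with the text.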
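
Whichever tool produces the segments, turning them into an SRT file is a small formatting step. Below is a minimal sketch in plain Python, not code from this repository. It assumes you already have a list of `(start_seconds, end_seconds, text)` tuples, for example built from VAD output plus a recognizer, or from the `start`, `end` and `text` fields of Whisper's segments. The function names `format_srt_time` and `segments_to_srt` and the sample segments are hypothetical and used only for illustration.

```python
def format_srt_time(seconds: float) -> str:
    """Convert a time in seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text from an iterable of (start, end, text) tuples.

    Each subtitle block is: a 1-based index, a "start --> end" line,
    the text, and a blank line separating it from the next block.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_srt_time(start)} --> {format_srt_time(end)}\n"
            f"{text.strip()}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    # Hypothetical segments; in practice these come from your VAD or Whisper run.
    example_segments = [
        (0.0, 1.5, "Привіт"),
        (1.5, 3.0, "світ"),
    ]
    print(segments_to_srt(example_segments))
```

To save the result, write the returned string to a file with UTF-8 encoding, for example `open("output.srt", "w", encoding="utf-8")`.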