-
Training speech recognition and text-to-speech models from scratch in Azerbaijani will require a comprehensive dataset of high-quality audio and corresponding text transcriptions. Here are the steps t…
-
The .NET SDK doesn't support streaming transcription. This is a very important feature for us. Is this something you're considering?
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
Maybe we could use the Web Speech API to create a plugin that records spoken responses with automated speech recognition?
This could be based on the html-audio-response plugin and used to run tasks …
-
I have a diarization application in which I prefer to have fewer false alarms at the expense of more misses. Can this be controlled during fine tuning?
Thanks
Michael
-
Google Speech Recognition: we're sorry but your computer or network may be sending automated queries to protect our users we can't process your request right now for more details visit www.google.com
…
-
Hi @deboradum , laat je nog even weten als de video en de demo nu zo ongeveer hetzelfdfe zijn? Dan maak ik een blogpostje met de 2 links en stuur dat naar de mensen die we uitgenodigd hadden.
Heb …
-
## Introduction
Computers can turn speech into text. It's sometimes called "Speech Recognition".
It takes a lot of previewing per and memory, to run some funky algorithms to transcode an audio f…
-
When using whisper to generate subtitles in srt format, I noticed after a certain period of time (around 1 hour), the subtitle starts to be out of sync with the video. I tested generating the subtitle…
-
As a user,
I would like to be able to view a transcript of an audio or video object in real time as the audio or video is playing on the screen,
so that I can better navigate the content.