automated-speech-recognition Search Results

GasimV/Commercial_Projects #4

Data Collection

Training speech recognition and text-to-speech models from scratch in Azerbaijani will require a comprehensive dataset of high-quality audio and corresponding text transcriptions. Here are the steps t…

GasimV updated 2 weeks ago

aws/aws-sdk-net #1847

Add Streaming transcription support to the .NET SDK

The .NET SDK doesn't support streaming transcription. This is a very important feature for us. Is this something you're considering?

ernestodossantos updated 3 weeks ago

GasimV/Commercial_Projects #2

Speech Processing Models

`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…

GasimV updated 6 days ago

jspsych/jsPsych #2404

Audio-response plugin with speech recognition? (Web Speech A…

Maybe we could use the Web Speech API to create a plugin that records spoken responses with automated speech recognition? This could be based on the html-audio-response plugin and used to run tasks …

becky-gilbert updated 2 years ago

pyannote/pyannote-audio #1584

False Alarms vs Misses

I have a diarization application in which I prefer to have fewer false alarms at the expense of more misses. Can this be controlled during fine tuning? Thanks Michael

picheny-nyu updated 2 weeks ago

eastee/rebreakcaptcha #9

Google Speech Recognition: we're sorry but your computer or …

Google Speech Recognition: we're sorry but your computer or network may be sending automated queries to protect our users we can't process your request right now for more details visit www.google.com …

Ke100n4ik updated 5 years ago

deboradum/bachelorThesis #13

video en demo "hetzelfde"

Hi @deboradum , laat je nog even weten als de video en de demo nu zo ongeveer hetzelfdfe zijn? Dan maak ik een blogpostje met de 2 links en stuur dat naar de mensen die we uitgenodigd hadden. Heb …

maartenmarx updated 2 weeks ago

ethberlinzwei/Find-A-Team #28

Speech Recognition Infrastructure Services on Ethereum (DAO-…

## Introduction Computers can turn speech into text. It's sometimes called "Speech Recognition". It takes a lot of previewing per and memory, to run some funky algorithms to transcode an audio f…

chrishobcroft updated 4 years ago

sunmingtao/sample-code #351

Subtitles generated by Whisper start to be out of sync with …

When using whisper to generate subtitles in srt format, I noticed after a certain period of time (around 1 hour), the subtitle starts to be out of sync with the video. I tested generating the subtitle…

sunmingtao updated 1 month ago

psu-libraries/cho #24

Audio/Visual transcripts

As a user, I would like to be able to view a transcript of an audio or video object in real time as the audio or video is playing on the screen, so that I can better navigate the content.

jennielevineknies updated 5 years ago

283 results for automated-speech-recognition

283 results
for automated-speech-recognition