audio-to-text Search Results

1000+ results
for audio-to-text

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

sensein/senselab #85

Task: Language Detection

### Description As far as I understand, this should take in an Audio or a ScriptLine and output a Language object ### Tasks - [ ] Select default model for either audio or text - [ ] Get working wi…

ibevers updated 3 weeks ago
1
speechbrain/speechbrain #2626

A lot of softlinks have been created when doing emotion clas…

### Describe the bug I am doing emotion classification on waveforms using the speechbrain IEMOCAP hugging face interface. The code was executed perfectly. But meanwhile it creates a lot fo softlinks …

underdogliu updated 1 day ago
1
ggerganov/whisper.cpp #2257

Core ML support: AttributeError: `np.issctype` was removed i…

trying to install the Core ML support on a macbook pro m3 running ``` ./models/generate-coreml-model.sh base.en Torch version 2.3.1 has not been tested with coremltools. You may run into unexpe…

BastLeblanc updated 4 weeks ago
1
KoljaB/RealtimeSTT #89

An attempt has been made to start a new process before the c…

I'm trying to use STT but sometimes it works and sometimes no, Please tell me what I need to do to fix that. ``` Traceback (most recent call last): File "", line 1, in File "/usr/lib/pytho…

ahmed-troido updated 1 day ago
1
bootiful-media-mogul/mogul-service #4

transcription using whisper

build a transcription integration flow that sends an .mp3 and sends it to whisper (which i need to deploy) and then returns the text from the audio (use a Spring INtegration gateway perhaps?)

joshlong updated 1 day ago
2
Azure-Samples/cognitive-services-speech-sdk #2416

Certain voices not providing viseme durations as expected

*Describe the bug* Certain TTS voices are not providing speechmarks with viseme timings. For example, all the Urdu Azure TTS voices provide word timings but do not provide viseme timings which is w…

trulience updated 2 weeks ago
3
WhitmanCS370/OGGS_repo #30

research library for text to audio

Find a library that supports text to audio, or audio to text.

ObaltzerS updated 4 months ago
1
coqui-ai/TTS #3815

Can't read Bengali year ১৯৫৪ সাল। কালো রাত। [Bug]

### Describe the bug I was testing the Bengali Voice model and it missed the Bengali number pronunciation. Bengali numbers ০ ১ ২ ৩ ৪ ৫ ৬ ৭ ৮ ৯ 0 1 2 3 4 5 6 7 8 9. ১৯৫৪ সাল। কালো রাত। Here is su…

khandakershahi updated 2 weeks ago
5
ajitesh123/Perf-Review-AI #113

Archieai: Add Audio Input for Performance Reviews

### Details _No response_ ### Branch _No response_

ajitesh123 updated 5 days ago
8
DigitalPhonetics/IMS-Toucan #183

English to another language synthesis

The voice cloning and prosody cloning are amazing. But i want to clone the prosody but synthesize speech in another language. Not having any luck so far, any help? I noticed the models only accepts…

Vicopem01 updated 15 hours ago
1

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for audio-to-text

1000+ results
for audio-to-text