-
I was wondering if it possible to add more speech to text services to the pipeline since Whisper is so good with English language but not as much with Arabic language.
so can I add for example google…
-
There is an open source licensed speech recognition AI that works with really good accuracy: https://github.com/openai/whisper
The databases are included, it is python based and able to convert acc…
-
**Describe the bug**
A call to `SpeechSynthesizer.StopSpeakingAsync()` does not stop synthesis for a very long time, up to 30 seconds. The log file is here: [speech.log](https://github.com/Azure-Sa…
-
### Discussed in https://github.com/R3gm/SoniTranslate/discussions/75
Originally posted by **fiaful** August 7, 2024
Hi,
the work done so far is simply fantastic and exciting!
I saw th…
-
Request for Text to speech feature which used to have it in goldendict
In goldendict, Dictionaries > Sources > Text to Speech can setup the feature, but in ng version, I can't find this function an…
-
这个api怎么调用?还是要自己封装
-
**Why**
With quite a few models available, great pricing, the ability to add your own models of fine-tunes, and a fairly simple API, NLP Cloud would be a great addition to big-AGI
**Description**
…
-
-
1. Due to a delay in text input, voice generation was cancelled. A timeout parameter was set according to the example, but it did not take effect
properties["SpeechSynthesis_FrameTimeoutInterval"…
-
When I use text_to_speech to read a text aloud while using the audioPlays package to play a short sound effect, I get the following error message:
"Error configuring audio session: Error Domain=NSOSS…