-
# embeddings
An embedding is a numerical representation of a piece of information, for example, text, documents, images, audio, etc. The representation captures the semantic meaning of what is being …
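As a rough illustration of how an embedding can capture semantic meaning, the sketch below compares toy vectors with cosine similarity. The three-dimensional vectors and the names `cat`, `kitten`, and `car` are made up for this example and are not produced by any real embedding model; real embeddings typically have hundreds or thousands of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical, hand-written vectors (assumption: not real model output).
cat = [0.9, 0.1, 0.0]
kitten = [0.8, 0.2, 0.1]
car = [0.0, 0.1, 0.9]

# Items with related meaning should point in similar directions.
sim_cat_kitten = cosine_similarity(cat, kitten)
sim_cat_car = cosine_similarity(cat, car)

print(f"cat vs kitten: {sim_cat_kitten:.3f}")
print(f"cat vs car:    {sim_cat_car:.3f}")
```

With these toy values, the `cat` and `kitten` vectors score much closer to 1 than `cat` and `car`, which is the property that similarity search over embeddings relies on.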
-
Hi team, I am working on implementing a voice chatbot. For the speech-to-text part, I am using the RealtimeSTT library. Here, I am attempting to provide an audio file as inpu…
-
-
Dear WhisperSpeech maintainers,
I found multi-language models like [s2a-v1.95-medium-7lang.model](https://huggingface.co/WhisperSpeech/WhisperSpeech/blob/main/s2a-v1.95-medium-7lang.model) on huggi…
-
Path: /api-reference/text-to-speech
I am trying to set the language in an API call to Portuguese (Portugal).
I have tried the code:
`data = {
"text": "Eles jogam futebol aos domingos.",
…
-
### Describe the bug
When using the `filter` or `map` function to preprocess a dataset, a ValueError is encountered with the error message "array is too big; arr.size * arr.dtype.itemsize is larger t…
-
Just as images need alt-text, audio does too.
Janeway needs new functionality to allow for text to be attached to all non-text media in accordance with WCAG.
-
Is there any way we can also retrieve a character's voice response? Ideally I'd like to skip running a local TTS script, especially as C.Ai already provides us with this feature.
Could we find a way …
-
I've been using faster-whisper-server via Docker for weeks with no issues with my transcription script on Ubuntu, but suddenly the server is just broken.
I get this error whenever I try to transcr…
-
Proprietary music generation is far ahead of open source (see Suno, Udio et al).
Using your encodec method, please include text-to-music with synthetic English singing somehow. I'm not sure of the…