-
Hi all,
I tried to fix this problem several days, but I still cannot find the solution. Could someone help me? Thx in advance.
I use the Apple Os, and have installed all the tools required, such as …
-
Currently prompt2model is limited to text input text output tasks. The underlying framework can certainly handle different modalities, and it would be great to see prompt2model be able to handle diffe…
-
Hello, I am a researcher of speech synthesis, I am using HTS front-end generated labs for the voice synthesis, with Merlin synthesis module (04_synthesis.sh) for speech synthesis. We now have a few id…
-
Hello,
I'm a master student doing a thesis on music generation. My goal is to condition a neural network on emotion so as to generate "sad piano music" or "happy piano music" or "angry piano music"…
-
**Which AWS Services is the feature request for?**
aws-android-sdk-transcribe - AWS Transcribe Medical Streaming
**Is your feature request related to a problem? Please describe.**
No, I'd like t…
-
Dear Tushar Amdoskar,
I hope this message finds you well. I’ve identified key areas in SuperSpeechSaga that could benefit from improvements in performance and UI scalability.
Main Issues:
1. …
-
Would there be a way to generate long speeches ?
Because right now, it requires to be fed with at least 3 seconds of speech each time you want to inference something new. And if the length of the …
-
I admit that Bark is realistic speech sounding.
But there are couple of issues as following. Could someone please help me fix them?
- The output speech creates so many uh's and umm's even if none…
-
cboard app version 1.25.0
Fire HD 8 (8th generation)
App displays `WARNING: we did not detect an available Text to Speech voice! Cboard cannot work properly.` and produces no sound, but if I navig…
-
**Why**
With quite a few models available, great pricing, the ability to add your own models of fine-tunes, and a fairly simple API, NLP Cloud would be a great addition to big-AGI
**Description**
…