speech-generation Search Results

1000+ results
for speech-generation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

advaitjain/tflite-micro-sparkfun-edge-examples #63

compile success, but flash fails

Hi all, I tried to fix this problem several days, but I still cannot find the solution. Could someone help me? Thx in advance. I use the Apple Os, and have installed all the tools required, such as …

chtfrank updated 1 month ago
1
neulab/prompt2model #316

Support for non-text modalities (images, speech, video)

Currently prompt2model is limited to text input text output tasks. The underlying framework can certainly handle different modalities, and it would be great to see prompt2model be able to handle diffe…

neubig updated 8 months ago
4
CSTR-Edinburgh/merlin #144

A few ideas and would like to ask for your kind advice

Hello, I am a researcher of speech synthesis, I am using HTS front-end generated labs for the voice synthesis, with Merlin synthesis module (04_synthesis.sh) for speech synthesis. We now have a few id…

hdl2013win updated 7 years ago
7
fatchord/WaveRNN #81

Is training your waveRNN on piano music realistic ?

Hello, I'm a master student doing a thesis on music generation. My goal is to condition a neural network on emotion so as to generate "sad piano music" or "happy piano music" or "angry piano music"…

ghost updated 1 year ago
6
aws-amplify/aws-sdk-android #1435

Add Streaming speech-to-text support for aws-android-sdk-tra…

**Which AWS Services is the feature request for?** aws-android-sdk-transcribe - AWS Transcribe Medical Streaming **Is your feature request related to a problem? Please describe.** No, I'd like t…

AptFox updated 4 months ago
17
TusharAMD/SuperSpeechSaga #56

Enhancing Performance and UI Responsiveness in The Prototype

Dear Tushar Amdoskar, I hope this message finds you well. I’ve identified key areas in SuperSpeechSaga that could benefit from improvements in performance and UI scalability. Main Issues: 1. …

PrashantKumar39 updated 6 months ago
1
jasonppy/VoiceCraft #39

Generating long speeches

Would there be a way to generate long speeches ? Because right now, it requires to be fed with at least 3 seconds of speech each time you want to inference something new. And if the length of the …

RootingInLoad updated 7 months ago
5
suno-ai/bark #384

Stochastic speaking styles and unpredictable uh's & umm's

I admit that Bark is realistic speech sounding. But there are couple of issues as following. Could someone please help me fix them? - The output speech creates so many uh's and umm's even if none…

RahulBhalley updated 1 year ago
1
cboard-org/ccboard #60

Cannot Find Text to Speech Voice on Kindle Fire

cboard app version 1.25.0 Fire HD 8 (8th generation) App displays `WARNING: we did not detect an available Text to Speech voice! Cboard cannot work properly.` and produces no sound, but if I navig…

davesgonechina updated 1 year ago
1
enricoros/big-AGI #635

[Roadmap] Add support for NLP Cloud

**Why** With quite a few models available, great pricing, the ability to add your own models of fine-tunes, and a fairly simple API, NLP Cloud would be a great addition to big-AGI **Description** …

evilalmus updated 2 months ago
3

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for speech-generation

1000+ results
for speech-generation