audio-text Search Results

1000+ results
for audio-text

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

neonbjb/tortoise-tts #727

Is there a way to use several threads to perform multiple pr…

Hey There! I am new to TTS models, and therefore sorry if my question is naive... I created a simple HTTP server that receives text as input, and return the voice. My HTTP server calls the `Tortoi…

KfirAlfa updated 8 months ago
1
fedirz/faster-whisper-server #70

faster-whisper-server suddenly broken (ValueError: max() ite…

I've been using faster-whisper-server via Docker for weeks with no issues with my transcription script on Ubuntu, but suddenly the server is just broken. I get this error, whenever I try to transcr…

Arche151 updated 2 months ago
4
m-bain/whisperX #289

Question: How to use Alignment only?

We are trying to use whisperX to align text. We already know the script and just want the start/end. It should be easy, but the whisperx.align expects the start/end as well as the text as an input.…

thomasf1 updated 7 months ago
3
JarodMica/audiobook_maker #68

96% of the way through a 24 hour generation and I am hit wit…

I was doing a 700 page textbook when I discovered an error at 96% completion stating the following: `RuntimeError: Possible latent mismatch: try recomputing voice latents. Error: Too much text provid…

couchpotatochip21 updated 2 weeks ago
4
facebookresearch/seamless_communication #82

Poor transcription in less than 60 sec audio

Even after providing an audio file of 54 sec. It only provides me an one-line translation and the data loss is huge. what is the workaround even in the code I tried to change the MAX_INPUT_AUDIO_LENGT…

Azam2107 updated 1 day ago
10
haoheliu/AudioLDM #122

Hugging Face is down

Can you please fix it? https://huggingface.co/spaces/haoheliu/audioldm-text-to-audio-generation

tjasmin111 updated 3 months ago
2
Scottish-Tech-Army/Soundscape-Android #141

Handle media controls from headphones

I got this one from the tutorials. The text there is: > You can access certain features in Soundscape with the help of the media control buttons on your headphones. This functionality works with an…

davecraig updated 1 month ago
3
vsf-tv/gccg-api #20

Inconsistency in the Media Flow definition between the text …

Media flow definition seems to be different in the text and in the JSON. Is it just one essence or more essences? The text says "_A sequence of Media Elements belonging to the same media essence flow …

pac-work updated 5 months ago
2
Naozumi520/Bert-VITS2-Cantonese-Yue #6

How to train a new speaker?

Hi! I am from HK and just started learning about Cantonese TTS. My first goal is to train it on 林尚義's voice. I am starting with this [repo](https://github.com/hon9kon9ize/Bert-VITS2-Cantonese) as sugg…

tangfucius updated 2 months ago
2
HumanSignal/label-studio #5902

UI for recording audio dataset

While using LabelStudio I found that there is no way to create audio dataset with voice recordings. I've got a number of utterances (texts) and want to create the dataset with different voices. But…

ekaterina-poslavskaya updated 4 months ago
1

上一页 1...87 88 89 90 91 92 93...100 下一页

1000+ results for audio-text

1000+ results
for audio-text