text-to-audio Search Results

1000+ results
for text-to-audio

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

fedirz/faster-whisper-server #160

faster-whisper-server output seems something wrong

After a certain segment, all subsequent recognized texts are incorrect： ``` from openai import OpenAI client = OpenAI(api_key="cant-be-empty", base_url="http://192.168.31.100:8000/v1/") …

burness updated 1 week ago
1
gpt-omni/mini-omni #101

Question regarding data format and loss calculation in stage…

In stage 1, only ASR and TTS is used. ASR is Audio -> Text, so loss is only calculated for text tokens, not for audio tokens right? TTS is Text -> Audio, but mini-omni outputs text and audio sim…

sphmel updated 1 month ago
7
FunAudioLLM/CosyVoice #509

Zero-shot inference with additional instruction text

It could be helpful to control **speaking style** using prompt audio, and control **emotion** using instruction text. I attempted zero-shot inference by including the instruction text in the prompt us…

martinzwm updated 1 week ago
2
TEAMuP-dev/pyharp #25

Returning text & images to editor as well as audio

Dear Hugo et al, I have a gradio app that reads some audio and produces text and analysis graphs of it. (It doesn't actually change the audio.) It was looking like I could use PyHARP v0.1.0 to in…

drscotthawley updated 2 weeks ago
2
NCIOCPL/cgov-digital-platform #4484

Story: Add Audio-Described Files to Video Content Type and L…

## As a visually-impaired visitor to the site, I need to be able to access an audio-described version of videos put on pages through the Legacy Embedded WYSIWYG so that I can experience the content of…

andyvanavery31 updated 2 days ago
1
rany2/edge-tts #315

edge-playback opens the MPV GUI on Windows

I am using Windows 10 LTSC. When I execute the command `edge-playback --text "Hello, world!"`, the generated audio plays in the MPV window. 1. For MP3 files, the pop-up window seems unnecessary and…

ichat006 updated 7 hours ago
7
2noise/ChatTTS #814

How can i add sample voice and seed like in the webUI to thi…

I prefer to use a script and CLI to generate audio with ChatTTS rather than opening the webUI and want these features in my script: ![webui](https://github.com/user-attachments/assets/fe35822c-656a…

Atoli updated 2 weeks ago
5
SYSTRAN/faster-whisper #1119

After using VAD, the start and end times of the recognized s…

``` path = r"D:\Project\Python_Project\FasterWhisper\large-v3" model = WhisperModel(model_size_or_path=path, device="cuda", local_files_only=True) segments, info = model.transcribe("audio.wav",…

zlyMaster updated 2 weeks ago
3
DrewThomasson/ebook2audiobook #56

AssertionError: ❗ XTTS can only generate text with a maximu…

Just got the following error, seems to be hitting the limit in coqui TTS. ``` Chapter 50: 20%|████████████▌ | 1/5 [10:51

msameeh updated 3 days ago
4
langchain-ai/langchain #27717

Voice Input Support for Ollama Models

### Discussed in https://github.com/langchain-ai/langchain/discussions/27404 Originally posted by **kodychik** October 16, 2024 ### Checked - [X] I searched existing ideas and did not find …

efriis updated 3 weeks ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for text-to-audio

1000+ results
for text-to-audio