speech-generation Search Results

1000+ results
for speech-generation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

OpenPecha/tts-model #1

TTS lighter and faster model ( MM24 )

### Description The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…

gangagyatso4364 updated 1 month ago
4
steventan0110/DiffNorm #2

Question about target units normalization

I wonder if DiffNorm is designed to normalize target speech units. Why is the src-feat required during the training of VAE and Diffusion in the provided script? I read the paper and didn't see any me…

mailong25 updated 3 months ago
1
OFA-Sys/AIR-Bench #3

Request for Complete Test Script for Qwen2-Audio on AIR Benc…

Hi, I'm currently trying to replicate the performance of Qwen2-Audio on the AIR Bench. However, I noticed that the repository at [AIR-Bench](https://github.com/OFA-Sys/AIR-Bench/blob/main/score_cha…

whwu95 updated 3 months ago
7
vercel/ai #885

Text to Speech utils?

### Feature Description Love to see how AI SDK can handle Text to Speech from OpenAI. As I see from documentation, TTS can be streamed. https://platform.openai.com/docs/guides/text-to-speech/strea…

nabilfatih updated 2 days ago
6
10up/classifai #571

TTS Audio: Long post with 10+ mins of TTS audio does not get…

### Describe the bug 1. The TTS Speech service seems to limit the audio files to a maximum length of 10 mins. This is regardless of a free or paid account - https://learn.microsoft.com/en-us/azure/ai…

joshuaabenazer updated 4 months ago
3
StartHua/Comfyui_CXH_joy_caption #89

Joy_caption_alpha_run 加载错误

像是缺失了文件 Unrecognized model in D:\LIUGEGE\ComfyUI\models\Joy_caption_alpha\text_model. Should have a `model_type` key in its config.json, or contain one of the following strings in its name: albert, a…

LiuGe126 updated 1 month ago
2
myshell-ai/MeloTTS #100

How to automatically generate subtitles?

Only can use speech recognition after generation?

maxin9966 updated 6 months ago
1
huggingface/transformers #33845

Whisper Scoring Model Saving Errors due to Config+Generation…

### System Info - `transformers` version: 4.45.1 - Platform: Linux-5.10.225-213.878.amzn2.x86_64-x86_64-with-glibc2.31 - Python version: 3.11.9 - Huggingface_hub version: 0.25.1 - Safetensors ver…

gcervantes8 updated 2 weeks ago
6
modelscope/modelscope-agent #418

examples\agents\modelscopegpt_agent.ipynb 运行报错，找不到工具NotImple…

### Initial Checks - [X] I have searched GitHub for a duplicate issue and I'm sure this is something new - [X] I have read and followed [the docs & demos](https://github.com/modelscope/modelscope-age…

Usigned updated 5 months ago
2
kadirnar/ComfyUI-Transformers #12

ROADMAP of ComfyUI-Transformers

## Computer Vision: - [x] Add Depth Estimation pipeline - [ ] Add Image Classification pipeline - [ ] Add Image Segmentation pipeline - [ ] Add Mask Generation pipeline - [ ] Add Object Detecti…

kadirnar updated 4 months ago
1

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for speech-generation

1000+ results
for speech-generation