issues
search
lenML
/
Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
https://huggingface.co/spaces/lenML/ChatTTS-Forge
GNU Affero General Public License v3.0
710
stars
87
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[BUG:WebUI]
#168
kuhubmk
opened
5 hours ago
0
WebUI: improve SSML tools
#167
zhzLuke96
opened
1 day ago
0
增加部分文字中文翻译
#166
andywu188
closed
12 hours ago
1
Support FireRedTTS
#165
zhzLuke96
closed
1 day ago
1
[Feature] 界面上增加一个生成音频格式选择功能
#163
andywu188
opened
2 days ago
1
[ISSUE] 语音复刻出现问题,提示:Could not initialize NNPACK! Reason: Unsupported hardware
#162
No-22-Github
opened
1 week ago
2
[ISSUE] the docker config is broken
#161
highkay
closed
1 week ago
1
[BUG:FT] 选择说话风格后,文本会生成不完全,会少最后几句。
#160
wangqun888
opened
1 week ago
3
[BUG:WebUI] 上传参考音频输出的音色不匹配并且全是杂音,前摇也很长
#158
nulinuli
closed
1 week ago
3
[ISSUE] 无法使用GPU来运行
#157
kyrosin
closed
1 week ago
4
[BUG:WebUI] 点击Web页面上的“生成单频”按钮后报错
#154
haijd
opened
4 weeks ago
6
[Bug] chattts使用随机种子生成音色报错
#153
zydmtaichi
closed
4 weeks ago
6
[BUG] SSML duration/prosody 生成结果不对
#152
zhzLuke96
opened
1 month ago
0
[ISSUE] cuda11.6下部署该项目发生循环扫描torch版本,无法安装成功
#150
zydmtaichi
closed
15 hours ago
1
[BUG:API] 文本转语音无报错,但非常的慢,超过120S
#148
Ccccx
closed
1 month ago
1
[BUG:API] 上传Speaker接口,使用tensor来创建音色并且传入了name、gender、describe但是没有写入spkv1.json文件中
#147
coutlinx
opened
1 month ago
1
[ISSUE] 人声增强与背景降噪
#146
cpken
closed
1 month ago
0
[ISSUE] 如何应用韵律apply_prosody
#145
cpken
closed
1 month ago
0
[BUG:FT] cosyvoice 生成音频报错
#144
cpken
closed
1 month ago
1
[BUG:API] 部署到HF Space 调用接口报错 RuntimeError: CUDA must not be initialized in the main process on Spaces with Stateless GPU environment.
#143
steveoon
opened
1 month ago
5
[BUG:FT] AttributeError: 'Chat' object has no attribute 'gpt'
#142
cpken
closed
1 month ago
1
[assistance] Confirmation on Data Format and Structure for Fine-Tuning
#141
IrisSally
opened
1 month ago
2
[Feature] Support customized models path
#140
kerol123
opened
1 month ago
2
[BUG:WebUI] ASR 操作报错 Error: libcudnn_ops_infer.so.8
#139
cpken
closed
1 month ago
2
[BUG:API] stream流式api特别慢
#138
xiaozhu1106
closed
1 month ago
2
[ISSUE] ImportError: cannot import name 'AudioNormalizer' from partially initialized module 'modules.core.pipeline.factory' (most likely due to a circular import) (/data/chattts-forge/modules/core/pipeline/factory.py)
#137
shawnwu2022
closed
1 month ago
1
replace chattts modelscope_repo with AI-ModelScope
#136
fangd123
closed
1 month ago
1
Support FishSpeech SFT version
#135
zhzLuke96
closed
1 month ago
2
[BUG:API] /v1/tts 使用--compile后加载fishspeech卡死
#134
tuxiaoseng
opened
2 months ago
1
[BUG:WebUI] 生成的spk和参考音频不符
#133
tuxiaoseng
closed
1 month ago
5
[BUG:API] /v1/ssml
#132
cpken
closed
2 months ago
0
Improve WebUI 0.8
#131
zhzLuke96
closed
1 month ago
1
[Feature] Examples of Cloning Audio and Text with One-Shot TTS and Streaming Output
#130
IrisSally
closed
1 month ago
4
window 使用错误[ISSUE]
#129
wtjwrold
closed
1 month ago
1
[BUG:WebUI]
#127
2open1024
closed
2 months ago
1
[BUG:WebUI] Error: AttributeError: '_io.BufferedRandom' object has no attribute 'endswith'
#126
2open1024
closed
1 month ago
1
[ISSUE] API请求无法并行处理多个,服务器端总是一个处理完才处理下一个
#124
zsy-code
opened
2 months ago
4
ASR WebUI
#123
zhzLuke96
closed
1 month ago
1
support SenseVoice
#122
zhzLuke96
opened
2 months ago
0
ChatTTS emb finetune with `DVAE_full.pt`
#121
zhzLuke96
opened
2 months ago
7
[BUG:API] /prompt/refine接口异常
#120
tuxiaoseng
closed
2 months ago
1
[ISSUE] 在M芯片的MacBook上报错:RuntimeError: Placeholder storage has not been allocated on MPS device!
#119
atfa
opened
2 months ago
1
[ISSUE] 优化文本产生乱码
#117
ahkimkoo
closed
2 months ago
2
[BUG:API] /prompt/refine接口异常
#116
tuxiaoseng
closed
2 months ago
1
fix: topK and topP not valid
#115
wenyangchou
closed
2 months ago
1
[BUG:API] 使用/v1/speaker/create创建了一个spaker,调用/v1/audio/speech接口时报错
#114
DYHouse
closed
2 months ago
1
[Plan] support ChatTTS zero shot infer
#113
zhzLuke96
closed
2 months ago
3
[Plan] Add vLLM Wrapper
#112
zhzLuke96
opened
2 months ago
3
Support for One-Shot Voice Cloning and vLLM Integration in ChatTTS-Forge
#111
IrisSally
closed
1 month ago
5
[ISSUE] 关于开启--compile之后,webui输出窗口乱码以及api后台卡主
#110
kedgelee
closed
2 months ago
1
Next