FunAudioLLM CosyVoice issues

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

https://funaudiollm.github.io/

Apache License 2.0

6.47k stars 698 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

微调模型时train和eval的loss和梯度有时是nan

#677 ScottishFold007 opened 17 hours ago
0
请问一下这个shuffle_size参数的取值逻辑是什么，修改后会影响效果吗

#676 shenlou11 opened 20 hours ago
1
【VC模式】VC模式下的流式推理，已经是分段返回，但合成音频时仍然只有最后一段

#675 wang-TJ-20 closed 16 hours ago
2
怪异的表现

#674 jacksonjack001 opened 1 day ago
1
Question

#673 dillfrescott closed 1 day ago
2
是否有支持mps的计划呀，mac上用cpu跑挺亏的

#672 ondineyuga opened 3 days ago
1
Using an Instruct model for inference without embeddings, how can the speaker be specified?

#671 0xCAFEBABE0 opened 3 days ago
4
fix bug

#670 aluminumbox closed 3 days ago
0
[Question]How to support batch inference?

#669 wjj19950828 opened 4 days ago
2
请教一下TransformerEncoderLayer和ConformerEncoderLayer类中的forward函数的一些问题，这个函数中的pos_emb的维度是怎么确定的

#668 goingHan opened 4 days ago
0
0样本声音克隆,部署后台进程,显存持续增加未自动释放

#667 abo123456789 opened 4 days ago
1
cosyvoice在克隆语音时出现问题了， Error: The size of tensor a (5002) must match the size of tensor b (2) at non-singleton dimension 3

#666 YuChaoM opened 4 days ago
3
有支持8k采样率的模型吗？

#665 Dollhan opened 4 days ago
1
Two times of fade-in-out

#664 OswaldoBornemann opened 4 days ago
1
念古诗的时候断句不是根据标点而是根据词意

#663 interfacekun opened 5 days ago
1
same loss when running two experiments simultaneously under examples/libritts/cosyvoice

#662 dbkest opened 5 days ago
1
Unable to play generated audio in Safari

#661 klx1204 opened 6 days ago
2
关于300M-instruct模型的问题

#660 yyliuCecilia opened 6 days ago
1
About "base continue train".

#659 sunnnnnnnny opened 6 days ago
2
webui是否支持多路并发？开两个页面同时流式合成报错

#658 yangpeng-space opened 6 days ago
1
WER计算

#657 liuxin99 opened 1 week ago
1
无法打开网页，会报错，好像和activations.py有关

#656 ztxxkaty opened 1 week ago
2
【instruct模式性别变化】instruct模式下删除音色的embedding的原因

#655 wang-TJ-20 opened 1 week ago
1
流式模式下语音更完美的朗读

#654 lucasjinreal opened 1 week ago
5
resume training

#653 aluminumbox closed 1 week ago
0
从头训模型总是loss突然变成nan

#652 lilinqin opened 1 week ago
1
如果想添加一些方言克隆，是需要重新训练模型吗？

#651 hjj-lmx opened 1 week ago
3
25HZ的效果比50的差不少，会多字

#650 hjj-lmx opened 1 week ago
1
请问如何提升复制音色的相似度

#649 linweijiang closed 1 week ago
1
Is the SenseVoice-Large model currently commercialized?

#648 JinYuanZhang999 opened 1 week ago
1
No module named 'conformer'

#647 yyliuCecilia opened 1 week ago
1
发现预定义音色在生成长音频时声音粗细、音色不稳定，而使用3s复制生成、不稳定问题会减轻。请问如何自定义音色，例如将3s复制生成的音色保存下来供使用

#646 yangpeng-space opened 1 week ago
3
单3090TI训练爆内存

#645 ScottishFold007 opened 1 week ago
1
开头语音重复，末尾丢失

#644 ww18735135443 opened 1 week ago
1
Exporting the operator 'aten::scaled_dot_product_attention' to ONNX opset version 18 is not supported.

#643 donstang opened 1 week ago
1
flow模型SFT训练不收敛

#642 JohnHerry opened 1 week ago
3
“闵行（hang）区”读成 “闵行（xing）”，这种多音字读错的问题有朋友遇到过吗？如何解决？

#641 timadidas opened 1 week ago
1
关于 Speaker Interpolation 如何实现

#640 LongMarch7 opened 1 week ago
4
use stream read to save memory

#639 aluminumbox closed 2 weeks ago
0
Streaming inference

#638 OswaldoBornemann opened 2 weeks ago
1
最新版本文本正则丢字，官方例子下

#637 wantt closed 1 week ago
1
data_pipeline中sort是必要的处理步骤吗

#636 shenlou11 opened 2 weeks ago
1
Update Dockerfile with CUDA 11.8

#635 swang109 opened 2 weeks ago
1
Ubuntu 运行安装CosyVoice-ttsfrd报错'CosyVoiceFrontEnd' object has no attribute 'zh_tn_model' or 'en_tn_model'

#634 junge1010 opened 2 weeks ago
1
Flow 模型finetune 会逐渐崩掉?

#633 dyyoungg closed 2 weeks ago
4
load_onnx=True比load_onnx=False 情况下慢

#632 goingHan opened 2 weeks ago
1
流式合成有电音输出，如何解决

#631 doudou0601 opened 2 weeks ago
5
Why do length regulator in flow split token into three parts (head/mid/tail) ?

#630 zhangyike opened 2 weeks ago
2
300M基础模型的断句错误问题

#629 czydfj opened 2 weeks ago
2
flow中的attn_mask数据类型有问题

#628 NiHaoUCAS opened 2 weeks ago
4