Extracting spk embedding is too slow

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

https://funaudiollm.github.io/

Apache License 2.0

4.66k stars 471 forks source link

Open WangGewu opened 1 month ago

WangGewu commented 1 month ago

可以提供campplus的pt文件和batch推理代码吗？目前使用campplus.onnx提取spk embedding，速度太慢。

aluminumbox commented 1 month ago

well we may consider it later as many people have raised this problem

WoBuChiTang commented 1 month ago

well we may consider it later as many people have raised this problem

是的确实有点慢，希望至少提速到和speech2token差不多的速度

Wentao795 commented 1 month ago

+1，

huskyachao commented 1 month ago

+1 :）

github-actions[bot] commented 4 days ago

This issue is stale because it has been open for 30 days with no activity.