langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
https://dify.ai
Other
52.6k stars 7.69k forks source link

The last few words of the TTS-generated speech end abruptly #10954

Open dadastory opened 4 days ago

dadastory commented 4 days ago

Self Checks

Dify version

0.11.2

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

When I use the CosyVoice-300M-SFT model deployed by Xinference, it seems that I cannot adjust the speaking speed or modify anything. However, the generated speech will suddenly end at the last few words or periods. It cannot naturally finish the last word or word, which seems very abrupt.

https://github.com/user-attachments/assets/683cf75c-fa41-48b1-be8e-c7ff98111883

✔️ Expected Behavior

The generated speech should naturally finish a sentence, rather than ending abruptly at the last word or character.

❌ Actual Behavior

No response