Open verigle opened 1 month ago
Not sure what you are trying to say, what do you expect?
I expect the tts don't read the markdown block of image url.
example for llm output:
this is a new picture:
![picture1](http://www.test.com/test.jpg)
for tts, it only required to read the word of this is a new picture
,
however, the current tts will also read the url of [picture1](http://www.test.com/test.jpg)
, which is not useful audio for user
is there any plan to filter the markdown image block for TTS input?
Hi, @verigle. I'm Dosu, and I'm helping the Dify team manage their backlog. I'm marking this issue as stale.
Issue Summary
Next Steps
Thank you for your understanding and contribution!
Self Checks
1. Is this request related to a challenge you're experiencing? Tell me about your story.
for tts, expect remove TTS input text of markdown image url .
2. Additional context or comments
llm output example
it is not need to use TTS to audio for image markdown block (
![picture1](http://www.test.com/test.jpg)
), is there has any way to remove image url for TTS input3. Can you help us with this feature?