karashiiro / TextToTalk

Chat TTS plugin for Dalamud. Has support for triggers/exclusions, several TTS providers, and more!
MIT License
45 stars 30 forks source link

Issue with Playback Rate on Azure Backend #167

Closed MartinVindis closed 12 months ago

MartinVindis commented 1 year ago

It appears that the playback rate is inexplicably set to x10 which makes even 20% way too fast. I have managed to find a temporary solution by manually adjusting the playback rate to 10% through the configuration file.

karashiiro commented 1 year ago

This works fine on my end, can you clarify what exactly the playback rate was before you adjusted it manually?

For reference, this is my test preset, and it plays back at a normal speaking rate:

    {
      "Volume": 1.0,
      "PlaybackRate": 20,
      "VoiceName": "Microsoft Server Speech Text to Speech Voice (en-US, EricNeural)",
      "Id": 2,
      "Name": "Eric",
      "ObsoleteRate": 0,
      "ObsoleteVolume": 0,
      "ObsoleteVoiceName": null,
      "EnabledBackend": 4
    }
MartinVindis commented 1 year ago

I had it at 20 but I thought it was too fast so I lowered it to 10 by manually edit the json file as the slider doesn't go lower than 20%. I mean it should sound normal 100, you agree that 100 is too fast right?

karashiiro commented 1 year ago

It turns out that I completely misunderstood how Azure uses the playback rate. While all of the other providers use playback rate as an absolute value (100% means no change), Azure uses it as a change from the original (100% means +100%, or 2x; 20% means +20%, or 1.2x). I'll fix this today.

karashiiro commented 12 months ago

Fixed in testing v1.26.3 (3 days ago)