art-from-the-machine / Mantella

Mantella is a Skyrim and Fallout 4 mod which allows you to naturally speak to NPCs using Whisper (speech-to-text), LLMs (text generation), and xVASynth / XTTS (text-to-speech).
https://art-from-the-machine.github.io/Mantella/
GNU General Public License v3.0

Support for subtitles only option (no TTS) #266

Open art-from-the-machine opened 3 months ago

art-from-the-machine commented 3 months ago

In its current state, Mantella will only launch if it can connect to either xVASynth or XTTS. An option should be provided in MantellaSoftware/config.ini to connect to neither service and produce no NPC voice output. At a high level, the implementation might look like this:

  1. Add a "None" option to config.ini's tts_service setting
  2. If tts_service is None, generate a silent audio file of X seconds, where X is approximated based on the length of the line of dialogue
  3. If necessary, either stop FaceFXWrapper from crashing Mantella.exe when it fails to generate a lip file on silent audio, or generate a generic lip file which will run for the length of the delivered line
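
Step 2 above could be sketched roughly as follows, using only Python's standard library. This is a minimal illustration rather than Mantella's actual code: the function names, the speaking rate of 2.5 words per second, the 1-second minimum, and the 22050 Hz mono 16-bit format are all assumptions chosen for the example.

```python
import io
import wave

# Assumed values for illustration only; none of these come from Mantella.
WORDS_PER_SECOND = 2.5   # rough speaking rate used to approximate X
MIN_SECONDS = 1.0        # floor so very short lines still get some screen time
SAMPLE_RATE = 22050      # mono, 16-bit PCM


def estimate_duration(line: str) -> float:
    """Approximate how long a spoken line would last, from its word count."""
    words = len(line.split())
    return max(MIN_SECONDS, words / WORDS_PER_SECOND)


def write_silent_wav(target, seconds: float, sample_rate: int = SAMPLE_RATE) -> int:
    """Write `seconds` of silence to `target` (a path or a binary file object).

    Returns the number of audio frames written.
    """
    n_frames = round(seconds * sample_rate)
    with wave.open(target, "wb") as wav:
        wav.setnchannels(1)      # mono
        wav.setsampwidth(2)      # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * n_frames)  # all-zero samples are silence
    return n_frames


if __name__ == "__main__":
    buffer = io.BytesIO()
    line = "I used to be an adventurer like you."
    frames = write_silent_wav(buffer, estimate_duration(line))
    print(f"{frames} frames of silence for: {line!r}")
```

A file path can be passed instead of a buffer to write the silent clip to disk. Whether FaceFXWrapper accepts such a file (step 3) would still need to be checked separately.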

DaymareOn commented 2 months ago

As a user, I'd like this feature because I sometimes play without sound (when I need audio for another task). I also read much faster than NPCs speak, so it would let me speed up the pace of dialogue.