Whispering Tiger UI is a Native-UI that can be used to control the Whispering Tiger application.
Whispering Tiger is a free and Open-Source tool that can listen/watch to any audio stream or in-game image on your machine and prints out the transcription or translation to a web browser using Websockets or over OSC (examples are Streaming-overlays or VRChat).
Download Latest Version from the Releases Page.
Video Tutorial "Whispering Tiger - Live Translation and Transcription":
After downloading the latest version from the [Releases], extract it to a folder of your choice on a drive with enough free space.
(Do not run it directly from the zip file, do not run from external drive.)
Create a Profile by entering a name and clicking on the New button.
Websocket IP + Port
can be kept at the default values "127.0.0.1" and "5000".
Select your Audio Input and Output devices. You can test them by speaking into your microphone and clicking on the Test button.
You should see the Audio Input bar move when you speak. and hear a test-audio and see the Audio Output bar move when you click on the Test button.
See also Audio configuration (TTS to Mic, Game Audio translation, etc.) for more information on specific Audio Setups.
(like when you want to translate Audio of Games, Videos or Streams that are played on your PC instead of using a Microphone as Input.).
(Optional) use Push to Talk Click into the field and press the keys you want to use for Push to Talk
(press each key separately to configure. When running the Profile, all keys will be required to be pressed at the same time when using Push to Talk)
Speech volume Level
and Speech pause detection
to 0.Keep an eye on the estimated Memory consumption in the lower right corner.
It is only a rough estimate and can vary, but it should give you an idea of how much (V-)RAM you need for your selected A.I. Models. and Options.
Select the A.I. Device for Speech-to-Text and Text Translation according to your Hardware.
Select the Speech-to-Text Size and Text Translation Size.
Select the Speech-to-Text Precision and Text Translation Precision
float16
.float32
, int16
or int8
precision.Note:
- You can play with the values until you get your desired results.
- If something does not work, check the Log under the Advanced tab. And check for any error.
- Enable Write log to file to save the log to a file.
*.py
file and place it in the Plugins folder.Note:
Most Plugins have specific settings that can be configured in the textboxes of the Plugin in the Plugins tab.See also Example Setup of Plugin VoiceVox (Japanese TTS) As example how to setup the VoiceVox Plugin.
For additional Help, you can join