llamaspeak 2 speech rate parameters are in the wrong format

dusty-nv / jetson-containers

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

MIT License


I've already put this in a comment on another thread here, but I'm adding it as a new issue so it shows up in searches.

The Riva TTS agent is failing because the voice rate parameter has the wrong format. It's being sent as a float, but it needs to be a string: either "default" or one of a few other options described here.

I made two changes to the code to get it working. First, in local_llm/agents/web_chat.py, line 42:

-                self.tts.rate = float(msg['tts_rate'])
+                self.tts.rate = f"{float(msg['tts_rate']):.0%}"

...and in plugins/audio/riva_tts.py, line 43:

-        self.rate = voice_rate
+        self.rate = f'{voice_rate:.0%}'
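For anyone wondering what the `:.0%` format spec in those two changes does: it multiplies the number by 100, rounds to zero decimal places, and appends a percent sign, so a float rate of 1.0 becomes the string "100%". Here's a minimal standalone sketch of that conversion. The helper name `format_rate` is made up for illustration and is not part of local_llm or Riva.

```python
def format_rate(rate):
    """Convert a numeric speech rate (1.0 = normal speed) into a percent string.

    Hypothetical helper, not part of local_llm or Riva. It mirrors the
    f-string used in the two changes above.
    """
    # float() accepts both numbers and numeric strings such as '1.25',
    # which is how the value may arrive in a web UI message.
    return f"{float(rate):.0%}"


print(format_rate(1.0))    # 100%
print(format_rate(1.25))   # 125%
print(format_rate("0.8"))  # 80%
```

Note that the value is only formatted once in each code path: web_chat.py formats the incoming message value, and riva_tts.py formats the float passed to its constructor.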

I'm running the Riva server on a separate PC, and using the --riva-server command line option to talk to it, since Riva has not yet been ported to JP 6.

llamaspeak 2 speech rate parameters are in the wrong format #445