rhasspy / piper

A fast, local neural text to speech system
https://rhasspy.github.io/piper-samples/
MIT License

Output the audio size in bytes when logging #588

Open kovidgoyal opened 2 months ago

kovidgoyal commented 2 months ago

Currently, piper logs something like:

Real-time factor: 0.24298105233065118 (infer=2.355538764 sec, audio=9.694331065759638 sec)

In addition to this, I would like it to log:

bytes=XYZ

This would ease integration with other software that sends multiple sentences and reads the output data while keeping track of where one sentence ends and the next begins.
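To illustrate the consumer side, here is a minimal sketch in Python. It assumes the requested `bytes=XYZ` field is appended to the existing log line, which is the proposal here and not current piper behavior. The helper names `parse_audio_bytes` and `read_exact` are hypothetical, and how the log and audio streams are obtained is left to the caller.

```python
import re
from typing import BinaryIO, Optional

# Assumed format: the existing "Real-time factor" line with the proposed
# bytes=XYZ field added. This field does not exist in piper today.
BYTES_RE = re.compile(r"\bbytes=(\d+)\b")


def parse_audio_bytes(log_line: str) -> Optional[int]:
    """Return the byte count from a log line, or None if the field is absent."""
    match = BYTES_RE.search(log_line)
    return int(match.group(1)) if match else None


def read_exact(stream: BinaryIO, n: int) -> bytes:
    """Read exactly n bytes from stream, looping over short reads."""
    chunks = []
    remaining = n
    while remaining > 0:
        chunk = stream.read(remaining)
        if not chunk:
            raise EOFError(f"stream ended with {remaining} bytes still expected")
        chunks.append(chunk)
        remaining -= len(chunk)
    return b"".join(chunks)
```

With this, the caller would read one log line per sentence, pass it to `parse_audio_bytes`, and then call `read_exact` on the audio stream with the returned count to get exactly that sentence's audio.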

Thanks.