rhasspy / larynx

End to end text to speech system using gruut and onnx
MIT License
824 stars 49 forks source link

Real-time factor: calculation #11

Closed svenha closed 3 years ago

svenha commented 3 years ago

I know the metric real time factor (RTF) from STT (or ASR) systems. A RTF of 0.5 would mean than 1 sec is recognized in 0.5 sec.

I would expect a similar logic for TTS systems. But the numbers reported in larynx' debug output as Real-time factor seem to be 1/RTF. This is confusing, isn't it?

synesthesiam commented 3 years ago

Thanks, I had forgotten to flip that. Will be updated in the next release.