The visual display of the latency for stt and tts is very useful, and I would like to add another region on the playback UI that similarly shows the latency caused by the 3rd party application. This region would appear on the top audio strip, the same one showing STT. We would need a span created that starts when a gather verb resolves with a transcript (ie we send a verb:hook payload with the transcript) and ends when the next response is received from the webhook or websocket. That would let us continue to break down the overall latency into its components.
The visual display of the latency for stt and tts is very useful, and I would like to add another region on the playback UI that similarly shows the latency caused by the 3rd party application. This region would appear on the top audio strip, the same one showing STT. We would need a span created that starts when a
gather
verb resolves with a transcript (ie we send a verb:hook payload with the transcript) and ends when the next response is received from the webhook or websocket. That would let us continue to break down the overall latency into its components.