I have noticed that the difference between the "offline" and "single stream" performance is a lot higher for RNN-T than for the other benchmarks.
For example in submission "1.1-100" from NVIDIA the ratio between the "offline" sample/s to the calculated "single stream" sample/s for all nets except RNNT is lower than 20, but for RNNT it is ca. 292.
Any help in clarifying this difference is appreciated.
Hi,
I have noticed that the difference between the "offline" and "single stream" performance is a lot higher for RNN-T than for the other benchmarks.
For example in submission "1.1-100" from NVIDIA the ratio between the "offline" sample/s to the calculated "single stream" sample/s for all nets except RNNT is lower than 20, but for RNNT it is ca. 292.
Any help in clarifying this difference is appreciated.
Thank you!