Status: Open. Opened by calvinmccarter-at-lightmatter 2 years ago.
The previous implementation of the RNN-T encoder StackTime module is slow and memory-inefficient. I have made a PR with this same fix to the MLCommons-Inference repo: https://github.com/mlcommons/inference/pull/1015
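For readers unfamiliar with StackTime, here is a rough sketch of why a shift-and-concatenate approach can be slow and memory-hungry, and how a pad-and-reshape approach avoids that. It is not the code from this PR. It uses NumPy instead of PyTorch, assumes a `(T, B, U)` input layout (time, batch, features), and the function names `stack_time_copy` and `stack_time_reshape` are hypothetical, chosen only for illustration.

```python
import numpy as np


def stack_time_copy(x, factor):
    """Shift-and-concatenate sketch (assumed shape of the slow approach).

    Allocates `factor` full-size buffers, concatenates them along the
    feature axis, and only then discards all but every `factor`-th frame.
    """
    seq = [x]
    for i in range(1, factor):
        shifted = np.zeros_like(x)
        shifted[:-i] = x[i:]  # frame t receives frame t + i; the tail stays zero
        seq.append(shifted)
    return np.concatenate(seq, axis=2)[::factor]


def stack_time_reshape(x, factor):
    """Pad-and-reshape sketch (assumed shape of the faster approach).

    Zero-pads the time axis to a multiple of `factor`, then folds each
    group of `factor` consecutive frames into the feature axis.
    """
    t, b, u = x.shape
    pad = (-t) % factor  # frames needed to reach a multiple of `factor`
    x = np.concatenate([x, np.zeros((pad, b, u), dtype=x.dtype)], axis=0)
    x = x.transpose(1, 0, 2)                           # (B, T_padded, U)
    x = x.reshape(b, (t + pad) // factor, u * factor)  # fold time into features
    return x.transpose(1, 0, 2)                        # (T_out, B, U * factor)


# Both sketches should agree on the stacked output.
x = np.arange(7 * 2 * 3, dtype=np.float32).reshape(7, 2, 3)
assert np.array_equal(stack_time_copy(x, 2), stack_time_reshape(x, 2))
```

In this sketch the first version materializes roughly `factor` times the input size before subsampling, while the second touches only the data that ends up in the output. In an encoder the sequence lengths would presumably also be updated to `ceil(len / factor)`, which is omitted here.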
MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅
@mwawrzos could you please review?