NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference
BSD 3-Clause "New" or "Revised" License
5.06k stars 1.38k forks source link

Throughput on V100 32GB #566

Open Haoyanlong opened 2 years ago

Haoyanlong commented 2 years ago

I have trained the tacotron2 in V100S(32GB), batch_size=114 for filling memory, but can't achieve the 17604(https://lambdalabs.com/gpu-benchmarks); N5DGhcUbk9

image

I calculate the throughput roughly by: 114*159 / 3.08 = 5885.065, could you give me some advices? Thank you very much!