Open kakaluote opened 4 years ago
i add more prints, it shows: net forward use 50ms calc loss use 60ms backward use 50ms optimizer use 200ms !!!!
It takes ~6 days on a GTX 1080ti, but a 1080ti should not be 3x as fast as a 1070. What batch size are you using? Maybe it's not fitting into your GPU and it's paging to memory every iteration or something.
my environment is: ubuntu 18.04, cpu i5-7500, GTX1070, cuda10.0, pytorch1.3.1
i've added some code to print net forward time, the batch size is 2, net forward time is 350ms. the forward time is around 50ms when eval model, why trainning forward is so slow