About training time - Githubissues

researchmm / TTVSR

[CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution

MIT License

196 stars 12 forks source link

About training time #2

Closed wdmwhh closed 2 years ago

wdmwhh commented 2 years ago

How long does it take to train TTVSR using 8 gpus on V100? Around 30 days?

wdmwhh commented 2 years ago

I'm sorry to ask another question. What is the gpu memory usage when training TTVSR using 8 gpus on V100?

ChengxuLiu commented 2 years ago

In the training, we use the 32G V100. the gpu memory usage exceeds 20G, and the higher memory cost is needed in the testing stage (inference 100 frames). The training time is about half a month.

wdmwhh commented 2 years ago

Thanks for you reply. The runtime analysis may be problematic. When comparing Reconstruction Network of TTVSR, 2 shared 60 resblks and upsample, with BasicVSR, 2 non-shared 30 resblks and upsample, the runtime of Reconstruction Network of TTVSR should be longer than that of BasicVSR. However, according to the following table in Supplementary Material, the runtime of Reconstruction Network of TTVSR is less than half that of BasicVSR.

ChengxuLiu commented 2 years ago

Thank you for your careful reading. We spent some time exploring the reasons for this table. This is indeed an error in calculating the runtime. This should because we have missing the cuda sync command in the calculation. Very much appreciate your promptness in pointing out our mistakes. Sorry we have not found the related code to calculate the runtime before, so we recalculated the runtime of this table. The runtime of Reconstruction Network of TTVSR is 72ms. We will correct this mistake and update~~ Very sorry for the trouble~