Julie-tang00 / Point-BERT

[CVPR 2022] Pre-Training 3D Point Cloud Transformers with Masked Point Modeling
MIT License

Question about the inference time. #65

Open Ma-Weijian opened 1 year ago

Ma-Weijian commented 1 year ago

Hello.

Fantastic work on point cloud self-supervised learning.

However, I'm quite confused about the inference time during dVAE pretraining: it is far longer than the training time. (On 2x NVIDIA A100 40GB, one training epoch takes ~3 minutes while inference takes ~10 minutes.) nvidia-smi shows ~1% GPU utilization during inference, while the CPUs run at full utilization.
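For reference, here is a minimal way to check whether the bottleneck is the CPU data pipeline rather than the GPU forward pass. This is a generic PyTorch timing sketch, not code from this repo; `model` and `loader` are stand-ins, and it assumes the loader yields point cloud tensors directly:

```python
import time
import torch

@torch.no_grad()
def profile_loop(model, loader, device="cuda"):
    # Crude split of wall time into data-loading vs. forward-pass time.
    model.eval()
    data_t, fwd_t = 0.0, 0.0
    end = time.time()
    for points in loader:
        data_t += time.time() - end           # time spent waiting on the CPU pipeline
        points = points.to(device, non_blocking=True)
        torch.cuda.synchronize()
        start = time.time()
        _ = model(points)
        torch.cuda.synchronize()              # include the full kernel time
        fwd_t += time.time() - start
        end = time.time()
    print(f"data: {data_t:.1f}s  forward: {fwd_t:.1f}s")
```

If `data_t` dominates, the ~1% GPU utilization is explained by the evaluation loop being starved by CPU-side preprocessing.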

Digging into the code, I discovered that the inference batch size is 1. When I set the inference batch size to match the training batch size, inference still takes far longer than training, and CPU and GPU utilization barely change.

I just wonder whether changing the inference batch size would affect the results. Also, why does inference take so much longer than training?
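For what it's worth, the forward pass itself is deterministic at eval time, so a larger batch size should only affect throughput, not the metric value, as long as the metric is averaged per sample rather than per batch. A hedged sketch of that pattern (generic PyTorch; `metric_fn` is a stand-in for whatever reconstruction metric is used, not a function from this repo):

```python
import torch

@torch.no_grad()
def evaluate(model, loader, metric_fn, device="cuda"):
    model.eval()
    total, n = 0.0, 0
    for points in loader:                      # batch size here only affects speed
        points = points.to(device)
        recon = model(points)
        # metric_fn is assumed to return one value per sample (shape: (B,));
        # summing and dividing by the total sample count keeps the result
        # independent of the batch size
        per_sample = metric_fn(recon, points)
        total += per_sample.sum().item()
        n += points.size(0)
    return total / n
```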

Ma-Weijian commented 1 year ago

Any help would be appreciated. Thanks a lot!

yuxumin commented 1 year ago

Hi, see https://github.com/lulutang0608/Point-BERT/issues/51. Sorry for the late reply.

Ma-Weijian commented 1 year ago

I see. Thanks for your reply.

I still have some questions about choosing DataParallel for dVAE training; I commented in #51. The trade-off I have in mind is sketched below.
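For context, this is the usual DataParallel vs. DistributedDataParallel trade-off in PyTorch, shown as a generic sketch rather than this repo's actual launcher; the tiny model is a hypothetical placeholder for the dVAE:

```python
import torch
import torch.nn as nn

# Hypothetical placeholder standing in for the dVAE, for illustration only.
model = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 3))

if torch.cuda.device_count() > 1:
    # nn.DataParallel: single process; replicates the module onto every GPU on
    # each forward pass and gathers outputs on GPU 0. Easy to drop in, but the
    # GPU-0 bottleneck and per-iteration replication often leave the other
    # GPUs underutilized.
    model = nn.DataParallel(model)
model = model.cuda()

# The common alternative is DistributedDataParallel: one process per GPU with
# gradients synchronized via all-reduce, typically launched with torchrun:
# model = nn.parallel.DistributedDataParallel(model.cuda(), device_ids=[rank])
```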