bennyguo / instant-nsr-pl

Neural Surface reconstruction based on Instant-NGP. Efficient and customizable boilerplate for your research projects. Train NeuS in 10min!
MIT License
856 stars · 84 forks

GPU out of memory at trainer.test() stage but no issue during trainer.fit() stage #61

Open xiaohulihutu opened 1 year ago

xiaohulihutu commented 1 year ago

Hi there,

Could you please give me some hints on how to solve the issue below?

When running nsr_pl, even though my training and test sets both contain 73 images, the testing stage runs out of GPU memory. With the DDP strategy during training, I can see the memory spread evenly across 4 GPUs, but during testing all of the memory accumulates on a single GPU and the run stops with an out-of-memory error. The trainer is the same, so why is this happening? Is there anything I can do to fix it? Thanks in advance.

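For reference, a minimal sketch of the setup described above, assuming a standard PyTorch Lightning Trainer with the DDP strategy; the system and datamodule names below are placeholders, not this repo's actual classes:

```python
# Sketch of the reported setup (an assumption, not the repo's launch code):
# the same 4-GPU DDP Trainer is used for both fit() and test().
import pytorch_lightning as pl

def reproduce(system: pl.LightningModule, dm: pl.LightningDataModule):
    trainer = pl.Trainer(accelerator="gpu", devices=4, strategy="ddp")
    trainer.fit(system, datamodule=dm)   # observed: memory spread evenly across the 4 GPUs
    trainer.test(system, datamodule=dm)  # observed: memory collects on a single GPU and OOMs
```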

bennyguo commented 1 year ago

Hi! Sorry, I haven't fully tested the code on multiple GPUs. The problem likely originates from the aggregation of all outputs after testing: https://github.com/bennyguo/instant-nsr-pl/blob/2daaa53c9bf5dabefc41236c92ed1c2fa7cbcf73/systems/nerf.py#L189. A temporary fix is to use only 1 GPU for testing; you can easily resume from trained checkpoints as explained in the README. I'll mark this as an enhancement and experiment with it myself when I have time.