ge-xing / Diff-UNet

Diff-UNet: A Diffusion Embedded Network for Volumetric Segmentation. (using diffusion for 3D medical image segmentation)
Apache License 2.0
141 stars 20 forks source link

Stuck in an epoch #23

Open XFivezzz opened 11 months ago

XFivezzz commented 11 months ago

When I apply this model to the verse2020 dataset, I get stuck at the ninth epoch every time (it will directly terminate the prompt RuntimeError: DataLoader worker (pid 9063) is killed by signal: killed) When I change the higher performance GPU and CPU, adjust the learning rate and batch, etc., I still get stuck at the ninth epoch, showing that it takes ten hours 42fe82b1178e5512182987a1c8228e6

920232796 commented 11 months ago

Your validation data is so big, you can only validate only a section of all validation data. And you also can modify the DDIM sample step from 10 to 2, which can also improve the speed of inference.