Open varshak97 opened 2 days ago
Hi, I'm attaching a drive link to a Google Doc with additional samples (training and inference) just for further clarification and review regarding this issue.
Link: https://docs.google.com/document/d/17VbNJ9LZJPoskOSP2kckq5vs9L7E2h6X-Yb-iZVUzFw/edit?usp=sharing
Hello! I want to begin by expressing my appreciation for this work; particularly the straightforward mask-ablated training strategy. Thank you to you and your team for sharing the code!
I am facing an issue when training the segmentation-guided DDPM/DDIM model with cardiac MRI data (using the public ACDC dataset). I have observed that the generated samples are not consistent with their corresponding segmentation masks, during training/inference.
Details: Inconsistent Results: The reconstructed/generated samples do not align with the input segmentation masks, i.e. the spatial features outlined by the masks are not correctly reflected in the generated samples. Noisy Samples: There is a noticeable presence of noisy samples, although increasing the number of epochs has somewhat resolved this issue. After training for 1000 epochs, there are still occasional noisy outputs, but my primary concern is the inconsistency between the generated samples and segmentation masks.
I have double-checked my data preparation step, and I don't believe the issue lies there. This is command I used for training the segmentation-guided diffusion model:
Following are sample outputs from epoch 999 for more context. Also, this issue persists throughout the entire training process, not just at the beginning or the end. I would appreciate any insights or suggestions on how to resolve this issue! Thank you so much for your time :)
Results/samples of ddpm_cardiac-mri_256_segguided_abalated @ Epoch 999:
0999_cond_seg_all.png
0999_orig.png
0999.png