Request for details on Creating Low Resolution Samples

saiboxx / chexray-diffusion

Code for "Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis" @ PAKDD 2023

MIT License

43 stars, 7 forks

Hi,

Thanks for your interest in our work.

For training the SR module, we follow the procedure described in the original SR3 paper (Saharia et al.). To produce low-resolution samples, we simply use the standard Resize function in PyTorch (see here). I also recommend having a look at our training script.

We found that in our cascaded pipeline, upscaling samples produced by the VAE decoder with the method and script described above does not reach the desired level of quality. We therefore added a finetuning step for the SR module: we used the LDM VAE to encode and decode the training dataset, which gives low-resolution images that match the output distribution of the decoder, i.e.: "real" image 1024px --> Resize 256px --> encode --> decode --> decoded "real" 256px image as conditioning for the SR module.
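The finetuning data flow can be sketched roughly as follows. This is only an illustration of the order of operations: `resize`, `encode` and `decode` are hypothetical callables standing in for the Resize transform and the LDM VAE, and pairing the decoded image with the original 1024px image as the target is an assumption. The toy stand-ins operate on plain lists so the sketch runs without any model.

```python
def make_sr_conditioning(real_hr, resize, encode, decode):
    """Build the SR conditioning image from a real high-res image."""
    low_res = resize(real_hr)   # "real" image 1024px --> Resize 256px
    latent = encode(low_res)    # --> encode (VAE encoder)
    return decode(latent)       # --> decode (decoded "real" 256px image)


def make_finetuning_pair(real_hr, resize, encode, decode):
    """Return (conditioning, target); using real_hr as target is assumed."""
    conditioning = make_sr_conditioning(real_hr, resize, encode, decode)
    return conditioning, real_hr


# Toy stand-ins (hypothetical): a 1D "image" and a trivial "VAE".
toy_resize = lambda img: img[::4]              # keep every 4th value
toy_encode = lambda x: [v * 0.5 for v in x]    # pretend latent
toy_decode = lambda z: [v * 2 for v in z]      # pretend reconstruction

cond, target = make_finetuning_pair(
    list(range(8)), toy_resize, toy_encode, toy_decode
)
```

With a real model, the three callables would be replaced by the actual Resize transform and the VAE's encode and decode calls, and the decoded images could be computed once and cached for the whole training set.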

Does this help you?

Cheers, Tobias

Request for details on Creating Low Resolution Samples #8