IceClear / StableSR

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
https://iceclear.github.io/projects/stablesr/

train dataset #24

Closed: kia350 closed this issue 1 year ago

kia350 commented 1 year ago

Hi, can you share the training dataset? The DIV8K dataset is hard to download. Thanks.

IceClear commented 1 year ago

Sorry, we do not hold the copyright to DIV8K, so we cannot distribute it. Please refer to the official website.

kia350 commented 1 year ago

OK. Can you share more details about how the dataset was made? How many times is the degradation repeated when generating the LR images from the original photos?

IceClear commented 1 year ago

You may refer to the config files for dataset details.

kangxi1818 commented 1 year ago

Thank you for your interesting work. I have a question about the dataset. I now have the inputs and the GTs. To get the latents and samples, how should I set the --init-img parameter of sr_val_ddpm_text_T_vqganfin_old.py? Also, init_latent has shape [1, 4, 64, 64]: how do I save it as a .npy file, and how do I save the samples as .png?

IceClear commented 1 year ago

The latents can be obtained from here, and the samples can be obtained from here. The GTs are 512x512 patches, and the inputs are generated following the RealESRGAN degradation pipeline. To save them, use np.save for the latents and cv2.imwrite for the samples.
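As a rough illustration of the np.save / cv2.imwrite suggestion above, here is a minimal sketch. It assumes the latent and the sample are already NumPy arrays (a PyTorch tensor would first need `.detach().cpu().numpy()`), that a sample is a single RGB image of shape [3, H, W] with values in [-1, 1], and that the helper names (`save_latent`, `to_uint8_bgr`, `save_sample_png`) are hypothetical and not taken from the StableSR code. Check the value range against the actual script before relying on it.

```python
import os
import numpy as np


def save_latent(init_latent, path):
    """Save a latent array (e.g. shape [1, 4, 64, 64]) as a .npy file."""
    np.save(path, np.asarray(init_latent, dtype=np.float32))


def to_uint8_bgr(sample):
    """Convert one sample [3, H, W], assumed RGB in [-1, 1], to uint8 BGR [H, W, 3]."""
    img = np.clip((np.asarray(sample) + 1.0) / 2.0, 0.0, 1.0)  # [-1, 1] -> [0, 1]
    img = np.transpose(img, (1, 2, 0))                          # CHW -> HWC
    img = np.round(img * 255.0).astype(np.uint8)
    return np.ascontiguousarray(img[:, :, ::-1])                # RGB -> BGR for OpenCV


def save_sample_png(sample, path):
    """Write one sample to disk as a PNG using cv2.imwrite."""
    import cv2  # imported here so the helpers above work without OpenCV installed
    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
    if not cv2.imwrite(path, to_uint8_bgr(sample)):
        raise IOError(f"cv2.imwrite failed for {path}")
```

For a batch of samples with shape [N, 3, H, W], one would loop over the first axis and call `save_sample_png` once per image. The RGB-to-BGR flip matters because cv2.imwrite expects BGR channel order.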

kangxi1818 commented 1 year ago

Thank you for your reply. I have another question: when I run sr_val_ddpm_text_T_vqganfin_old.py to get the samples and latents, how should I set --init-img?

python scripts/sr_val_ddpm_text_T_vqganfin_old.py --config configs/stableSRNew/v2-finetune_text_T_512.yaml --ckpt CKPT_PATH --vqgan_ckpt VQGANCKPT_PATH --init-img INPUT_PATH --outdir OUT_DIR --ddpm_steps 200 --dec_w 0.5 --colorfix_type adain

I mean, is INPUT_PATH the path to the inputs in my dataset? Thank you very much!

IceClear commented 1 year ago

It is just the name of your test folder. You may refer to the code for details. It is not a secret.

kia350 commented 1 year ago

Regarding "You may refer to the config files for dataset details": the config files do not describe the procedure for using RealESRGAN to make the dataset. Was test_example/OST_120.png also made with RealESRGAN?

When I used RealESRGAN to generate the LR-HR pairs, the LR images were very hard to recognize, for example 0801_s002.

IceClear commented 1 year ago

We just followed RealESRGAN to generate data. You may adjust the pipeline following your settings.
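To make "adjust the pipeline following your settings" a little more concrete, here is a heavily simplified, single-pass degradation sketch in NumPy. It is not the RealESRGAN pipeline, which applies a randomized second-order sequence of blur, resize, noise, and JPEG compression plus sinc filtering. The function names and default values below are placeholder assumptions chosen only to show how the strength parameters control how hard the LR image is to recognize.

```python
import numpy as np


def gaussian_kernel1d(sigma):
    """Normalized 1-D Gaussian kernel with radius of about 3 sigma."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1, dtype=np.float64)
    k = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()


def blur(img, sigma):
    """Separable Gaussian blur of an HxWxC float image, reflect-padded."""
    k = gaussian_kernel1d(sigma)
    r = len(k) // 2
    padded = np.pad(img, ((r, r), (r, r), (0, 0)), mode="reflect")
    conv = lambda v: np.convolve(v, k, mode="valid")
    tmp = np.apply_along_axis(conv, 0, padded)   # filter along height
    return np.apply_along_axis(conv, 1, tmp)     # filter along width


def downsample(img, scale):
    """Area downsampling by averaging non-overlapping scale x scale blocks."""
    h, w, c = img.shape
    h, w = h - h % scale, w - w % scale          # crop to a multiple of scale
    blocks = img[:h, :w].reshape(h // scale, scale, w // scale, scale, c)
    return blocks.mean(axis=(1, 3))


def degrade(hr, scale=4, blur_sigma=2.0, noise_sigma=0.02, rng=None):
    """Blur, downsample, and add Gaussian noise to an HR image in [0, 1]."""
    rng = np.random.default_rng() if rng is None else rng
    lr = downsample(blur(hr, blur_sigma), scale)
    lr = lr + rng.normal(0.0, noise_sigma, lr.shape)
    return np.clip(lr, 0.0, 1.0)
```

With a 512x512 GT patch and `scale=4`, `degrade` returns a 128x128 LR image. Lowering `blur_sigma` and `noise_sigma` gives milder degradations, which is the kind of adjustment to try when the generated LR images come out too damaged to recognize.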

kia350 commented 1 year ago

Thanks