silent-chen / layout-guidance

[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance
240 stars 15 forks source link

When will the real image editing released #15

Closed RunpuWei closed 6 months ago

RunpuWei commented 1 year ago

Dear author.

Thank you for your awesome work, and congratulations on your work being accepted by WACV2024.

I wonder when the real image editing will be released, I am very interested in your work and hope to replicate it.

Thank you!

silent-chen commented 1 year ago

Hi, thanks for your interest in our work. I plan to release the real image editing part within this month. Hope it can inspire you somehow.

silent-chen commented 10 months ago

Sorry for the late update. I was kind of busy last month. I will try to clean up the code and release it soon.

RunpuWei commented 10 months ago

Thank you for your hard work. Looking forward to it!

Sjey-Lyn commented 8 months ago

Has real image editing been released yet?

silent-chen commented 8 months ago

Hi all, sorry for the late release. I just cleaned the real image editing code and pushed it to the repo. I also updated the dependent library because the CLIPFeatureExtractor is now deprecated in diffusers now. Please update the corresponding libs. Let me know if you have further questions.

Sjey-Lyn commented 8 months ago

I'm using a 4090 and it's showing that I don't have enough cuda memory, what is the minimum amount of memory needed?

silent-chen commented 8 months ago

It will cost ~28GB memory for text inversion and ~31GB for Dreambooth. To reduce the GPU memory usage in text inversion, you can try to reduce the batch size for text inversion by setting batch_size in conf/real_image_editing_config.yaml. To get similar results as using a batch size of 4, please use accumulated gradient and increase the iteration numbers accordinately. For dreambooth, it requires at least 30GB. You can probably reduce the memory usage by using xformers.