huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
https://huggingface.co/docs/diffusers
Apache License 2.0
25.73k stars, 5.31k forks

Add PSLD upscaler/deblur/inpainting Pipelines to Stable Diffusion and other LDMs #4064

Closed: AmericanPresidentJimmyCarter closed this issue 1 year ago

AmericanPresidentJimmyCarter commented 1 year ago

Model/Pipeline/Scheduler description

We present the first framework to solve general inverse problems leveraging pre-trained latent diffusion models. Previously proposed algorithms (such as DPS and DDRM) only apply to pixel-space diffusion models. We theoretically analyze our algorithm showing provable sample recovery in a linear model setting. The algorithmic insight obtained from our analysis extends to more general settings often considered in practice. Experimentally, we outperform previously proposed posterior sampling algorithms in a wide variety of problems including random inpainting, block inpainting, denoising, deblurring, destriping, and super-resolution.

https://arxiv.org/pdf/2307.00619.pdf
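
For readers unfamiliar with the "linear inverse problem" framing in the abstract, here is a minimal sketch of the measurement model and a data-consistency gradient step for random inpainting. It is not the PSLD algorithm: it leaves out the pre-trained latent diffusion prior entirely, and the names `apply_mask` and `data_consistency_step`, the step size, and the toy 8-pixel signal are all illustrative assumptions rather than anything from the paper or the linked repository.

```python
import random


def apply_mask(x, mask):
    """Linear measurement operator A for inpainting: keep observed pixels, zero the rest."""
    return [xi * mi for xi, mi in zip(x, mask)]


def data_consistency_step(x, y, mask, step_size=0.25):
    """One gradient-descent step on ||y - A x||^2 with A = diag(mask).

    The gradient is -2 * A^T (y - A x); A is diagonal here, so A^T = A.
    """
    residual = [yi - axi for yi, axi in zip(y, apply_mask(x, mask))]
    grad = [-2.0 * mi * ri for mi, ri in zip(mask, residual)]
    return [xi - step_size * gi for xi, gi in zip(x, grad)]


# Toy example (hypothetical data): an 8-pixel "image" with 3 pixels missing.
random.seed(0)
x_true = [random.random() for _ in range(8)]
mask = [1, 0, 1, 1, 0, 1, 0, 1]   # 1 = observed, 0 = missing
y = apply_mask(x_true, mask)      # noiseless measurements y = A x

x = [0.0] * 8                     # initial estimate
for _ in range(20):
    x = data_consistency_step(x, y, mask)
```

After the loop, the observed pixels of `x` agree with `y`, while the missing pixels are untouched by the measurement term. Filling those in plausibly is the job of the generative prior, which is the part a posterior sampling method adds on top of a step like this.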

PSLD is powerful and should enable inpainting for the new XL model.

Open source status

Provide useful links for the implementation

https://github.com/LituRout/PSLD

patrickvonplaten commented 1 year ago

Looks like a cool idea worth trying out!

github-actions[bot] commented 1 year ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.