Saliency-adaptive noise fusion for PAG

v0xie / sd-webui-incantations

Enhance Stable Diffusion image quality, prompt following, and more through multiple implementations of novel algorithms for Automatic1111 WebUI.

GNU General Public License v3.0

120 stars 7 forks source link

Saliency-adaptive noise fusion for PAG #45

Closed v0xie closed 1 month ago

v0xie commented 1 month ago

Adds a new method of combining the guidance from PAG and CFG.

Derives from "High-fidelity Person-centric Subject-to-Image Synthesis": https://arxiv.org/abs/2311.10329

In the paper they are combining the guidance from two different models, so I thought we could apply that to PAG since it's doing pretty much the same thing.

A couple of examples:

High CFG scales: xyz_grid-0012-1 xyz_grid-0014-1

Greater than 512px for SD1.5: xyz_grid-0002-1

Normal CFG scale, high PAG scale: xyz_grid-0000-1