MMqd / kandinsky-for-automatic1111

Automatic1111 extension adding support for Kandinsky 2.X
GNU Affero General Public License v3.0
100 stars 8 forks source link

Kandinsky For Automatic1111 Extension

Adds a script that run Kandinsky 2.X models (2.1 and 2.2). Kandinsky 2.2 can generate larger images, but it is much slower to use with VRAM optimizations.

!!Note!! Progress bar not supported, view terminal progress bar instead.

Troubleshooting

Examples

The following are non cherry-picked examples, with various settings and resolutions.

center image

Prompt: sky, daylight, realistic, high quality, in focus, 16k, HQ
Model: Kandinsky 2.1
Steps: 64
Sampler: Default
CFG Scale: 7
Prior CFG Scale: 7
Seed: 3479955
Size: 1024x1024
Inference Steps: 128

center image

Prompt: As the sun sets, les arbres whisper, mientras el río serpentea gracefully, отражая прекрасные colors, majestic mountains stand tall, evoking tranquillité et harmonie, 空中舞动着美丽的蝴蝶, 空と地球の神秘なつながり, रंगबिरंगी वस्तुएं। (from chatgpt)
In English: As the sun sets, the trees whisper, while the river gracefully meanders, reflecting beautiful colors, majestic mountains stand tall, evoking tranquility and harmony, butterflies dance in the air, the mysterious connection between sky and earth, colorful objects.
Model: Kandinsky 2.1
Steps: 64
Sampler: Default
CFG Scale: 7
Prior CFG Scale: 7
Seed: 3479955
Size: 768x768
Inference Steps: 128

center image

Prompt: cat, realistic, high quality, 4k
Model: Kandinsky 2.1
Steps: 64
Sampler: Default
CFG Scale: 7
Prior CFG Scale: 7
Seed: 3479955
Size: 1024x1024
Inference Steps: 128

center image

Prompt: spaceship, retro, realistic, high quality, 4k
Model: Kandinsky 2.1
Steps: 64
Sampler: Default
CFG Scale: 7
Prior CFG Scale: 7
Seed: 3479955
Size: 512x512
Inference Steps: 128

center image

Prompt: cyberpunk city, distopian, high quality, 4k
Model: Kandinsky 2.1
Steps: 64
Sampler: Default
CFG Scale: 3
Prior CFG Scale: 3
Seed: 3479955
Size: 768x768
Inference Steps: 128

Image Mixing

Combine images and/or prompts together. Can be used for style transfer, and combining a background with a subject.

Prompt: cat, high quality, 4k
Model: Kandinsky 2.1
Steps: 64
Sampler: Default
CFG Scale: 7
Prior CFG Scale: 7
Seed: 3479955494
Size: 1536x768
Inference Steps: 128

Mixed with:

center image

Result:

center image

How To Use

  1. Select "Kandinsky" in the scripts section
  2. Set "Prior Inference Steps". Increasing the value improves the results, but it reaches a plateau at around 128. Beyond that, the image may change, but the quality remains consistent.
  3. The model will start downloading automatically, if needed.

Image Mixing

Prompt + Image

  1. In text2img set the prompt
  2. In the extra image field in the script section, set the image
  3. Set the "Interpolate Image 1 Strength" to the desired amount of the image generated by the prompt
  4. Set the "Interpolate Image 2 Strength" to the desired amount of the image in the script section

Image + Image

  1. In img2img set an image
  2. In the extra image field in the script section, set the image
  3. Set the "Interpolate Image 1 Strength" to the desired amount of the image generated by the prompt
  4. Set the "Interpolate Image 2 Strength" to the desired amount of the image in the script section

Notes

Features

Supported Settings

Any other settings such as seed variations, will have no effect on generated images.

Known Bugs

Limitations