First of all, this is a really fun extension. Thank you for making this. I see some issues with how the prompts are created though.
Just looking at the structure of the final prompt coming from the LLM it looks like "Use common prompt" should be checked inside RP. Because I seem to consistently get 1 too many prompt regions compared to RP regions, and the first one is usually just the original prompt that I put into the LLM. Furthermore, it often gets the regions in the wrong order, which might be improved by giving more examples to the LLM.
For instance, this is my input prompt. I deliberately describe things in the wrong order:
"A photo of inside of a rustic cabin, the afternoon sun shines through a window on the left, a cat sits on the carpet below the window, to the right there is a cozy fireplace and a portrait of a lady above it"
This is the RP layout. Simple 4 quadrants, as expected:
"1,1,1; 1,1,1"
This is the output prompt:
"A photo of inside of a rustic cabin, the afternoon sun shines through a window on the left, a cat sits on the carpet below the window, to the right there is a cozy fireplace and a portrait of a lady above it BREAK
Afternoon sun piercing through the rustic casement window, casting a warm, golden glow across the aged wooden walls. BREAK
A fluffy cat, its fur softly reflecting the sunlight, seated serenely on a well-worn, braided carpet below the window. BREAK
A cozy fireplace with logs glowing, framed by a rugged stone mantle, exuding warmth and comfort. BREAK
An antique portrait of a lady with delicate features, framed above the fireplace, adding a touch of historical elegance."
First, I need to manually turn on "Use common prompt" because the first region is just the original prompt. Then I need to reorder them so that it is: common, window, portrait, cat, fireplace. After I do that, I get a nice resulting image that matches the original prompt.
First of all, this is a really fun extension. Thank you for making this. I see some issues with how the prompts are created though.
Just looking at the structure of the final prompt coming from the LLM it looks like "Use common prompt" should be checked inside RP. Because I seem to consistently get 1 too many prompt regions compared to RP regions, and the first one is usually just the original prompt that I put into the LLM. Furthermore, it often gets the regions in the wrong order, which might be improved by giving more examples to the LLM.
For instance, this is my input prompt. I deliberately describe things in the wrong order: "A photo of inside of a rustic cabin, the afternoon sun shines through a window on the left, a cat sits on the carpet below the window, to the right there is a cozy fireplace and a portrait of a lady above it"
This is the RP layout. Simple 4 quadrants, as expected: "1,1,1; 1,1,1"
This is the output prompt: "A photo of inside of a rustic cabin, the afternoon sun shines through a window on the left, a cat sits on the carpet below the window, to the right there is a cozy fireplace and a portrait of a lady above it BREAK Afternoon sun piercing through the rustic casement window, casting a warm, golden glow across the aged wooden walls. BREAK A fluffy cat, its fur softly reflecting the sunlight, seated serenely on a well-worn, braided carpet below the window. BREAK A cozy fireplace with logs glowing, framed by a rugged stone mantle, exuding warmth and comfort. BREAK An antique portrait of a lady with delicate features, framed above the fireplace, adding a touch of historical elegance."
First, I need to manually turn on "Use common prompt" because the first region is just the original prompt. Then I need to reorder them so that it is: common, window, portrait, cat, fireplace. After I do that, I get a nice resulting image that matches the original prompt.