gligen / GLIGEN

Open-Set Grounded Text-to-Image Generation
MIT License
1.98k stars 148 forks source link

How to combine 2d box and canny edge to control the image generation together? #29

Open Estrellama opened 1 year ago

Estrellama commented 1 year ago

Thank you for the subsequent updates on more controllable methods, including edge, depth, etc. So fast.

But I have a question, when I want to combine 2d box and canny edge to control the image generation together, how to redesign the UNet network structure?

For example, roughly stacking two gated self attention layers, one for fusing 2d box embedding, and the other for fusing edge embedding? Any more experience recommendations?

I would like to get your answer!

1028686314 commented 11 months ago

hello, do you have solved this question, i meet the same, maybe we can communicate with this

2a3b4c commented 9 months ago

hello, did you solve this problem? @1028686314 @enhuima