Open srymaker opened 2 months ago
No, you only need a pair <reference image, target prompt>.
Thank you for your answer. So what are the inputs and target during the training?
There are three kinds of training pairs:
Thank you,but in the paper,the qforme’s input should have the text {content} or {style},what is it
The text input of Q-former is the word "content" or "style".
Oh,i see,thanks for your patience
Hi @Tianhao-Qi , does the current released code support this "Stylized Reference Object Generation" function? Basically I want to convert my given image to a different style by providing the text only, the given image is the source image rather than the style image.
You can refer to this script. Besides, if you want to keep the structure of the source image as well, you'll need to use the controlnet.
Thanks for your great work! I want to know, when I want to do style transfer task, do I need to input a reference picture, a style word corresponding to this reference picture and a target prompt to the model? Just like This triplet <reference img,reference style word,target prompt>