Closed RahulSinghalChicago closed 9 months ago
all of the splits are 1?
The notebook worked for split humans, so do I need a template example that goes with that one?
Thanks for your comments. We ran an ablation experiment for this scenario in our paper. You need a larger base ratio to get satisfactory results, and a smaller CFG is recommended because it tends to generate better results. As for the split ratio, I made slight adjustments. Here are my results for your example prompt:
python RPG.py --user_prompt 'A couple, the beautiful silver braided ponytail girl on the left happy and peaceful, the handsome blonde hair young man on the right detailed gorgeous face.' --api_key 'your api key' --use_gpt --use_base --base_ratio 0.5 --base_prompt 'a couple are chatting
Here are some results I got with batch_size=4:
The stability of the MLLM's response relies on our template library. The template in our repository is currently a trial version. Stay tuned for our upcoming series of updates. Thank you for your attention!
@YangLing0818 Thanks!
Could you post the log that came back from GPT for this one?
Edit: Also what is the adjusted split ratio?
The GPT-4 response is as follows:
Original Caption: "A couple, the beautiful silver braided ponytail girl on the left happy and peaceful, the handsome blonde hair young man on the right detailed gorgeous face."
Key Phrases Identification: For the couple, we have two main subjects, each with distinct features:
Split Ratio Planning: Horizontal Split Ratio: 1
Detailed Subregion Prompts:
Composition Logic:
Aesthetic Considerations:
Output:
Horizontal Split Ratio: 1
Vertical Split Ratio: 1,1
Final Split Ratio: 1,1
Final Regional Prompt: The girl on the left, her silver braided ponytail reflecting the soft lighting, embodying a serene and joyful aura. BREAK The young man on the right, his blonde hair and finely detailed features creating a striking portrait of handsomeness.
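For anyone who wants to check a response like this programmatically (for example, to confirm the split ratio is not just a single 1), here is a minimal sketch. It is not the repository's own parser: `parse_plan` is a hypothetical helper, and it assumes the response contains the labels "Final Split Ratio:" and "Final Regional Prompt:" in that order, with regions separated by BREAK as in the log above.

```python
import re


def parse_plan(response: str) -> dict:
    """Pull the final split ratio and per-region prompts out of a planner response.

    Hypothetical helper: assumes the labels 'Final Split Ratio:' and
    'Final Regional Prompt:' appear in that order, as in the log above.
    """
    ratio_match = re.search(
        r"Final Split Ratio:\s*([0-9.,;\s]+?)\s*Final Regional Prompt:", response
    )
    prompt_match = re.search(r"Final Regional Prompt:\s*(.+)", response, re.DOTALL)
    if not ratio_match or not prompt_match:
        raise ValueError("response does not contain the expected labels")

    # Regions are separated by the literal keyword BREAK in the regional prompt.
    regions = [part.strip() for part in prompt_match.group(1).split("BREAK")]
    return {
        "split_ratio": ratio_match.group(1).strip(),
        "regional_prompts": regions,
    }


if __name__ == "__main__":
    sample = (
        "Output: Horizontal Split Ratio: 1 Vertical Split Ratio: 1,1 "
        "Final Split Ratio: 1,1 Final Regional Prompt: The girl on the left. "
        "BREAK The young man on the right."
    )
    plan = parse_plan(sample)
    print(plan["split_ratio"])
    for region in plan["regional_prompts"]:
        print("-", region)
```

Comparing the number of regional prompts with the number of entries in the split ratio is a quick sanity check before generating images.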
I was able to reproduce. For those curious, it's imperative to put a period at the end of the base prompt for consistency, at least with albedobaseXL_v20.safetensors
python RPG.py --user_prompt 'A couple, the beautiful silver braided ponytail girl on the left happy and peaceful, the handsome blonde hair young man on the right detailed gorgeous face.' --api_key 'your api key' --use_gpt --use_base --base_ratio 0.5 --base_prompt 'a couple are chatting.'
Here's the same seed with and without the ending punctuation mark.
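If you script these runs, a small wrapper can add the trailing period and handle shell quoting, which also avoids the curly-quote and missing-space problems that creep in when commands are pasted. This is only a sketch: `with_final_period` and `build_command` are hypothetical helpers, and the flags are just the ones used in the command above.

```python
import shlex


def with_final_period(prompt: str) -> str:
    """Return the prompt with terminal punctuation, adding a period if missing."""
    prompt = prompt.rstrip()
    if not prompt or prompt.endswith((".", "!", "?")):
        return prompt
    return prompt + "."


def build_command(user_prompt: str, base_prompt: str, api_key: str,
                  base_ratio: float = 0.5) -> str:
    """Assemble the RPG.py invocation from this thread as a safely quoted string."""
    args = [
        "python", "RPG.py",
        "--user_prompt", user_prompt,
        "--api_key", api_key,
        "--use_gpt",
        "--use_base",
        "--base_ratio", str(base_ratio),
        "--base_prompt", with_final_period(base_prompt),
    ]
    # shlex.join quotes each argument with plain ASCII quotes where needed.
    return shlex.join(args)


if __name__ == "__main__":
    print(build_command(
        user_prompt="A couple, the girl on the left, the young man on the right.",
        base_prompt="a couple are chatting",
        api_key="your api key",
    ))
```

The period matters here only because of the behaviour reported above with albedobaseXL_v20.safetensors; other checkpoints may not be sensitive to it.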
Any insights on what went wrong?
CMD
python RPG.py --user_prompt 'A couple, the beautiful girl on the left, silver hair, braided ponytail, happy, dynamic, energetic, peaceful, the handsome young man on the right detailed gorgeous face, grin, blonde hair, enchanting' --version_number 0 --api_key $GPT_KEY --use_gpt --model_name 'albedobaseXL_v20.safetensors' --base_ratio .2 --activate True --batch_size 3 --cfg 8 --steps 30 --height 1024 --width 1024
Models
Using GPT4 API with albedobaseXL_v20.safetensors
Results
Log from GPT4