Open linnanwang opened 9 months ago
Using 16:9 aspect ratio with a width of 1024 would mean the image height is 576. This should work fine depending on the model you're training. Padding with whitespace might lead to whitespace in the outputs as well.
Thanks @DN6 , what's the right way to handle 16:9 images?
can we use forums for questions? https://github.com/huggingface/diffusers/discussions
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
Is your feature request related to a problem? Please describe. Hello there,
I have a question regarding the UNet and Lora Training in image AR other than 1:1. Right now I saw the data is resized to 1:1 using the linear interpolation provided from pytorch. But my question is how to handle 16:9 images? Thanks.
Perhaps resize the image proportionally with length to 1024 and pad the white space in width?