SHShim0513 / SD-VITON

[AAAI 2024] Towards Squeezing-Averse Virtual Try-On via Sequential Deformation.
https://arxiv.org/pdf/2312.15861.pdf
115 stars 14 forks source link

Does the model supports inputs of any resolution? #8

Open Aricling opened 4 months ago

Aricling commented 4 months ago

Hello, I'm trying to run the model with my own data, and I've done all the preprocessing required. (except for the original cloth of my own image, which I don't have, but I edited the code cause I suppose this is not needed.)

However, my image resolution is 2160x2160 and the cloth image in VITON-HD is 768x1024, which gives the error parse_arm = (np.array(mask_arm) / 255) * (parse_array == parse_id).astype(np.float32) ValueError: operands could not be broadcast together with shapes (1024,768) (2160,2160). So I want to ask whether the model supports input of any resolution?

Looking forward to your assist, thanks a lot!