About input image simple process

AiuniAI / Unique3D

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

https://wukailu.github.io/Unique3D/

MIT License


About input image simple process #36

Closed. Yzhbuaa closed this issue 4 months ago

Yzhbuaa commented 5 months ago

Hello,

Thank you for the excellent work. I have a question regarding the preprocessing step. Could you please explain why the input image needs to be cropped before being fed into the pipeline? Is this related to how the model was trained?

I experimented with using the original, uncropped image, but the generated results were not as good. Specifically, when feeding the original image into the pipeline, the side views often have a 'long tail' behind the object.

Thank you for your assistance.

wukailu commented 4 months ago

During training, the data is scaled to fit the [-1, 1] box, so the model implicitly learns that the object reaches the box boundary either at its left and right sides or at its front and back.
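To make the answer above concrete, here is a minimal sketch of the two ideas involved. It is not taken from the Unique3D code base: the function names `scale_to_unit_box` and `crop_to_foreground`, the `margin` parameter, and the assumption that the input is an RGBA array with a usable alpha mask are all hypothetical and chosen for illustration. The first function shows how uniform scaling into the [-1, 1] box makes the longest axis touch the boundary. The second shows one plausible way to crop an image so the object fills the frame, which would match that assumption. A plausible reading of the 'long tail' is that when the object does not span the image width, the model assumes the front and back must reach the box instead, but this is an inference, not something stated in the thread.

```python
import numpy as np


def scale_to_unit_box(points: np.ndarray) -> np.ndarray:
    """Center an (N, 3) point set and scale it uniformly to fit [-1, 1]^3.

    Hypothetical helper: illustrates the normalization described in the
    thread, not the project's actual training code.
    """
    lo = points.min(axis=0)
    hi = points.max(axis=0)
    center = (lo + hi) / 2.0
    half_extent = (hi - lo).max() / 2.0  # the longest axis sets the scale
    if half_extent == 0:
        raise ValueError("point set has zero extent")
    return (points - center) / half_extent


def crop_to_foreground(rgba: np.ndarray, margin: float = 0.05) -> np.ndarray:
    """Crop an (H, W, 4) image to a square around pixels with non-zero alpha.

    Hypothetical helper: `margin` is the border added on each side, as a
    fraction of the longer side of the object's bounding box.
    """
    alpha = rgba[..., 3]
    ys, xs = np.nonzero(alpha)
    if ys.size == 0:
        raise ValueError("image has no foreground pixels")
    top, bottom = ys.min(), ys.max() + 1
    left, right = xs.min(), xs.max() + 1
    h, w = bottom - top, right - left
    side = int(round(max(h, w) * (1.0 + 2.0 * margin)))
    # Paste the bounding box into the center of a transparent square canvas.
    canvas = np.zeros((side, side, 4), dtype=rgba.dtype)
    y0 = (side - h) // 2
    x0 = (side - w) // 2
    canvas[y0:y0 + h, x0:x0 + w] = rgba[top:bottom, left:right]
    return canvas
```

After `scale_to_unit_box`, at least one axis spans the full [-1, 1] range while the others may be shorter. After `crop_to_foreground`, the object's longer image dimension nearly fills the square frame, so the image is closer to what a model trained on such normalized data would expect.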