modelscope / richdreamer

Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
https://aigc3d.github.io/richdreamer/
Apache License 2.0
396 stars 14 forks source link

How to normalize the disparity (or depth) to feed into VAE in RichDreamer? #3

Closed thucz closed 8 months ago

thucz commented 9 months ago

Hi! Since Monocular Depth Network provides a relative disparity. How do you normalize the disparity (or depth) to feed into VAE?

lingtengqiu commented 9 months ago

You can see the supplementary from arxiv version Instead of disparity which has very poor property, we use relative depth ( setting near and far plane) to feed into VAE

thucz commented 9 months ago

Thanks for your reply! But I still wonder what values you set the near and far planes for the predicted depth for Monocular Depth Network. Could you give me a script of how you exactly process the predicted disparity of MiDaS on LAION datasets?

lingtengqiu commented 8 months ago

For the predicted depth from Midas, we normalize disparity (dis - min(dis)) / (max(dis) - min(dis)), given a prompt "xxx". On the other hand, when we finetune multi-view normal-depth diffusion model on objaverse, we transfer disparity domain to inverse depth domain, given a prompt "xxxx, 3D asset."

thucz commented 8 months ago

Thanks! I got it.