shinying / dmp

[CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction
https://shinying.github.io/dmp/
Apache License 2.0
64 stars 5 forks source link

About the label space in semantic segmentation and depth estimation #7

Closed Ferry-Li closed 3 months ago

Ferry-Li commented 4 months ago

Thanks for your inspiring work! In the experiments of semantic segmentation, " The training and inference of the diffusion model are conducted using the color maps (in the RGB space)." and "the predicted color maps are converted to categorical label maps by assigning each pixel to its nearest category in the color space." In terms of this, how do you calculate the distance between different colors? And in the depth estimation, will there be some difference between performance when I set the label space as 'RGB' and grayscale?

shinying commented 3 months ago

Thanks for you interest. We calculate Euclidean distance in Lab space, but there is not much difference from RGB space. For depth estimation, we duplicate grayscale values for three channels, so technically they are the same.