About the label space in semantic segmentation and depth estimation

shinying / dmp

[CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction

Apache License 2.0

64 stars 5 forks source link

Thanks for your inspiring work! In the experiments of semantic segmentation, " The training and inference of the diffusion model are conducted using the color maps (in the RGB space)." and "the predicted color maps are converted to categorical label maps by assigning each pixel to its nearest category in the color space." In terms of this, how do you calculate the distance between different colors? And in the depth estimation, will there be some difference between performance when I set the label space as 'RGB' and grayscale?

shinying / dmp

About the label space in semantic segmentation and depth estimation #7