huang-yh / GaussianFormer

[ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Other
278 stars 20 forks source link

vis results #27

Open Jasper-sudo-Sun opened 1 month ago

Jasper-sudo-Sun commented 1 month ago

Thank you for open-sourcing the code. I'd like to ask why, in your visualization results, most of the gaussians still maintain a spherical shape and haven't changed significantly?

Huijie-Liu commented 1 month ago

I have a similar question regarding the optimization of Gaussians. After saving and visualizing the optimized Gaussians as PLY files, I observed that all the Gaussian spheres are almost identical in size. If the primary purpose of these Gaussian spheres is to represent occupancy, wouldn’t using just spherical shapes suffice? This could potentially reduce the dimensionality of parameters.

Below is the visualization of the Gaussians generated based on the 144,000 model (with all Gaussians of category 17, which are empty Gaussians, removed) and corresponding occupancy. CleanShot 2024-09-26 at 13 05 37@2x a22bea2f-fb77-47f7-8ccb-08c47c59c987

huang-yh commented 1 month ago

Indeed, the Gaussians in the current version do not show much diversity in their scales. We think it is because we have constrained the Gaussians to have similar size as a single voxel, in which case the occupancy supervision itself does not show much diversity in shapes. However, we do notice some flattened Gaussians on the surface of the road.

bqm1111 commented 1 day ago

I have a similar question regarding the optimization of Gaussians. After saving and visualizing the optimized Gaussians as PLY files, I observed that all the Gaussian spheres are almost identical in size. If the primary purpose of these Gaussian spheres is to represent occupancy, wouldn’t using just spherical shapes suffice? This could potentially reduce the dimensionality of parameters.

Below is the visualization of the Gaussians generated based on the 144,000 model (with all Gaussians of category 17, which are empty Gaussians, removed) and corresponding occupancy.

How do you generate visualization like this?

bqm1111 commented 17 hours ago

Indeed, the Gaussians in the current version do not show much diversity in their scales. We think it is because we have constrained the Gaussians to have similar size as a single voxel, in which case the occupancy supervision itself does not show much diversity in shapes. However, we do notice some flattened Gaussians on the surface of the road.

Could you elaborate more on how you have constrained the Gaussians to have similar size as a single voxel?