szymanowiczs / splatter-image

Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' CVPR 2024
https://szymanowiczs.github.io/splatter-image
BSD 3-Clause "New" or "Revised" License
795 stars 54 forks source link

question about scale #23

Closed johnren-code closed 6 months ago

johnren-code commented 6 months ago

I wonder why opacity_scale(0.001), xyz_scale(0.000001), scale_scale(0.001) of shapenet cars dataset is set to such value?

johnren-code commented 6 months ago

Another question, in the article, you mentioned using a camera transformation based on the relative pose of the first frame of the image, if I want to implement a predicted Gaussian parameter based on a single image and its camera pose, and render the resulting image under the same camera pose, i.e., train on a single image, would it be reasonable to use the corresponding camera pose information inside the dataset directly?

johnren-code commented 6 months ago

what's more, i also want to know how to determine the value of znear and zfar of other synthetic dataset.

szymanowiczs commented 6 months ago

Hi,

johnren-code commented 6 months ago

Thank you for your answer!

When you mention distance to object do you mean the distance from the camera to the object in the world coordinate system? And how exactly are the dimensions of w and distance to object defined? As a rule of thumb, I set the znear and zfar to 0.01 and 10 respectively for the new dataset in the first experiment, do you think this is reasonable?