pals-ttic / sjc

Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation (CVPR 2023)
https://pals.ttic.edu/p/score-jacobian-chaining
Other
500 stars 15 forks source link

Query : 4 channels in nerf output #25

Closed aradhyamathur closed 1 year ago

aradhyamathur commented 1 year ago

Hi, I have a query pertaining to the voxnerf implementation, the features generated 4 channels, whereas the density feature is computed separately, does the last channel correspond to alpha ?

aradhyamathur commented 1 year ago

I wanted to test with vanilla nerf implementation like stable diffusion, the outputs there are 3 channel, so wanted to know what change could be made for the same.

w-hc commented 1 year ago

The four channels are for the latent features of Stable Diffusion. It’s not RGBa. The density is controlled by another small 1-channel voxel.

aradhyamathur commented 1 year ago

Oh I see, so it's essentially predicting the latents which are then denoised by the SD network to produce the RGB while density is predicted from the other module. Is that correct ?

w-hc commented 1 year ago

That’s right.

aradhyamathur commented 1 year ago

Thanks for the clarification.