XiangFeng66 / SRGS

The first to focus on HRNVS of 3DGS
27 stars 1 forks source link

Question About Super-Splatting #2

Open Rashfu opened 3 months ago

Rashfu commented 3 months ago

The original text says, Specifically, the 2D Gaussian projections in screen space are super-resolved to synthesize HR novel views. But what does it mean to "super-resolve" 2D Gaussians? I'm not clear on how this is actually implemented.

And I look forward to the updated README.md, which will tell us how to use your code.

Thanks in advance!

XiangFeng66 commented 3 months ago

Thank you for your attention. First of all, 3DGS splashes 2DGS onto the low-resolution rendering image during low-resolution training, and SRGS directly splashes out the high-resolution rendering image. This is called super-resolve. Secondly, the code of SRGS is very simple, and there are few changes in 3DGS. It is easy to understand if you read it carefully.

Rashfu commented 3 months ago

I see what you mean. Thanks for the explanation.

XiangFeng66 commented 1 month ago

Hello, have you run my code? If you reproduce some results, please contact me. I have some questions for you. Thank you.

Rashfu commented 1 month ago

Hi @XiangFeng66, I tried reproducing your results on the Synthetic NeRF, Tanks & Tamples datasets and Mip-NeRF 360 Dataset., but I failed. (single RTX 3090)

As for the code, I'm curious why resolution_ori is fixed to be multiplied by 4 in the loadCam function here. If I want to conduct a 2x super-resolution experiment, should I change it to 2? If that's the case, there might be other parts of the code where the factor is also fixed to 4 and would need to be modified.

XiangFeng66 commented 1 month ago

Yes,for the Tank dataset, it needs to be processed by colmap. Regarding the blender dataset, you need to refer to the colmap reading method to modify it, because this code is not organized by me. If you want to experiment with X2, you need to modify the Loadcam function and the Camera class. I think it is very simple.

Rashfu commented 1 month ago

Thanks for your reply. I will try to reproduce the results.

XiangFeng66 commented 1 month ago

Sorry about the OOM issue, I think you should use RTX4090. The two-dimensional model processing data must be inside the code to prevent distortion of the pictures during the data storage process, so Swinir needs to reason about each picture, which will consume a little time. For the loss problem, I use loss_sr for ablation experiments. You can add the following loss_lr to reproduce the results of our paper.