gsgen3d / gsgen

[CVPR 2024] Text-to-3D using Gaussian Splatting
https://arxiv.org/abs/2309.16585
MIT License
761 stars 42 forks source link

Running main.py looks like doing training? #5

Closed darthjaja6 closed 11 months ago

darthjaja6 commented 11 months ago

image This shows up in log, I wonder if I did something wrong or just misunderstood. Isn't it supposed to be an inference?

Phylliida commented 11 months ago

TLDR: you're running inference for a single 3D model, it's just very slow

I'm not a dev for this project but how this works is it starts with a rough shape given by Point-E, then uses stable diffusion (by default, u can change this in base.yaml config guidance) from different perspectives to guide Gaussian Splatting into the right shape. Their way of doing it takes a lot of iters to converge for a single 3D model (it takes me ~3 hours on a RTX 3090 to do 15000 steps). You can see the progress in outputs/\<date>/\<date>/checkpoints/\<prompt>/\<date>/\<id>/eval, look in the image and video folders.

How frequently it shows you progress pictures/video is in base.yaml in eval, image_period and video_period

darthjaja6 commented 11 months ago

@Phylliida Well explained, thank you!

genitrix commented 11 months ago

TLDR: you're running inference for a single 3D model, it's just very slow

I'm not a dev for this project but how this works is it starts with a rough shape given by Point-E, then uses stable diffusion (by default, u can change this in base.yaml config guidance) from different perspectives to guide Gaussian Splatting into the right shape. Their way of doing it takes a lot of iters to converge for a single 3D model (it takes me ~3 hours on a RTX 3090 to do 15000 steps). You can see the progress in outputs///checkpoints////eval, look in the image and video folders.

How frequently it shows you progress pictures/video is in base.yaml in eval, image_period and video_period

Hi @Phylliida, can you get the same result from the paper? I ran the training successfully but I got a result very different from my prompt.