Closed lh-dm closed 4 months ago
Answer to: https://github.com/lfranke/TRIPS/issues/23#issuecomment-1944274015
Hi, @dm-de, I will try to answer your questions here to keep these two different topics separate.
Can you also give me some hints?
- Can you confirm, that "tt_experiments.zip" works? Here are no checkpoint's in zip. See here my post: viewer.exe not working #22
I think the folders themselves that containing the training epochs are the checkpoints. Yes, they are not called checkpoints. That surprised me too, but I think the description in the readme file is just a structure tree.
Maybe the same as mine “dense_point_cloud.ply”
. ?
- What is your GPU memory usage with viewer.exe & playground scene I have a issue with viewer and I'm not sure, if this happen, because I have only 8GB VRAM or because my exe ist compiled wrong. I think this has something to do with saiga & cuda compute capability. My special problem is, that I compile on one machine and copy release to another machine (where I have no admin rights). For this I need to set cuda version manually... It is hard to find out what is wrong.
From what I currently know, you don't need the experiments folder
at all to use the viewer. I'll test this again tomorrow (up until now I had only moved the checkpoints/experiments to a subfolder for testing). I'll throw out the experiments folder
completely tomorrow, then I'll try to start the viewer with just the scenes folder
present and I'll pay attention to the VRAM. I'll report back tomorrow.
I have a 3090 with 24GB VRAM.
I also suspect that something has been compiled badly for you or is a question of admin rights.
Have you tried to recompile it?
I am disappointed. I have compiled again. For some reason cmake has set different cuda versions. I have set everything to cuda 7.5 only... But it makes no difference.
My 8GB VRAM is probably not enough. :-( viewer crashes without error message. It would be great if there was a very slimmed-down viewer.
train on the other hand works (but only with train_crop_size param) VRAM is hard at the limit 7.8 of 8GB used!
Hey, here are my test results.
VRAM usage: If you have limited VRAM, don't try the viewer with the "train" or "playground", instead use the "horse" for testing.
Here is my VRAM usage in various scenes (1.1 GB, is already occupied on my system without the viewer, so that's where it comes from):
Train =1,1 GB + 5,2 GB = 6,3 GB
Playground =1,1 GB + 5,1 GB = 6,2 GB
Horse =1,1 GB + 3,5 GB = 4,6 GB
Each scene should work with 8GB of VRAM, but if you have problems try the smaller scenes rather than the largest ones.
Result: If the experiments folder has been removed:
To open a scene, the scenes folder AND the experiments folder must exist. With removed experiments folder it dosnt work. I also tried putting the experiments themselves in a subfolder and also renaming them to checkpoint_playground. Result: Both use cases seem to work fine, as long as the experiments are somewhere in the experiments folder it should work.
But I have it like in your first screenshot and that works fine.
I have compiled again.
You need:
CUDA 11.8 (make sure to at least include Nsight NVTX, Development/ , Runtime/Libraries/ and the Visual Studio Integration)
These are the things I paid attention to when compiling for the twentieth time, maybe it will help.
Otherwise I don't know what to do next.
Hi,
yes, the folders in the tt_experiments.zip and tt_scenes.zip are named the same. Sorry for the oversight, I will update this in the supplemental soon. In the meantime, you can rename the folders in the experiments however you want.
Thanks @iFimo, yes the viewer fails if the experiments folder is completely missing, otherwise if the folder exists but no experiments are present just the Model View of the point cloud will start and you will be unable to switch to Neural View.
I just retried on Windows with a RTX2070 and my VRAM measurements were similar to @iFimo.
Maybe try to start the viewer with --render_scale 0.5
, this forces the resolution to be lower and should require less VRAM.
I (dm-de = lh-dm) got viewer working without crash. I copied TRIPS root directory with all files to other computer. No other tool installation was requred (conda, cuda etc.). I don't know what was missing or wrong before.
Hi, that is indeed odd! But I'm happy that it is working for you :)
I can't get viewer to work. I have downloaded "tt_scenes.zip" and unzipped it in the scene folder. I downloaded "tt_experiments.zip" and unzipped it in the experiments folder.
This is what it currently "experiments" looks like for me:
But it should look so (???):
I am already confused - here are no checkpoints.
I tried it anyway:
And then it stops... no error... but i found in %AppData% huge crash dump files - each over 1GB:
Then I started to train... That didn't work first and stopped at Epoch 1.
complete log: log.txt
Then I changed the crop-size and this solved: --TrainParams.train_crop_size 256
I managed to train up to epoch 10. Epoch 10 was my goal - it's OK for the 1st run.
This is my generated data:
The question is now... how can I view it? Nothing seems to work...
My GFX-Card has 8 GB and is during training fully loaded (>7 GB).