Open lolwarmaze opened 5 months ago
Thank you for your interesting. it seems there is no point output from colmap.
i am working on a custom data processing, loader and trainer. Stay tuned, in the next two weeks.
Hi, i just pushed code to process raw static-placed multi-camera captured mp4 videos without any prior. (current custom processing can not handle fisheye. which needs prior intrics). Please allow for some artifacts in reconstructed models.
code to process videos (suppose your videos is at /data/videos/*.mp4
)
python script/pre_no_prior.py --videosdir /data/videos
colmap points will be generated at /data/videos/point/colmap_*
you can remove all the intermediate results images in /data/videos/frames
after you can get good colmap points
code to train model
python train.py --eval --config configs/techni_lite/noprior.json --model_path <path to save model> --source_path /data/videos/point/colmap_0
add options for memory effient training (at cost of longer training time, larger cpu memory requirment)
--data_device cpu --gtisint8 1
after the two options, it only requires 4~5GB gpu memory for 18 cameras each with 50 frames (orginal 2 k resolution downsample to 1k resolution images)
I tested on the 170915_toddler5 dataset in the CMU Panoptic Studio dataset, the result after training with 31 HD Cams was very poor,it is almost impossible to observe any valid content; with 479 VGA Cams, the result was also unsatisfactory. For this 360° video dataset, how can we improve the quality? Are there any hyperparameters that can be adjusted? 31 HD Cams Result 👆 479 VGA Cams Result 👆
hyperparameters like "--trbfslinit 4" (this will make the initial points cover fewer neighbouring time.) "--trbfs_lr 0.05" will increase the changing of 1d gaussian shape.
did you use pose priors in this dataset?
No. I did not use pose priors in this dataset. I used the script/pre_no_prior.py
mentioned in your reply above for preprocessing. I will try training with the hyperparameters you mentioned later.
Thank you very much for adding this method for custom data, I am getting a lot better results than I was using LLFF to get poses_bounds.npy and n3d dataloader. Appreciate your work
No. I did not use pose priors in this dataset. I used the
script/pre_no_prior.py
mentioned in your reply above for preprocessing. I will try training with the hyperparameters you mentioned later.
Hi, @TonyDua Did you get good results on the CMU Panoptic Studio dataset, I tried but failed.
Hi,
I am trying to train a 4D Gaussian model using my own set of stereo video pairs. I have two video files (.mp4) from my stereo camera setup and have generated a
poses_bounds.npy
file with LLFF format because I was unable to generate camera poses for a stereo camera setup using COLMAP. Now, when I try to preprocess it withpre_n3d.py
script, it is able to extract features but I get the following error:I also checked my poses orientation and they look okay. What do you think might be the issue here ? Also if someone has any guideline for working with custom stereo datasets then please let me know. Thanks!