by Laura Fink, Darius Rueckert, Linus Franke, Joachim Keinert, and Marc Stamminger
Official repository | Link to project page | My home page
Abstract: Existing real-time RGB-D reconstruction approaches, like Kinect Fusion, lack real-time photo-realistic visualization. This is due to noisy, oversmoothed or incomplete geometry and blurry textures which are fused from imperfect depth maps and camera poses. Recent neural rendering methods can overcome many of such artifacts but are mostly optimized for offline usage, hindering the integration into a live reconstruction pipeline.
In this paper, we present LiveNVS, a system that allows for neural novel view synthesis on a live RGB-D input stream with very low latency and real-time rendering. Based on the RGB-D input stream, novel views are rendered by projecting neural features into the target view via a densely fused depth map and aggregating the features in image-space to a target feature map. A generalizable neural network then translates the target feature map into a high-quality RGB image. LiveNVS achieves state-of-the-art neural rendering quality of unknown scenes during capturing, allowing users to virtually explore the scene and assess reconstruction quality in real-time.
@inproceedings{fink2023livenvs,
title={LiveNVS: Neural View Synthesis on Live RGB-D Streams},
author={Fink, Laura and R{\"u}ckert, Darius and Franke, Linus and Keinert, Joachim and Stamminger, Marc},
booktitle={SIGGRAPH Asia 2023 Conference Papers},
pages={1--11},
year={2023}
}
sudo apt install -y build-essential libx11-dev xorg-dev libopengl-dev freeglut3-dev cmake
sudo apt install -y libassimp-dev # optional, to reduce compile times
Configure environment. The script will create a conda environment "torchgl" (including cuda-11.6 and cudnn) and download libtorch.
chmod +x ./configure.sh
./configure.sh
In run.sh, check the set compiler versions and adapt to you system if neccessary.
chmod +x ./run.sh
./run.sh
See the exemplary settings.json
and launch.json
in ./example_vscode_files
. Use vsc_env_init.sh
to start vs code with the torchgl env activated and other environment variables set.
F2: Use free view point camera
F3: Use dataset camera (Active on launch)
F1: GUI
*Notes: