mharrer97 / inovis

MIT License
23 stars 4 forks source link

Inovis: Instant Novel-View Synthesis

Mathias Harrer1 , Linus Franke1 , Laura Fink1,2 , Marc Stamminger1 , Tim Weyrich1 \ 1 Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany\ 2 Fraunhofer IIS, Erlangen, Germany\ image

Abstract: Novel-view synthesis is an ill-posed problem in that it requires inference of previously unseen information. Recently, reviving the traditional field of image-based rendering, neural methods proved particularly suitable for this interpolation/extrapolation task; however, they often require a-priori scene-completeness or costly preprocessing steps and generally suffer from long (scene-specific) training times. Our work draws from recent progress in neural spatio-temporal supersampling to enhance a state-of-the-art neural renderer's ability to infer novel-view information at inference time. We adapt a supersampling architecture [Xiao et al. 2020], which resamples previously rendered frames, to instead recombine nearby camera images in a multi-view dataset. These input frames are warped into a joint target frame, guided by the most recent (point-based) scene representation, followed by neural interpolation. The resulting architecture gains sufficient robustness to significantly improve transferability to previously unseen datasets. In particular, this enables novel applications for neural rendering where dynamically streamed content is directly incorporated in a (neural) image-based reconstruction of a scene. As we will show, our method reaches state-of-the-art performance when compared to previous works that rely on static and sufficiently densely sampled scenes; in addition, we demonstrate our system's particular suitability for dynamically streamed content, where our approach is able to produce high-fidelity novel-view synthesis even with significantly fewer available frames than competing neural methods.

[Webpage][Paper PDF][Paper PDF low-res][Video][Bibtex]

image

Compile Instructions:

Custom Dataset

image left: capture image; right: novel view

Kitti-360 Dataset

image left: capture image; right: novel view

Office Dataset

image left: novel view; right: ground truth

Playground Dataset

image left: novel view; right: ground truth

Interactive Viewer

image