Awesome Visual Localization
A curated list of visual (re)localization resources, inspired by awesome-computer-vision. The list focuses on the research of visual localization, i.e. estimates 6 DoF camera poses of query RGB/RGB-D frames in known scenes (with databases).
This document is a work in progress. Please suggest papers/resources through pull requests. If you believe this list is missing something or has factually inaccurate info, you can also new an issue. All contributions are appreciated.
Table of Contents
Direct Approaches
Approaches of this type estimate the camera pose directly from the query frame.
Image Retrieval
- [2022 IJCV] Investigating the Role of Image Retrieval for Visual Localization - An exhaustive benchmark [paper]
- [2020 3DV] Benchmarking Image Retrieval for Visual Localization [paper]
- [2018 CVPR] PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition [paper] [code]
- [2017 CVPR] Large-Scale Image Retrieval with Attentive Deep Local Features [paper]
- [2016 CVPR] NetVLAD: CNN architecture for weakly supervised place recognition [paper] [code]
- [2016 CVPR] Large-Scale Location Recognition and the Geometric Burstiness Problem [paper] [code]
- [2015 CVPR] 24/7 place recognition by view synthesis [paper] [code]
- [2014 TVCG] Real-Time RGB-D Camera Relocalization via Randomized Ferns for Keyframe Encoding [paper]
- [2013 CVPR] All about VLAD [paper]
- [2012 BMVC] Image Retrieval for Image-Based Localization Revisited [paper]
- [2012 BMVC] 6D Relocalisation for RGBD Cameras Using Synthetic View Regression
- [2011 IROS] Real-Time Loop Detection with Bags of Binary Words [paper]
Pose Regression
- [2022 ECCV] DFNet: Enhance Absolute Pose Regression with Direct Feature Matching [paper] [code]
- [2022 ECCV] Camera Pose Auto-Encoders for Improving Pose Regression [paper] [code]
- [2022 WACV] CoordiNet: uncertainty-aware pose regressor for reliable vehicle localization [paper]
- [2021 3DV] Direct-PoseNet: Absolute Pose Regression with Photometric Consistency [paper] [code]
- [2021 3DV] Visual Camera Re-Localization Using Graph Neural Networks and Relative Pose Supervision [paper] [code]
- [2021 ICCV] Learning Multi-Scene Absolute Pose Regression with Transformers [paper] [code]
- [2021 CoRL] LENS: Localization enhanced by NeRF synthesis [paper]
- [2020 ECCV] 6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference [paper] [code]
- [2020 CVPR] Learning Multi-view Camera Relocalization with Graph Neural Networks [paper]
- [2020 AAAI] AtLoc: Attention Guided Camera Localization [paper] [code]
- [2019 ECCV] CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization [paper]
- [2019 CVPR] Understanding the Limitations of CNN-based Absolute Camera Pose Regression [paper]
- [2018 ECCV] RelocNet: Continuous Metric Learning Relocalisation using Neural Nets [paper]
- [2018 BMVC] Improved Visual Relocalization by Discovering Anchor Points [paper] [code]
- [2018 CVPR] Geometry-Aware Learning of Maps for Camera Localization [paper] [code]
- [2018 RA-L] VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry [paper]
- [2017 ICCV Workshop] Camera Relocalization by Computing Pairwise Relative Poses Using Convolutional Neural Network [paper] [code]
- [2017 IROS] Deep regression for monocular camera-based 6-dof global localization in outdoor environments [paper]
- [2017 CVPR] Geometric loss functions for camera pose regression with deep learning [paper]
- [2017 CVPR] Image-based localization using LSTMs for structured feature correlation [paper]
- [2016 ICRA] Modelling Uncertainty in Deep Learning for Camera Relocalization [paper]
- [2015 ICCV] PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization [paper]
Two-Step Approaches
Approaches of this type first find correspondences between the query frame and the scene (database), then estimate the camera pose through optimization with RANSAC.
Feature Matching Based
- [2022 ECCV] MeshLoc: Mesh-Based Visual Localization [paper] [code]
- [2022 CVPR] SceneSqueezer: Learning to Compress Scene for Camera Relocalization [paper]
- [2021 IJCV] Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis [paper]
- [2021 ICCV Workshop] MegLoc: A Robust and Accurate Visual Localization Pipeline [paper]
- [2021 ICCV Workshop] Pose Refinement with Joint Optimization of Visual Points and Lines [paper]
- [2021 ICRA & ICCV Workshop] Retrieval and Localization with Observation Constraints [paper]
- [2021 CVPR] Back to the Feature: Learning Robust Camera Localization from Pixels to Pose [paper] [code]
- [2020 3DV] Using Image Sequences for Long-Term Visual Localization [paper]
- [2020 ECCV Workshop] Hierarchical Localization with hloc and SuperGlue [slides] [code]
- [2020 IROS] KR-Net: A Dependable Visual Kidnap Recovery Network for Indoor Spaces [paper]
- [2020 ICRA] To Learn or Not to Learn: Visual Localization from Essential Matrices [paper]
- [2020 arXiv] Robust Image Retrieval-based Visual Localization using Kapture [paper] [code]
- [2019 CVPR] From Coarse to Fine: Robust Hierarchical Localization at Large Scale [paper] [code]
- [2019 CVPR] D2-Net: A Trainable CNN for Joint Description and Detection of Local Features [paper] [code]
- [2018 CoRL] Leveraging Deep Visual Descriptors for Hierarchical Efficient Localization [paper] [code]
- [2018 CVPR] InLoc: Indoor Visual Localization with Dense Matching and View Synthesis [paper] [code]
- [2017 ICCV] Efficient Global 2D-3D Matching for Camera Localization in a Large-Scale 3D Map [paper]
- [2017 CVPR] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization? [paper]
- [2017 ICRA] [RA-L] Self-supervised Visual Descriptor Learning for Dense Correspondence [paper]
- [2016 TPAMI] Efficient & Effective Prioritized Matching for Large-Scale Image-Based Localization [paper]
- [2015 ICCV] Camera Pose Voting for Large-Scale Image-Based Localization [paper]
- [2012 ECCV] Improving Image-Based Localization by Active Correspondence Search [paper] [code]
- [2012 CVPR] Real-time Image-based 6-DOF Localization in Large-Scale Environments [paper]
- [2011 ICCV] Fast Image-Based Localization using Direct 2D-to-3D Matching [paper]
Scene Coordinate Regression
- [2022 3DV] Visual Localization via Few-Shot Scene Region Classification [paper] [code]
- [2022 ECCV] Towards Accurate Active Camera Localization [paper] [code]
- [2022 CVPR] CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data [paper] [code]
- [2021 ICCV] Continual Learning for Image-Based Camera Localization [paper] [code]
- [2021 TPAMI] Visual Camera Re-Localization from RGB and RGB-D Images Using DSAC [paper] [code]
- [2021 CVPR] Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor Environments [paper] [code]
- [2021 CVPR] Learning Camera Localization via Dense Scene Matching [paper] [code]
- [2021 CVPR] VS-Net: Voting with Segmentation for Visual Localization [paper] [code]
- [2020 CVPR] KFNet: Learning Temporal Camera Relocalization using Kalman Filtering [paper] [code]
- [2020 CVPR] Hierarchical Scene Coordinate Classification and Regression for Visual Localization [paper] [code]
- [2019 3DV] Let’s Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation [paper]
- [2019 TPAMI] Real-Time RGB-D Camera Pose Estimation in Novel Scenes using a Relocalisation Cascade [paper] [code]
- [2019 ICCV] SANet: Scene Agnostic Network for Camera Localization [paper] [code]
- [2019 ICCV] Expert Sample Consensus Applied to Camera Re-Localization [paper] [code]
- [2018 IROS] Exploiting Points and Lines in Regression Forests for RGB-D Camera Relocalization [paper]
- [2018 RSS] Full-Frame Scene Coordinate Regression for Image-Based Localization [paper]
- [2018 CVPR] Learning Less is More – 6D Camera Localization via 3D Surface Regression [paper] [code]
- [2017 IROS] Backtracking Regression Forests for Accurate Camera Relocalization [paper]
- [2017 CVPR] DSAC - Differentiable RANSAC for Camera Localization [paper] [code]
- [2017 CVPR] On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation [paper] [code]
- [2017 ICRA] Random Forests versus Neural Networks − What’s Best for Camera Localization? [paper]
- [2016 CVPR] Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image [paper]
- [2015 CVPR] Exploiting Uncertainty in Regression Forests for Accurate Camera Relocalization [paper]
- [2014 CVPR] Multi-Output Learning for Camera Relocalization [paper]
- [2013 CVPR] Scene Coordinate Regression Forests for Camera Relocalization in RGB-D Images [paper]
Datasets and Benchmarks
In/Outdoor
Indoor
Outdoor
Challenges
Tools