Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration

Introduction

This repository contains the implementation of our Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration. Our model is designed to process and align 3D point cloud data from various datasets, including 3DMatch and KITTI. markdown

System Overview

Below is an overview of our EGNN model architecture:

Model Overview

Read the full paper here

Environment Setup

The code is tested on pyg (Pytorch-Geometric) 2.4.0, python 3.8, Pytorch 2.0.0, cuda11.8, GPU RAM at least 8GB with batch size 1 on GTX 2080 above. Noted, my current code implementation with batch size one only consumes less than 0.9GB RAM for 2048 points on GPU!!! and the batch size by default is set to one, other than one may result in some training error, I will fix the batch size bug soon, please stay tuned, and batch size one can be enough for the current training as data size is not that big. The code can be ported onto edge device easily to support mobile applications.

To set up the environment for this project, we use Conda. Follow these steps:

Make sure you have Conda installed. If not, you can download it from here.
Clone this repository, All required packages are specified in the environment.yml file.

$conda env create -f environment.yml $conda activate egnn-test

Data

To run this project, you'll need to download the following pre-processed datasets in our self-defined format:

Data Processing

For the two dataloaders of datasets, we provide dataloader scripts in the datasets folder:

3DMatch.py: For processing 3DMatch dataset
KITTI.py: For processing KITTI dataset

Custom Data Processing

For self-processing data, please check the scripts in 'data_preprocess' folder for each individual training data processing: If you own dataset is ordered sequence point cloud frames, just reuse the same KITTI processing script to process the sequentail point cloud frames, So the source and target scans are using $i$ th and $i+1$ th frame respectively.

process_KITTI_Feature.py: For processing sequential dataset.

For processing KITTI dataset. Otherwise, if your point cloud frames are unordered, please refer to the 3D Match script to process, yet you have to establish the correspondence between source and target point scans, with a minimum 30% point overlapping between source and target scans, otherwise, we refer you to use public library like Open3D, PCL (Point Cloud Library), scikit-learn, through KDTree or Octree to create source and target frame correspondence with engouh point overlappings. Original 3DMatch already processed it for use. For further scan pair match, you can refer to the PointDSC repository to process the feature descriptors, FPFH, FCGF, as most of our data preprocessing codes are adapted based on their codes.

process_3DMatch_Feature.py: For processing paired scan dataset with enough overlapping > 30%. We use too conditions to determined the gt correspondence from source to target points, the transformed source point position distance to the target point is smaller than a threshold along with the point feature descriptors dot product similairty bigger than a threshold, label as one, otherwise, we only use point neighbouring search to find the match point pair, label as zero,

Training

To train the EGNN model, run the following script train_egnn.py in the src folder: $python src/train_eval_egnn.py

One more thing to the training of custom dataset training, you can set "use_pointnet" flag in the train model code to true, so that the model will train the model in end2end way from input point cloud scan pair to feature descriptor extraction, and until to equi-gnn regression, as custom dataset scenes may have some gap to indoor 3D Match or KITTI outdoor datasets. But indeed some more training time and tuning of layer hyper-params may be needed for this end2end training, and we also recommend you to use pre-trained PointTransformerV2, PointTransformerV3, the point transformer can be used as encoder for point feature descriptor extraction, to replace pointnet encoder in the code. By using the pre-trained -point transformer encoder weights fine-tuned on custom dataset you can mitigate the data gap, and it helps to converge fast based on our recent tests.

tensorboard logs are exported under "./runs" directory relative to the run script.

Evaluation

In same train_eval_egnn.py, set the "mode" variable in the main function to "test" mode, then run it to load the test data for evaluation.

For metric results generation, use the evaluation.py script located in the tools folder to load the saved results of model output in inference model and compared the results under a specified directorh against another directory of gt poses: $python tools/evaluation_metrics.py

Citation

If you find our work useful in your research, please consider citing:


@inproceedings{kang2025equi,
  title={Equi-GSPR: Equivariant SE (3) Graph Network Model for Sparse Point Cloud Registration},
  author={Kang, Xueyang and Luan, Zhaoliang and Khoshelham, Kourosh and Wang, Bing},
  booktitle={European Conference on Computer Vision},
  pages={149--167},
  year={2025},
  organization={Springer}
}

alexandor91 / se3-equi-graph-registration

readme