Official implementation of "SGFormer: Semantic Graph Transformer for Point Cloud-based 3D Scene Graph Generation", AAAI 2024
Figure, Overview pipeline of our proposed SGFormer. We leverage PointNet to initialize the node and edge features in the 3D scene and use LLMs ( i.e., ChatGPT) to enrich the object description text of the dataset as semantic knowledge. The SGFormer main consists of two carefully designed components: a Graph Embedding Layer and a Semantic Injection Layer.
We followed the dataset processing method of EdgeGCN A quick glance at some features of our cleaned 3DSSG-O27R16 dataset (compared to the original 3DSSG dataset):
more-comfortable-than
) filtered out - we focus on structural relationships instead;To obtain the 3DSSG-O27R16 dataset, please follow the instructions in EdgeGCN's project page.
This resipotry is based on EdgeGCN
If you find our data or project useful in your research, please cite:
@inproceedings{lv2024sgformer,
title={SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation},
author={Lv, Changsheng and Qi, Mengshi and Li, Xia and Yang, Zhengyuan and Ma, Huadong},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={38},
number={5},
pages={4035--4043},
year={2024}
}