Andy20178 / SGFormer

5 stars 1 forks source link

SGFormer

Official implementation of "SGFormer: Semantic Graph Transformer for Point Cloud-based 3D Scene Graph Generation", AAAI 2024

[Paper]

img

Figure, Overview pipeline of our proposed SGFormer. We leverage PointNet to initialize the node and edge features in the 3D scene and use LLMs ( i.e., ChatGPT) to enrich the object description text of the dataset as semantic knowledge. The SGFormer main consists of two carefully designed components: a Graph Embedding Layer and a Semantic Injection Layer.

Dataset

We followed the dataset processing method of EdgeGCN A quick glance at some features of our cleaned 3DSSG-O27R16 dataset (compared to the original 3DSSG dataset):

To obtain the 3DSSG-O27R16 dataset, please follow the instructions in EdgeGCN's project page.

Code

Acknowledgement

This resipotry is based on EdgeGCN

Citation

If you find our data or project useful in your research, please cite:

@inproceedings{lv2024sgformer,
  title={SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation},
  author={Lv, Changsheng and Qi, Mengshi and Li, Xia and Yang, Zhengyuan and Ma, Huadong},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={38},
  number={5},
  pages={4035--4043},
  year={2024}
}