CVMI-Lab / PLA

(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR 2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
Apache License 2.0

PLA & RegionPLC

This repo contains the official implementations of PLA (CVPR 2023) and RegionPLC (CVPR 2024).


PLA: Language-Driven Open-Vocabulary 3D Scene Understanding

Runyu Ding*, Jihan Yang*, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

CVPR 2023

[project page](https://dingry.github.io/projects/PLA) | [arXiv](https://arxiv.org/abs/2211.16312)

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

Jihan Yang*, Runyu Ding*, Weipeng Deng, Zhe Wang, Xiaojuan Qi

CVPR 2024

project page | arXiv

Highlights:

Release

Getting Started

Installation

Please refer to INSTALL.md for the installation.

Dataset Preparation

Please refer to DATASET.md for dataset preparation.

Training & Inference

Please refer to MODEL.md for training and inference scripts and pretrained models.

Citation

If you find this project useful in your research, please consider citing:

@inproceedings{ding2022language,
    title={PLA: Language-Driven Open-Vocabulary 3D Scene Understanding},
    author={Ding, Runyu and Yang, Jihan and Xue, Chuhui and Zhang, Wenqing and Bai, Song and Qi, Xiaojuan},
    booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
    year={2023}
}
@inproceedings{yang2024regionplc,
    title={RegionPLC: Regional point-language contrastive learning for open-world 3d scene understanding},
    author={Yang, Jihan and Ding, Runyu and Deng, Weipeng and Wang, Zhe and Qi, Xiaojuan},
    booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
    year={2024}
}

Acknowledgement

Code is partly borrowed from OpenPCDet, PointGroup and SoftGroup.