Inquiry about Semantic Category Retrieval Capability in the Model

lkeab / gaussian-grouping

[ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.

https://arxiv.org/abs/2312.00732

Apache License 2.0

500 stars 37 forks source link

Inquiry about Semantic Category Retrieval Capability in the Model #7

Closed DevLinyan closed 6 months ago

DevLinyan commented 6 months ago

Could you please provide information on whether the model is equipped to identify and classify semantic categories such as desk or table? If yes, where it is.

lkeab commented 6 months ago

Yes, the provided 2D masks can be from Grounded-SAM, where each mask has category information. Or you can use Grounding DINO to pick the output masks of SAM inside the 2D bounding boxes. The 3D-2D mapping relation can lift this category information to 3D masks directly.