scene-verse / SceneVerse

Official implementation of the ECCV 2024 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
https://scene-verse.github.io
MIT License

Frozen LM for object-level grounding #15

Closed (sayands closed this issue 1 month ago)

sayands commented 1 month ago

Hi, first of all, thanks for your nice work! I was curious which frozen language model you used for object-level grounding, as mentioned in Sec. 4.1 of the paper.

Please let me know. Thanks!

Buzz-Beater commented 1 month ago

We used a frozen BERT model when pre-training the alignment between the point cloud (pcd) object encoder and the language model.
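
For readers who want to see what such a setup can look like in code, here is a minimal sketch. It is not taken from the SceneVerse codebase. The tiny randomly initialised BERT config, the `ToyObjectEncoder` class, the symmetric contrastive loss, and the temperature of 0.07 are all assumptions made for illustration; the thread only states that a frozen BERT model was used.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import BertConfig, BertModel

torch.manual_seed(0)

# Tiny randomly initialised BERT so the sketch runs offline (assumption).
# In practice one would load pretrained weights with BertModel.from_pretrained(...);
# the exact checkpoint is not stated in this thread.
config = BertConfig(vocab_size=100, hidden_size=32, num_hidden_layers=1,
                    num_attention_heads=2, intermediate_size=64)
bert = BertModel(config)

# Freeze the language model: no gradients, no dropout.
for p in bert.parameters():
    p.requires_grad_(False)
bert.eval()


class ToyObjectEncoder(nn.Module):
    """Hypothetical PointNet-style stand-in for the pcd object encoder."""

    def __init__(self, out_dim):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, out_dim))

    def forward(self, points):
        # points: (batch, num_points, 3) -> per-point features -> max-pool over points
        return self.mlp(points).max(dim=1).values


obj_encoder = ToyObjectEncoder(config.hidden_size)
# Only the object encoder's parameters are handed to the optimizer.
optimizer = torch.optim.AdamW(obj_encoder.parameters(), lr=1e-3)

# Dummy batch: 4 objects with 128 points each, paired with 4 token sequences.
points = torch.randn(4, 128, 3)
input_ids = torch.randint(0, config.vocab_size, (4, 8))
attention_mask = torch.ones_like(input_ids)

# Text features come from the frozen model; use the first ([CLS]) token.
with torch.no_grad():
    text_feat = bert(input_ids=input_ids,
                     attention_mask=attention_mask).last_hidden_state[:, 0]

obj_feat = F.normalize(obj_encoder(points), dim=-1)
text_feat = F.normalize(text_feat, dim=-1)

# Symmetric contrastive loss (assumed): matching pairs lie on the diagonal.
logits = obj_feat @ text_feat.t() / 0.07
targets = torch.arange(logits.size(0))
loss = 0.5 * (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets))

optimizer.zero_grad()
loss.backward()
optimizer.step()
```

After `loss.backward()`, only the object encoder receives gradients. The BERT weights stay unchanged, which is what "frozen" means here.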