ZzZZCHS / Chat-Scene

Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)
MIT License

Training datasets for v2.1 version #33

Closed ZCMax closed 6 months ago

ZCMax commented 6 months ago

Thanks for your great work! May I know whether you made any updates to the training datasets for version 2.1?

ZzZZCHS commented 6 months ago

To evaluate on datasets such as Multi3DRefer and SQA3D, we added their train splits to the training data. In v2.1, we train the model in a single joint-training stage, so we removed some data that was only needed for the object-level/scene-level alignment proposed in our paper. There is still a lot of room to explore adding more high-quality datasets during training.
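
As a rough illustration of the change described above, the sketch below filters a candidate dataset list down to a single joint-training mix. It is not the repository's actual configuration: `DatasetEntry`, `build_joint_training_mix`, the `purpose` field, and the two alignment dataset names are all hypothetical, and only Multi3DRefer and SQA3D come from this thread.

```python
# Hypothetical illustration only: the names and structure here are assumptions,
# not Chat-Scene's real config format or dataset list.
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetEntry:
    name: str
    split: str    # e.g. "train" or "val"
    purpose: str  # "task" for downstream tasks, "alignment" for alignment-only data


def build_joint_training_mix(entries):
    """Keep train-split task data and drop alignment-only data."""
    return [e for e in entries if e.split == "train" and e.purpose == "task"]


candidates = [
    DatasetEntry("multi3drefer", "train", "task"),
    DatasetEntry("sqa3d", "train", "task"),
    DatasetEntry("object_alignment", "train", "alignment"),  # hypothetical name
    DatasetEntry("scene_alignment", "train", "alignment"),   # hypothetical name
]

joint_mix = build_joint_training_mix(candidates)
print([e.name for e in joint_mix])
```

Tagging each entry with a purpose keeps the single-stage mix explicit, so adding another dataset later only means appending one more entry.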

ZCMax commented 6 months ago

So you removed the generated datasets in v2.1 and only use the existing human-annotated 3D-VL datasets?

ZzZZCHS commented 6 months ago

Yes.