vimalabs / VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
MIT License
778 stars 87 forks source link

Action dim not match #51

Open Chris-Chow opened 9 months ago

Chris-Chow commented 9 months ago

In the training dataset which you provide, action dims are like:

--------------------------------------------------
Action
pose0_position : (2, 3)
pose0_rotation : (2, 4)
pose1_position : (2, 3)
pose1_rotation : (2, 4)
--------------------------------------------------

But the in the trajectory I generate by oracle, action dims are like:

--------------------------------------------------
Action
pose0_position : (2, 2)
pose0_rotation : (2, 4)
pose1_position : (2, 2)
pose1_rotation : (2, 4)
--------------------------------------------------

so are the dims of the action space of environment.

In your vima-bench, the action dims of pose0_position and pose1_position are 2, but in your training dataset, the corresponding action dims are 3. So that the action in the traing dataset can't be used in the vima-bench environment and don't match the corresponding dim in the models you provide.

But I notice that when the dims of action['pose0_position'] and action['pose1_position'] are 3, the action can still be used in the environment, so I'd like to ask if it means the third dim in the actions of datasets is useless and I can just ignore it. If not, how to solve this question so that the dataset can be utilized.

Thanks!

chengchenga commented 9 months ago

can you run the vima? I get some problems and can't run it. But i can run vima-bench

zcczhang commented 5 months ago

it means the third dim in the actions of datasets is useless and I can just ignore it

yes since the primitive actions are 2D.