MarkFzp / act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
https://mobile-aloha.github.io/
MIT License
2.86k stars, 525 forks

Question about data augmentation and action coordinates #40

Closed. cynthia-you closed this issue 2 months ago

cynthia-you commented 4 months ago

Hi, after reading your paper and testing the code on both real and simulated datasets, I have the following questions:

1. In the paper, you mentioned that you used image augmentation. Is there any coupling between the image pixels and the actions, i.e. were the actions augmented too? Is there any code for reference?

2. For co-training, were the static ALOHA episodes already zero-padded for the Tracer actions in the dataset, or is the padding done in imitate_episodes.py?

3. In ALOHA, the action dimension is 14 (6+1+6+1) and the actions are joint positions, so they do not depend on any external coordinate frame. Mobile ALOHA adds 2 base actions, so which coordinate frame are these expressed in while the robot moves?
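
To make question 2 concrete, here is a minimal sketch of the zero-padding I have in mind. It assumes the 2 base actions are appended after the 14 arm dimensions; the function name `pad_static_actions` and that ordering are my own assumptions, not taken from the repository.

```python
from typing import List, Sequence

ARM_DIM = 14   # 6 joints + 1 gripper per arm, two arms (6+1+6+1)
BASE_DIM = 2   # the 2 extra base actions on Mobile ALOHA


def pad_static_actions(actions: Sequence[Sequence[float]]) -> List[List[float]]:
    """Append zero base actions to every timestep of a static ALOHA episode.

    Hypothetical helper: appending the zeros at the end is an assumption.
    """
    padded = []
    for t, step in enumerate(actions):
        if len(step) != ARM_DIM:
            raise ValueError(f"step {t}: expected {ARM_DIM} dims, got {len(step)}")
        # Copy the arm actions and add zeros for the (stationary) base.
        padded.append(list(step) + [0.0] * BASE_DIM)
    return padded


# Dummy 3-step static episode with 14-dim actions.
episode = [[0.1] * ARM_DIM for _ in range(3)]
padded = pad_static_actions(episode)
print(len(padded), len(padded[0]))
```

If the padding is done this way, a static episode would end up with the same 16-dim action shape as a mobile episode, with the base dimensions always zero.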

I'd appreciate your answers. Many thanks!