Open Doan-IT opened 6 months ago
In the original ALOHA paper, BC-ConvMLP (also known as CNNMLP) is described as the simplest yet most widely used baseline [69, 26]. It processes the current image observations with a convolutional network, then concatenates the network's output features with the joint positions to predict the action.
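To make sure I am reading that description correctly, here is a minimal NumPy sketch of the data flow as I understand it: image → convolutional features → concatenate with joint positions → MLP → action. It is not the repository's CNNMLP implementation. The single conv layer, the global average pooling, the layer sizes, the 14-dimensional joint/action vectors, and the names `conv2d_relu`, `init_params`, and `conv_mlp_policy` are all my own assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)


def conv2d_relu(image, kernels):
    """Valid 2D convolution (stride 1) followed by ReLU.

    image:   (C, H, W)
    kernels: (F, C, k, k)
    returns: (F, H - k + 1, W - k + 1)
    """
    k = kernels.shape[-1]
    # windows has shape (C, H - k + 1, W - k + 1, k, k)
    windows = np.lib.stride_tricks.sliding_window_view(image, (k, k), axis=(1, 2))
    out = np.einsum("chwij,fcij->fhw", windows, kernels)
    return np.maximum(out, 0.0)


def init_params(in_channels=3, num_filters=8, kernel_size=3,
                qpos_dim=14, hidden_dim=32, action_dim=14):
    """Random weights; every size here is an illustrative assumption."""
    return {
        "kernels": rng.normal(0.0, 0.1, (num_filters, in_channels, kernel_size, kernel_size)),
        "w1": rng.normal(0.0, 0.1, (hidden_dim, num_filters + qpos_dim)),
        "b1": np.zeros(hidden_dim),
        "w2": rng.normal(0.0, 0.1, (action_dim, hidden_dim)),
        "b2": np.zeros(action_dim),
    }


def conv_mlp_policy(image, qpos, params):
    """Predict one action from the current image and joint positions."""
    feature_map = conv2d_relu(image, params["kernels"])
    # Global average pooling turns the feature map into a vector of length F.
    image_features = feature_map.mean(axis=(1, 2))
    # Concatenate visual features with the joint positions.
    x = np.concatenate([image_features, qpos])
    # Two-layer MLP head maps the combined vector to an action.
    hidden = np.maximum(params["w1"] @ x + params["b1"], 0.0)
    return params["w2"] @ hidden + params["b2"]


if __name__ == "__main__":
    params = init_params()
    image = rng.random((3, 32, 32))   # placeholder camera frame
    qpos = rng.random(14)             # placeholder joint positions
    action = conv_mlp_policy(image, qpos, params)
    print(action.shape)
```

In this sketch the policy only sees the current observation and outputs a single action, which matches the description above. A real setup would presumably use a deeper backbone and one feature extractor per camera.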
Thank you for sharing such a great project. I have some questions about the paper (https://mobile-aloha.github.io/resources/mobile-aloha.pdf) and would appreciate your help:
Thanks a lot.