Closed lingxiao-guo closed 5 months ago
+1
+2
+1
Hi, I hope this message finds you well.
In Figure 2 of your paper, you illustrate the components of the policy network's input, which include a hybrid internal embedding derived from the robot's historical observations. I am intrigued by the distinction between the explicit velocity ( \hat{v}_t ) and the actual velocity ( v_t ) as depicted in the figure. The paper describes ( \hat{v}_t ) as part of the hybrid internal embedding optimized through contrastive learning to simulate the robot's response. However, the precise definition and role of ( v_t ) within the context of the proprioceptive information are not explicitly detailed in the text I have access to.
Could you please clarify the following:
Your insights would be invaluable for my understanding of the nuanced aspects of the HIM and its application in agile legged locomotion. I appreciate your time and expertise in elucidating these concepts.
Thank you very much.
Best regards
v_t is the target of \hat{v}_t, the hybrid internal model gives an estimation of v_t, which is \hat{v}_t. \hat{v}_t is used as a part of policy input to estimate disturbances.
Sorry for the late. Code is released.
Hi, I wonder how soon will you release the code, since this wonderful work has already accepted. Thanks!