Generating your own actuator network training data

Improbable-AI / walk-these-ways

Sim-to-real RL training and deployment tools for the Unitree Go1 robot.

Other

492 stars 129 forks source link

Hi @Vassil17 ,

I generated the training data for the actuator network by the following procedure:

Train a policy without actuator network
Deploy the policy and record the joint position and velocity command + the measured joint position, velocity, and torque at various walking speeds and gaits. The "L2" button should start and stop logging during deployment (https://github.com/Improbable-AI/walk-these-ways/blob/master/go1_gym_deploy/utils/deployment_runner.py#L168)
Train the actuator network on this data

Re: your question

From the scripts it seems that you only need joint position and velocity measurements, torques measurements (tau_est?), desired joint positions and torques - is the last one the computed torques based on the PD law?

Yes, the last one is the computed torques based on PD law. They aren't used to train the actuator network, only to visualize the difference between the learned model and ideal model

-Gabe

Improbable-AI / walk-these-ways

Generating your own actuator network training data #30