leggedrobotics / rsl_rl

Fast and simple implementation of RL algorithms, designed to run fully on GPU.
Other
505 stars 156 forks source link

Fix mask for RNN hidden states reset #28

Open kaixi287 opened 2 months ago

kaixi287 commented 2 months ago

This PR updates the reset function to convert the dones mask to a boolean mask before applying it to reset hidden states. Previously, the function used a 0/1 mask directly, which selected the first and second set of indices in the last but one dimension, rather than resetting the hidden states at the desired positions indicated by 1 in the mask. By converting the dones mask to a boolean mask, we ensure that the hidden states are correctly reset at the specified positions.