Closed Jaeyoung-Lim closed 5 years ago
This PR implements states, rewards in the environment so that it can be used in the gym framework.
I tested this PR that it outputs rewards every time step and increasing as the quad is close to the target.
This PR implements states, rewards in the environment so that it can be used in the gym framework.