Batou1406 / dls_orbit_bat_private

Unified framework for robot learning built on NVIDIA Isaac Sim
https://isaac-orbit.github.io/orbit/
Other
1 stars 0 forks source link

Define explicity the Observation space #27

Closed Batou1406 closed 1 week ago

Batou1406 commented 2 weeks ago

Define explicitly the Observation space

Context

Now, some history is provided to the observation.space (ie. input of the network). it is queried with the default function mdp.last_action.

This means, the acton.space is always into the observation.space.

Problem

If we change the type of network and the type of action.space. For example :

This will then change the observation.space, due to the mdp.last_action being into the observation.space

Solution

Instead of putting blindly, the network output into the observation.space, one could put the actual output applied to the system into the observation space (eventually with an history size > 1). However, the actual output is into some kind of specific frame (in this case world frame) that may not be relevant for the network. From the output of the network a Normalisation $N(x)$ and a Transformation $T(x)$ are applied before having it into relavant space for the low level controller. Inverse transformation and normalisation should be applied to the actual output before beeing feed into the network observation.space.

Batou1406 commented 1 week ago

Inverse Transformation and inverse Normalisation implemented and tested

Batou1406 commented 1 week ago

Observation space is now fixed no matter the dimension. Tested and it's working. Closing the task for now