But even though this wrapper is used, I believe that the action space doesn't get updated to account it, this causes the action limits to end up being the original ones instead of [-1, 1] which could cause compatibility problems with certain implementations in some of the scenarios.
I was checking the code of the environment and noticed that an action wrapper is always used to normalize the actions, the code used for this is:
But even though this wrapper is used, I believe that the action space doesn't get updated to account it, this causes the action limits to end up being the original ones instead of [-1, 1] which could cause compatibility problems with certain implementations in some of the scenarios.