(v3.1.3) - Sinergym ActionWrappers adaptation for SB3 algorithm

Description

This is a little fix for Sinergym's action wrappers in order to adapt it to the SB3 algorithms.

Motivation and Context

[ ] I have raised an issue to propose this change (required for new features and bug fixes)

Why is this change required? What problem does it solve? Please, reference issue or issues opened previously.

The current ActionWrappers work perfectly with Gymnasium. However, there was a bug with SB3 algorithms. The action Wrappers update the action before to send it to the next layer. SB3 copy the last layer action value instead the action used in the network input (which is translated in errors). To avoid this, the action parsers copy do a copy of the original action before to start to manipulate it.

Types of changes

[x] Bug fix (non-breaking change which fixes an issue)
[ ] New feature (non-breaking change which adds functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to change)
[ ] Documentation (update in the documentation)
[ ] Improvement (of an existing feature)
[ ] Others

Checklist:

[x] I've read the CONTRIBUTION guide (required)
[ ] My change requires a change to the documentation.
[ ] I have updated the tests.
[ ] I have updated the documentation accordingly.
[ ] I have reformatted the code using autopep8 second level aggressive.
[ ] I have reformatted the code using isort.
[ ] I have ensured cd docs && make spelling && make html pass (required if documentation has been updated.)
[ ] I have ensured pytest tests/ -vv pass. (required).
[ ] I have ensured pytype -d import-error sinergym/ pass. (required)

Changelog:

Fixed action wrapper for SB3: Now action tranformations are by value instead of by reference.
Fixed new pytype error due to the new version of requests and urllib3.

ugr-sail / sinergym