BasePlayer modification

Example:

config:
  env_config:
     ... train parameters ... 
  player:
      env_config:
         ... play parameters ...

Test plan:

Train models for two different scenarios python runner.py --train --file rl_games/configs/brax/ppo_ant.yaml python runner.py --train --file rl_games/configs/ppo_cartpole.yaml
Run model with & without proposed change. (No crash, similar results)
Benchmark change (brax/ant)

Denys88 / rl_games