Add mlagents-learn system architecture documentation

Is your feature request related to a problem? Please describe. I've been thinking about how I can go about modifying existing RL implementations or implementing new RL implementations within ml-agents. I started looking through the python codebase and quickly found myself getting quite lost within the layers of abstraction and the numerous modules. It took me a while to see how the CLI arguments are being translated down the software hierarchy to generate the pytorch networks.

Describe the solution you'd like It would be great to have a system architecture diagram describing the component interactions or a contributor's quickstart/guide so that interested people can figure out how to implement or modify the algorithms. This also means that the community can help add/debug algorithm implementations to ml-agents.

Describe alternatives you've considered One alternative is for researchers and ML engineers to build the algorithm and training system from scratch with the Python UnityEnvironment API. This would mean more granular control and visibility over the RL algorithm. However it would mean not being able to make use of all the great work that has already been done by the ml-agents team in building the ml-agents software.

Many thanks for the great work being done w ml-agents!

Unity-Technologies / ml-agents

Add mlagents-learn system architecture documentation #5447