Reinforcement learning with search algorithms

plamentotev commented 2 months ago

There are several state of the art algorithms that use search to improve the policy trained with RL(e.g. AlphaZero, Student Of Games). The current implementation of ML-Agents does not seem to support that. On other hand the architecture should be able to handle such algorithms pretty well. For example the training loop for AlphaZero consist of agents that generate trajectories used by a learner to update the policy.

Have you considered adding support for reinforcement learning with search algorithms? Or it is out of scope for the project?

github-actions[bot] commented 1 month ago

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] commented 3 weeks ago

This issue was closed because it has been inactive for 14 days since being marked as stale. Please open a new issue for related bugs.

Unity-Technologies / ml-agents

Reinforcement learning with search algorithms #6126