handria-ntoanina / unity-ml-tennis

About solving the tennis environment from Unity using a Multi Agent DDPG and a Multi Agent PPO
8 stars 3 forks source link

unity-ml-reacher

This repository contains an implementation of deep reinforcement learning based on:

The environment to be solved is having two agents playing tennis. Each agent is conducting a racket to bounce a ball over a net. If an agent hits the ball over the net, it receives a reward of +0.1. If an agent lets a ball hit the ground or hits the ball out of bounds, it receives a reward of -0.01. Thus, the goal of each agent is to keep the ball in play. This environment is similar to the tennis of Unity.
The action space is continuous [-1.0, +1.0] and consists of 2 values for horizontal and jumping moves.
The environment is considered as solved if the average score of one gent is >= 0.5 for 100 consecutive episodes.
Video

A video of trained agents can be found here below

Requirements

To run the codes, follow the next steps: