rishistyping / AWS_JPL_OSR_DRL

Build and train a reinforcement learning (RL) model on AWS to autonomously drive JPL’s Open-Source Rover between given locations in a simulated Mars environment with the least amount of energy consumption and risk of damage.
https://spacechallenge.tech/
Apache License 2.0
12 stars 6 forks source link

Reinforcement Learning for Robot Obstacle Avoidance #3

Open Gizmotronn opened 4 years ago

Gizmotronn commented 4 years ago

This issue is for us to document how we will use reinforcement learning to train our rl-agent to avoid obstacles. The goal of this issue is:

A useful link to get started: https://pdfs.semanticscholar.org/0fcd/a4e464c9d55ccd9f8e8e3521c286e4b47933.pdf

Gizmotronn commented 4 years ago

SemanticScholar - RL-agent Obstacle Avoidance

*Abstract**

Reinforcement Learning

Gizmotronn commented 4 years ago

https://papers.nips.cc/paper/452-obstacle-avoidance-through-reinforcement-learning.pdf

Gizmotronn commented 4 years ago

These resources may be useful

Gizmotronn commented 4 years ago

SemanticScholar - RL-agent Obstacle Avoidance

Abstract*

Reinforcement Learning

  • Reinforcement Learning is learning how to map environment situations to actions, with the goal of maximising a reward signal/value
  • It is a computational approach to learn from interaction. Learning from interaction is a foundational idea in almost all learning methods
  • The agent must learn from its own experience(s)
  • Exploration vs exploitation:

    • The agent must take actions that give a higher reward score (on the reward function) to get the best accumulative rewards
    • However, to find the best actions/choices in certain situations, the agent needs to try actions that it has not selected before
    • The agent has no idea what the reward will be unless it takes the action (otherwise the agent would be able to finish the program on the first try, every time)
    • The agent therefore has to exploit the best known actions to obtain rewards, while also exploring unknown options (to either increase its reward or to get further)

Experiments

Gizmotronn commented 4 years ago

Might also want to have a look at these links:

Gizmotronn commented 4 years ago

Arxiv.org - Unmanned Aerial Vehicles

https://arxiv.org/pdf/1811.03307.pdf (or above comment)

Part 1

Introduction

Gizmotronn commented 4 years ago

https://research.google/pubs/pub48418/

Gizmotronn commented 4 years ago

Google Research - Comparison of DRL Policies for Moving Obstacle Avoidance

Abstract

Gizmotronn commented 4 years ago

More resources: