Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay

Minjae Park, Chaneun Park, Nam Kyu Kwon

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

In this paper, we propose a reinforcement learning-based end-to-end learning method for the autonomous driving of a mobile robot in a dynamic environment with obstacles. Applying two additional techniques for reinforcement learning simultaneously helps the mobile robot in finding an optimal policy to reach the destination without collisions. First, the multifunctional reward-shaping technique guides the agent toward the goal by utilizing information about the destination and obstacles. Next, employing the hindsight experience replay technique to address the experience imbalance caused by the sparse reward problem assists the agent in finding the optimal policy. We validated the proposed technique in both simulation and real-world environments. To assess the effectiveness of the proposed method, we compared experiments for five different cases.

Original languageEnglish
Article number51
JournalBiomimetics
Volume9
Issue number1
DOIs
StatePublished - Jan 2024

Keywords

  • autonomous driving
  • deep deterministic policy gradient
  • hindsight experience replay
  • mobile robot
  • multifunctional reward shaping

Fingerprint

Dive into the research topics of 'Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay'. Together they form a unique fingerprint.

Cite this