Abstract
In this paper, we propose a reinforcement learning-based end-to-end learning method for the autonomous driving of a mobile robot in a dynamic environment with obstacles. Applying two additional techniques for reinforcement learning simultaneously helps the mobile robot in finding an optimal policy to reach the destination without collisions. First, the multifunctional reward-shaping technique guides the agent toward the goal by utilizing information about the destination and obstacles. Next, employing the hindsight experience replay technique to address the experience imbalance caused by the sparse reward problem assists the agent in finding the optimal policy. We validated the proposed technique in both simulation and real-world environments. To assess the effectiveness of the proposed method, we compared experiments for five different cases.
Original language | English |
---|---|
Article number | 51 |
Journal | Biomimetics |
Volume | 9 |
Issue number | 1 |
DOIs | |
State | Published - Jan 2024 |
Keywords
- autonomous driving
- deep deterministic policy gradient
- hindsight experience replay
- mobile robot
- multifunctional reward shaping