Enhancing Robotic Navigation: An Evaluation of Single and Multi-Objective Reinforcement Learning Strategies
Format: Article
Language: English
Abstract: This study presents a comparative analysis of single-objective and multi-objective reinforcement learning methods for training a robot to navigate to an end goal while avoiding obstacles. Traditional reinforcement learning techniques, namely Deep Q-Network (DQN), Deep Deterministic Policy Gradient (DDPG), and Twin Delayed DDPG (TD3), are evaluated in the Gazebo simulation framework across a variety of environments with randomized goal and robot starting locations. These methods provide a single numerical reward to the robot, indicating the quality of an action in relation to the goal. However, their limitations become apparent in complex settings where multiple, potentially conflicting, objectives are present. To address these limitations, we propose an approach employing Multi-Objective Reinforcement Learning (MORL). By modifying the reward function to return a vector of rewards, each pertaining to a distinct objective, the robot learns a policy that balances the different goals, aiming to achieve a Pareto-optimal solution. This comparative study highlights the potential of MORL for complex, dynamic robotic navigation tasks, setting the stage for future investigations into more adaptable and robust robotic behaviors.
DOI: 10.48550/arxiv.2312.07953
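
To make the abstract's central contrast concrete (a scalar reward for DQN/DDPG/TD3 versus a vector-valued MORL reward), the following Python sketch illustrates both reward styles for a goal-plus-obstacle navigation task. All function names, thresholds, and weights here are illustrative assumptions for this record, not the authors' implementation.

```python
import numpy as np

# Illustrative sketch only: the reward shaping terms, the 0.5 m safety
# threshold, and the scalarization weights are assumptions, not taken
# from the paper.

def scalar_reward(dist_to_goal, min_obstacle_dist, reached_goal, collided):
    """Single-objective reward (DQN/DDPG/TD3 style): one number that
    mixes goal progress and obstacle avoidance into a fixed trade-off."""
    if reached_goal:
        return 100.0
    if collided:
        return -100.0
    # Dense shaping: closer to the goal is better, closer to obstacles is worse.
    return -dist_to_goal - max(0.0, 0.5 - min_obstacle_dist) * 10.0

def vector_reward(dist_to_goal, min_obstacle_dist, reached_goal, collided):
    """Multi-objective (MORL) reward: one component per objective.
    Index 0: reach the goal; index 1: avoid obstacles."""
    goal_term = 100.0 if reached_goal else -dist_to_goal
    safety_term = -100.0 if collided else -max(0.0, 0.5 - min_obstacle_dist) * 10.0
    return np.array([goal_term, safety_term])

def scalarize(reward_vec, weights=np.array([0.7, 0.3])):
    """Linear scalarization of the vector reward; sweeping the weights
    traces candidate Pareto-optimal trade-offs between the objectives."""
    return float(weights @ reward_vec)

if __name__ == "__main__":
    # A state 2.5 m from the goal, 0.3 m from the nearest obstacle.
    print("scalar:", scalar_reward(2.5, 0.3, False, False))      # -4.5
    r_vec = vector_reward(2.5, 0.3, False, False)
    print("vector:", r_vec)                                      # [-2.5 -2.0]
    print("scalarized:", scalarize(r_vec))                       # -2.35
```

The scalar variant bakes one fixed trade-off into a single number, whereas the vector variant keeps the objectives separate so a MORL learner can balance them explicitly, which is the distinction the abstract draws.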