A Transferability Metric Using Scene Similarity and Local Map Observation for DRL Navigation

Bibliographic details
Published in:	IEEE/ASME Transactions on Mechatronics, 2024-12, Vol. 29 (6), p. 4423-4433
Main authors:	Lian, Shiwei; Zhang, Feitian
Format:	Article
Language:	English
Subjects:	Algorithms; Autonomous navigation; deep reinforcement learning (DRL); Laser radar; local map; Measurement; Navigation; Performance evaluation; Robot sensing systems; Robots; Robustness; Safety measures; scene similarity; Similarity; Spatial data; Template matching; Training

Description
Abstract:	While deep reinforcement learning (DRL) has attracted rapidly growing interest as a solution to navigation without global maps, DRL typically yields mediocre navigation performance in practice because of the gap between the training scene and the actual test scene. To quantify the transferability of a DRL agent between the training and test scenes, this article proposes a new transferability metric: the scene similarity, calculated using an improved image template matching algorithm. Specifically, two transferability performance indicators are designed: the global scene similarity, which evaluates the overall robustness of a DRL algorithm, and the local scene similarity, which serves as a safety measure when a DRL agent is deployed without a global map. In addition, this article proposes using a local map that fuses 2-D LiDAR data with spatial information about both the agent and the destination as the DRL observation, aiming to improve the transferability of DRL navigation algorithms. With a wheeled robot as the case study platform, both simulation and real-world experiments are conducted in a total of 26 different scenes. The experimental results affirm the robustness of the local map observation design and demonstrate a strong correlation between the scene similarity metric and the success rate of DRL navigation algorithms.
ISSN:	1083-4435 1941-014X
DOI:	10.1109/TMECH.2024.3376542
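
Illustrative note: the abstract describes scene similarity as the output of an improved image template matching algorithm, but this record gives no details of that algorithm. The sketch below is therefore not the authors' method. It shows only plain zero-mean normalized cross-correlation, a standard template matching score, applied to small occupancy grids. The function names ncc and best_match_score, the grid sizes, and the obstacle layout are all assumptions made for illustration.

```python
import numpy as np


def ncc(patch, template):
    """Zero-mean normalized cross-correlation of two equal-shape arrays.

    Returns a value in [-1, 1]; returns 0.0 if either array is constant,
    since the correlation is undefined in that case.
    """
    p = patch - patch.mean()
    t = template - template.mean()
    denom = np.sqrt((p * p).sum() * (t * t).sum())
    if denom == 0:
        return 0.0
    return float((p * t).sum() / denom)


def best_match_score(scene, template):
    """Slide the template over the scene and return the highest NCC score."""
    scene_h, scene_w = scene.shape
    tmpl_h, tmpl_w = template.shape
    best = -1.0
    for r in range(scene_h - tmpl_h + 1):
        for c in range(scene_w - tmpl_w + 1):
            window = scene[r:r + tmpl_h, c:c + tmpl_w]
            best = max(best, ncc(window, template))
    return best


# Hypothetical training scene: an 8x8 occupancy grid with one rectangular obstacle.
train_scene = np.zeros((8, 8))
train_scene[2:4, 3:6] = 1.0

# Hypothetical local map, here cropped from the training scene itself,
# so the best score should be close to 1.0.
local_map = train_scene[1:5, 2:7].copy()
score = best_match_score(train_scene, local_map)
```

A score near 1 means the local map closely resembles some region of the training scene, and a lower score means a weaker resemblance. How the article turns such scores into its global and local scene similarity indicators is not stated in this record.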