HMA-SAR: Multi-Agent Search and Rescue for Unknown Located Dynamic Targets in Completely Unknown Environments

Multi-Agent Search and Rescue (MASAR) tasks, challenged by unknown environments and the unpredictable movements of unknown dynamic targets, suffer from inefficiencies in traditional map coverage techniques which require repeated sweeps. Addressing this, our study introduces a novel MASAR framework b...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE robotics and automation letters 2024-06, Vol.9 (6), p.5567-5574
Hauptverfasser: Cao, Xiao, Li, Mingyang, Tao, Yuting, Lu, Peng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Multi-Agent Search and Rescue (MASAR) tasks, challenged by unknown environments and the unpredictable movements of unknown dynamic targets, suffer from inefficiencies in traditional map coverage techniques which require repeated sweeps. Addressing this, our study introduces a novel MASAR framework based on Multi-Agent Reinforcement Learning (MARL), featuring innovative elements like state, reward, and network structure design, alongside a Heterogeneous Curriculum Training algorithm and a hybrid decision mechanism. These components collectively enhance performance in dynamic environments, improve model generalization, and mitigate issues like sparse rewards and policy bias. In grid map simulations, our approach, HMA-SAR (Heterogeneous Multi-Agent Search and Rescue Framework), demonstrated consistent superiority over the traditional frontier-based method and other MARL algorithms, in metrics such as success rate, steps count, and the number of targets fetched. The practical applicability of our approach was further validated through experiments in Gazebo and real-world scenarios. Additionally, scalability tests in grid maps revealed substantial improvements in success rates and task completion times with increased agent deployment.
ISSN:2377-3766
2377-3766
DOI:10.1109/LRA.2024.3396097