Deep MARL-Based Resilient Motion Planning for Decentralized Space Manipulator

Space manipulators play an important role in the on-orbit services and planetary surface operation. In the extreme environment of space, space manipulators are susceptible to a variety of unknown disturbances. How to have a resilient guarantee in failure or disturbance is the core capability of its...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Space: science & technology 2024-01, Vol.4
Hauptverfasser:	Zhang, Jiawei, Bai, Chengchao, Yue, C. Patrick, Guo, Jifeng
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Space manipulators play an important role in the on-orbit services and planetary surface operation. In the extreme environment of space, space manipulators are susceptible to a variety of unknown disturbances. How to have a resilient guarantee in failure or disturbance is the core capability of its future development. Compared with traditional motion planning, learning-based motion planning has gradually become a hot spot in current research. However, no matter what kind of research ideas, the single robotic manipulator is studied as an independent agent, making it unable to provide sufficient flexibility under conditions such as external force disturbance, observation noise, and mechanical failure. Therefore, this paper puts forward the idea of “discretization of the traditional single manipulator”. Different discretization forms are given through the analysis of the multi-degree-of-freedom single-manipulator joint relationship, and a single-manipulator representation composed of multiple new subagents is obtained. Simultaneously, to verify the ability of the new multiagent representation to deal with interference, we adopted a centralized multiagent reinforcement learning framework. The influence of the number of agents and communication distances on learning-based planning results is analyzed in detail. In addition, by imposing joint locking failures on the manipulator and introducing observation and action interference, it is verified that the “multiagent robotic manipulator” obtained after discretization has stronger antidisturbance resilient capability than the traditional single manipulator.
ISSN:	2692-7659 2692-7659
DOI:	10.34133/space.0145