Collision-free path planning for welding manipulator via hybrid algorithm of deep reinforcement learning and inverse kinematics

Bibliographic Details
Published in:	Complex & Intelligent Systems 2022-06, Vol.8 (3), p.1899-1912
Main Authors:	Zhong, Jie; Wang, Tao; Cheng, Lianglun
Format:	Article
Language:	English
Subjects:	Algorithms; Collision avoidance; Comparative analysis; Complexity; Computational Intelligence; Configuration space path planning; Data Structures and Information Theory; Deep learning; Engineering; Inverse kinematics; Kinematics; Machine learning; Modules; Optimization; Original Article; Planning; Sampling; Welding
Online Access:	Full text

Description
Summary:	In real welding scenarios, an effective path planner is needed to find a collision-free path in configuration space for a welding manipulator surrounded by obstacles. However, the sampling-based planner, a state-of-the-art method, is only probabilistically complete, and its computational complexity is sensitive to the state dimension. In this paper, we propose a path planner for welding manipulators based on deep reinforcement learning that solves path planning problems in high-dimensional continuous state and action spaces. Compared with the sampling-based method, it is more robust and less sensitive to the state dimension. Specifically, to improve learning efficiency, we introduce an inverse kinematics module that provides prior knowledge and design a gain module that avoids locally optimal policies; both are integrated into the training algorithm. To evaluate the proposed planning algorithm along multiple dimensions, we conducted multiple sets of path planning experiments for welding manipulators. The results show that our method not only improves convergence performance but also is superior to most other planning algorithms in the optimality and robustness of planning.
ISSN:	2199-4536; 2198-6053
DOI:	10.1007/s40747-021-00366-1