Learning and Reusing Quadruped Robot Movement Skills from Biological Dogs for Higher-Level Tasks

In the field of quadruped robots, the most classic motion control algorithm is based on model prediction control (MPC). However, this method poses challenges as it necessitates the precise construction of the robot's dynamics model, making it difficult to achieve agile movements similar to thos...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Sensors (Basel, Switzerland) Switzerland), 2023-12, Vol.24 (1), p.28
Hauptverfasser:	Wan, Qifeng, Luo, Aocheng, Meng, Yan, Zhang, Chong, Chi, Wanchao, Zhang, Shenghao, Liu, Yuzhen, Zhu, Qiuguo, Kong, Shihan, Yu, Junzhi
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Animal training Animals Biomimetics Design Dogs Learning Machine Learning Models, Biological Motion Motion capture motion imitation quadrupedal robots reinforcement learning Robotics Robotics industry Robots Teaching methods variational autoencoder
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In the field of quadruped robots, the most classic motion control algorithm is based on model prediction control (MPC). However, this method poses challenges as it necessitates the precise construction of the robot's dynamics model, making it difficult to achieve agile movements similar to those of a biological dog. Due to these limitations, researchers are increasingly turning to model-free learning methods, which significantly reduce the difficulty of modeling and engineering debugging and simultaneously reduce real-time optimization computational burden. Inspired by the growth process of humans and animals, from learning to walk to fluent movements, this article proposes a hierarchical reinforcement learning framework for the motion controller to learn some higher-level tasks. First, some basic motion skills can be learned from motion data captured from a biological dog. Then, with these learned basic motion skills as a foundation, the quadruped robot can focus on learning higher-level tasks without starting from low-level kinematics, which saves redundant training time. By utilizing domain randomization techniques during the training process, the trained policy function can be directly transferred to a physical robot without modification, and the resulting controller can perform more biomimetic movements. By implementing the method proposed in this article, the agility and adaptability of the quadruped robot can be maximally utilized to achieve efficient operations in complex terrains.
ISSN:	1424-8220 1424-8220
DOI:	10.3390/s24010028