A robot demonstration method based on LWR and Q-learning algorithm

Bibliographic details
Published in: Journal of intelligent & fuzzy systems, 2018-01, Vol. 35 (1), pp. 35-46
Main authors: Zhao, Guangzhe; Tao, Yong; Liu, Hui; Deng, Xianling; Chen, Youdong; Xiong, Hegen; Xie, Xianwu; Fang, Zengliang
Format: Article
Language: English
Online access: Full text
Description
Summary: A robot demonstration method is proposed that combines locally weighted regression (LWR) with the Q-learning algorithm. It is applied to a 6-DOF ball-hitting system. The method adapts to the work task by learning from demonstration and generating new actions. The LWR algorithm establishes the mapping between target values and actions. Based on the deviation of the landing position, a Q-learning algorithm adjusts the manipulator parameters and compensates for errors caused by the model and the controller. The LWR model fits a small local space to approximate the global state and decision space, which reduces the dimension and simplifies the training of Q-learning. As a result, the convergence rate is enhanced and the precision of task execution is improved. Simulation and experiment demonstrate the applicability of the proposed method.
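Illustrative note: the two building blocks named in the summary can be sketched in a few lines of Python. This is not the authors' implementation. It assumes a one-dimensional input (for example, a target landing distance mapped to a single action parameter), a Gaussian kernel with a hypothetical bandwidth `tau`, and a small tabular Q-function whose state and action labels are invented for the example.

```python
import math


def lwr_predict(xs, ys, x_query, tau=0.5):
    """Locally weighted linear regression for 1-D inputs (illustrative sketch)."""
    # Gaussian kernel: demonstrations close to the query get larger weights.
    ws = [math.exp(-((x - x_query) ** 2) / (2.0 * tau ** 2)) for x in xs]
    sw = sum(ws)
    sx = sum(w * x for w, x in zip(ws, xs))
    sy = sum(w * y for w, y in zip(ws, ys))
    sxx = sum(w * x * x for w, x in zip(ws, xs))
    sxy = sum(w * x * y for w, x, y in zip(ws, xs, ys))
    denom = sw * sxx - sx * sx
    if abs(denom) < 1e-12:
        # Degenerate case (e.g. a single distinct x): fall back to the weighted mean.
        return sy / sw
    # Closed-form weighted least squares for the local line y = intercept + slope * x.
    slope = (sw * sxy - sx * sy) / denom
    intercept = (sy - slope * sx) / sw
    return intercept + slope * x_query


def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """One tabular Q-learning update; q maps (state, action) pairs to values."""
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


# Hypothetical demonstrations: target value -> action parameter.
targets = [1.0, 2.0, 3.0, 4.0]
params = [3.0, 5.0, 7.0, 9.0]
initial_action = lwr_predict(targets, params, 2.5)

# Hypothetical correction step: the ball landed short, so "increase" is rewarded.
q_table = {}
new_value = q_update(q_table, "short", "increase", 1.0, "on_target",
                     ["increase", "decrease"])
```

In this sketch LWR supplies an initial action from nearby demonstrations, and the Q-learning update then learns which discrete correction to apply based on the observed deviation. The actual state, action, and reward definitions used in the article are not given in this record.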
ISSN: 1064-1246, 1875-8967
DOI: 10.3233/JIFS-169564