Control Policy with Autocorrelated Noise in Reinforcement Learning for Robotics

Direct application of reinforcement learning in robotics rises the issue of discontinuity of control signal. Consecutive actions are selected independently on random, which often makes them excessively far from one another. Such control is hardly ever appropriate in robots, it may even lead to their...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of machine learning and computing 2015-04, Vol.5 (2), p.91-95
1. Verfasser: Wawrzynski, Pawel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Direct application of reinforcement learning in robotics rises the issue of discontinuity of control signal. Consecutive actions are selected independently on random, which often makes them excessively far from one another. Such control is hardly ever appropriate in robots, it may even lead to their destruction. This paper considers a control policy in which consecutive actions are modified by autocorrelated noise. That policy generally solves the aforementioned problems and it is readily applicable in robots. In the experimental study it is applied to three robotic learning control tasks: Cart-Pole SwingUp, Half-Cheetah, and a walking humanoid.
ISSN:2010-3700
2010-3700
DOI:10.7763/IJMLC.2015.V5.489