Caterpillar Robot Locomotion Based on Reinforcement Learning Using Subjective Reward

This note presents an application of reinforcement learning to caterpillar robot locomotion. An excellent advantage of reinforcement learning is that an action can be acquired using only a simple reward. In our previous work, the reward was a forward distance measured using a sensor. This reward was...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:TRANSACTIONS OF THE JAPAN SOCIETY OF MECHANICAL ENGINEERS Series C 2013, Vol.79(798), pp.366-370
Hauptverfasser: YAMASHINA, Ryota, KURODA, Masafumi, YABUTA, Tetsuro
Format: Artikel
Sprache:eng ; jpn
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This note presents an application of reinforcement learning to caterpillar robot locomotion. An excellent advantage of reinforcement learning is that an action can be acquired using only a simple reward. In our previous work, the reward was a forward distance measured using a sensor. This reward was completely an “Objective Reward.” On the other hand, this study uses the reward given by the human's subjective judgment, which is defined as “Subjective Reward.” The main purpose of this study is to compare its performance between the “Objective Reward” obtained from the sensor and the “Subjective Reward” given by a human teacher. The results show that the “Subjective Reward” can give better results than that of the “Objective Reward”, because the “Subjective Reward” has more information than the “Objective Reward”. On the other hand, this note discusses the good teacher who gives an excellent “Subjective Reward”
ISSN:0387-5024
1884-8354
DOI:10.1299/kikaic.79.366