A Proposal of Predictive Reinforcement Learning Realizing Moving Obstacle Avoidance

In recent years, researches on autonomous robots in real life have developed. Especially, moving obstacle avoidance is one of the most important tasks for robots. Reinforcement learning is a typical method of action acquisitions of autonomous mobile robots for obstacle avoidance. However, it has bee...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu Information and Systems, 2009/06/01, Vol.129(6), pp.1115-1122
Hauptverfasser:	Takeda, Masato, Nagao, Tomoharu
Format:	Artikel
Sprache:	eng
Schlagworte:	moving obstacle avoidance prediction reinforcement learning
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In recent years, researches on autonomous robots in real life have developed. Especially, moving obstacle avoidance is one of the most important tasks for robots. Reinforcement learning is a typical method of action acquisitions of autonomous mobile robots for obstacle avoidance. However, it has been indicated that reinforcement learning has various problems in unknown environment. In order to solve these problems, we propose predictive reinforcement learning for moving obstacle avoidance. In predictive reinforcement learning, our rules are not defined as a pair of actions and states like conventional reinforcement learning. The rules are defined as the transition of the states by robot action between steps. We think that proposed rules enable robots to adapt to unknown environment because these rules are independent from any environment where moving obstacles exist. The robots implemented these rules predict the next state. After this prediction, the robots reinforce its rules by comparing observed states with predicted ones and foresee collisions on obstacles. Then the robots select safer actions. In this paper, we verify the efficiency of our method in several simulations. First, the robot is trained in learning environment where moving obstacles exist. After that, we experiment to verify the ability of adaptation to unknown environments. As a result, the robot acquires moving obstacle avoidance actions.
ISSN:	0385-4221 1348-8155
DOI:	10.1541/ieejeiss.129.1115