Chain Form Reinforcement Learning for Small-Memory Agent

In this paper, we propose Chain Form Reinforcement Learning for a reinforcement learning agent that has small memory. In the real world, learning is difficult because there are an infinite number of states and actions that need a large number of stored memories and learning times. To solve a problem...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 2012/04/15, Vol.24(2), pp.691-696
Hauptverfasser: NOTSU, Akira, KOMORI, Yuki, HONDA, Katsuhiro, ICHIHASHI, Hidetomo, IWAMOTO, Yuki
Format: Artikel
Sprache:eng ; jpn
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper, we propose Chain Form Reinforcement Learning for a reinforcement learning agent that has small memory. In the real world, learning is difficult because there are an infinite number of states and actions that need a large number of stored memories and learning times. To solve a problem, estimated values are categorized as “GOOD” or “NO GOOD” in the reinforcement learning process. Additionally, the alignment sequence of estimated values is changed as they are regarded as an important sequence themselves. We conducted some simulations and observed the influence of our methods. Several simulation results show no bad influence on learning speed.
ISSN:1347-7986
1881-7203
DOI:10.3156/jsoft.24.691