Chain Form Reinforcement Learning for Small-Memory Agent
In this paper, we propose Chain Form Reinforcement Learning for a reinforcement learning agent that has small memory. In the real world, learning is difficult because there are an infinite number of states and actions that need a large number of stored memories and learning times. To solve a problem...
Gespeichert in:
Veröffentlicht in: | Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 2012/04/15, Vol.24(2), pp.691-696 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng ; jpn |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In this paper, we propose Chain Form Reinforcement Learning for a reinforcement learning agent that has small memory. In the real world, learning is difficult because there are an infinite number of states and actions that need a large number of stored memories and learning times. To solve a problem, estimated values are categorized as “GOOD” or “NO GOOD” in the reinforcement learning process. Additionally, the alignment sequence of estimated values is changed as they are regarded as an important sequence themselves. We conducted some simulations and observed the influence of our methods. Several simulation results show no bad influence on learning speed. |
---|---|
ISSN: | 1347-7986 1881-7203 |
DOI: | 10.3156/jsoft.24.691 |