Gradient descent methods in learning classifier systems: improving XCS performance in multistep problems

The accuracy-based XCS classifier system has been shown to solve typical data mining problems in a machine-learning competitive way. However, successful applications in multistep problems, modeled by a Markov decision process, were restricted to very small problems. Until now, the temporal differenc...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on evolutionary computation 2005-10, Vol.9 (5), p.452-473
Hauptverfasser:	Butz, M.V., Goldberg, D.E., Lanzi, P.L.
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Computer science control theory systems Data mining Data processing. List processing. Character string processing Evolutionary computation Exact sciences and technology Function approximation Genetic algorithms Genetic engineering Government gradient descent Laboratories Learning and adaptive systems learning classifier systems (LCSs) Learning systems Memory organisation. Data processing multistop problems Q-learning reinforcement learning Robustness Software XCS Zero current switching
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The accuracy-based XCS classifier system has been shown to solve typical data mining problems in a machine-learning competitive way. However, successful applications in multistep problems, modeled by a Markov decision process, were restricted to very small problems. Until now, the temporal difference learning technique in XCS was based on deterministic updates. However, since a prediction is actually generated by a set of rules in XCS and Learning Classifier Systems in general, gradient-based update methods are applicable. The extension of XCS to gradient-based update methods results in a classifier system that is more robust and more parameter independent, solving large and difficult maze problems reliably. Additionally, the extension to gradient methods highlights the relation of XCS to other function approximation methods in reinforcement learning.
ISSN:	1089-778X 1941-0026
DOI:	10.1109/TEVC.2005.850265