Adaptive dynamic programming approach to experience-based systems identification and control

Bibliographic details
Published in:	Neural Networks, 2009-07, Vol. 22 (5), pp. 822-832
Main author:	Lendaris, George G.
Format:	Article
Language:	English
Subjects:	Aircraft; Algorithms; Applied sciences; Approximate Dynamic Programming (ADP); Artificial Intelligence; Automobile Driving; Computer science, control theory, systems; Connectionism. Neural networks; Context discernment; Control system analysis; Control theory. Systems; Detection, estimation, filtering, equalization, prediction; Exact sciences and technology; Experience-based identification and control; Humans; Information, signal and communications theory; Learning; Learning and adaptive systems; Neural networks; Neural Networks (Computer); Optimal control; Reinforcement (Psychology); Robotics; Signal and communications theory; Signal, noise; Software; Systems identification; Telecommunications and information theory
Online access:	Full text

Description
Abstract:	Humans have the ability to make use of experience while selecting their control actions for distinct and changing situations, and their process speeds up and becomes more effective as more experience is gained. In contrast, current technological implementations slow down as more knowledge is stored. A novel way of employing Approximate (or Adaptive) Dynamic Programming (ADP) is described that shifts the underlying Adaptive Critic type of Reinforcement Learning method “up a level”, away from designing individual (optimal) controllers to developing on-line algorithms that efficiently and effectively select designs from a repository of existing controller solutions (perhaps previously developed via application of ADP methods). The resulting approach is called the Higher-Level Learning Algorithm. The approach and its rationale are described, and some examples of its application are given. The notions of context and context discernment are important to understanding the human abilities noted above. These are first defined in a manner appropriate to controls and system identification. Then, as a foundation relating to the application arena, a historical view of the phases in the development of the controls field is given, organized by how the notion of ‘context’ was, or was not, involved in each phase.
ISSN:	0893-6080 1879-2782
DOI:	10.1016/j.neunet.2009.06.021