Learning action selection in autonomous agents


Bibliographic details
Published in: 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236), 2001, Vol. 5, p. 3391-3396
Main authors: Ramachandran, S., Bree, D.S.
Format: Article
Language: English
Description
Abstract: This paper focuses on learning in autonomous agents in dynamic environments. Autonomous agent control has been dominated by two major Artificial Intelligence (AI) approaches, Planning-based and Behavior-based. We concentrate on Behavior-based control. The multiple behavior modules that constitute a Behavior-based agent may have conflicting outputs (actions) for various input situations. It is becoming increasingly difficult to 'hard-wire' or 'pre-fix' the actions. "Learning" to select an appropriate action emerges as a strong alternative to hard-wired schemes. A Behavior-based agent is constructed for the research study. The application task chosen for the agent is to learn to navigate in a real indoor environment while avoiding static and moving obstacles. The strategy is as follows: learning to avoid obstacles is to be achieved by learning to select an appropriate action in any input situation. The learning algorithm, based on reinforcement learning principles, chooses, with high probability, an appropriate action based on the performance statistics (activations and reinforcements) of the conflicting behaviors. Learning experiments conducted to observe the performance of the agent were encouraging.
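
The abstract does not give the selection rule itself, so the following is only a minimal sketch of the general idea: keep per-behavior statistics (activations and accumulated reinforcement) and pick, with high probability, the behavior whose record looks best. The epsilon-greedy rule, the class name BehaviorStats, the functions select_behavior and update, and the explore parameter are all assumptions made for illustration and are not taken from the paper.

```python
import random


class BehaviorStats:
    """Hypothetical per-behavior record: activation count and summed reinforcement."""

    def __init__(self, name):
        self.name = name
        self.activations = 0
        self.total_reinforcement = 0.0

    def mean_reinforcement(self):
        # A behavior that has never been activated is scored as neutral (0.0).
        if self.activations == 0:
            return 0.0
        return self.total_reinforcement / self.activations


def select_behavior(stats, rng, explore=0.1):
    """Epsilon-greedy choice (an assumed rule, not the paper's):
    with probability 1 - explore pick the behavior with the best mean
    reinforcement, otherwise pick one uniformly at random."""
    if rng.random() < explore:
        return rng.choice(stats)
    return max(stats, key=lambda s: s.mean_reinforcement())


def update(behavior, reward):
    """Record one activation of the chosen behavior and the reinforcement received."""
    behavior.activations += 1
    behavior.total_reinforcement += reward
```

In use, the agent would call select_behavior whenever behaviors conflict, execute the winner's action, and pass the resulting reinforcement (for example, a negative value after a collision) to update. A small explore value keeps the choice "high probability" rather than deterministic, which lets the statistics of rarely chosen behaviors keep improving.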
ISSN: 1062-922X, 2577-1655
DOI: 10.1109/ICSMC.2001.972043