Reinforcement Learning of Optimal Supervisor Based on Language Measure

Bibliographic Details
Published in: Shisutemu Seigyo Jouhou Gakkai Rombunshi (Transactions of the Institute of Systems, Control and Information Engineers), 2005/12/15, Vol. 18(12), pp. 433-439
Main Authors: TANIGUCHI, Kazutaka; YAMASAKI, Tatsushi; USHIO, Toshimitsu
Format: Article
Language: English; Japanese
Description
Summary: Recently, Wang and Ray introduced a signed measure for formal languages, called a language measure, to evaluate the performance of strings generated by discrete event systems, and synthesis methods for an optimal supervisor based on the language measure have been studied. Applying these methods requires exact information about the language measure of the controlled discrete event system, but from a practical point of view it is not always possible to know this information a priori. In such a situation, a learning-based approach is useful for obtaining an optimal supervisor with respect to the language measure. This paper considers a synthesis method for such an optimal supervisor. First, we clarify the relationship between the Bellman equation in reinforcement learning and the performance of the language generated by the controlled discrete event system. Next, using this relationship, we propose a learning method for the optimal supervisor in which the costs of disabling events are taken into consideration. Finally, computer simulations illustrate the effectiveness of the proposed method.
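The relationship between the language measure and the Bellman equation that the summary refers to can be sketched as follows. In Wang and Ray's formulation (recalled here from the language-measure literature, not from this paper's own notation), the measure vector $\mu$ of a finite-state automaton satisfies a linear fixed-point equation with the same shape as the policy-evaluation Bellman equation:

$$\mu = \chi + \Pi\,\mu \quad\Longrightarrow\quad \mu = (I - \Pi)^{-1}\chi,$$

where $\chi_i$ is the characteristic (reward-like) value of state $i$ and $\pi_{ij}$ aggregates the event costs of transitions from state $i$ to state $j$. A supervisor that disables events removes the corresponding terms from $\Pi$, so choosing which controllable events to disable in each state so as to maximize the measure is structurally a Markov decision problem; when $\Pi$ and $\chi$ are unknown, it can be attacked with Q-learning-style updates. The sketch below illustrates such a learning loop. It is a minimal, hypothetical illustration, not the paper's algorithm: the environment interface (reset/step), the reward signal standing in for the characteristic values, the per-event disabling cost, and all parameter values are assumptions made for the example.

import random
from itertools import chain, combinations

# Hypothetical Q-learning sketch for supervisor synthesis over a small
# discrete event system. The environment object `env` is an assumption:
# env.reset() -> initial state; env.step(state, disabled) -> (next_state,
# reward, done), where the plant fires one event that is still enabled
# under the current disabling pattern.

def control_patterns(controllable):
    # All subsets of controllable events that the supervisor may disable;
    # uncontrollable events can never be disabled.
    events = list(controllable)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(events, r) for r in range(len(events) + 1))]

def learn_supervisor(env, states, controllable, episodes=5000,
                     alpha=0.1, gamma=0.95, epsilon=0.1, disable_cost=0.05):
    patterns = control_patterns(controllable)
    Q = {(s, p): 0.0 for s in states for p in patterns}

    for _ in range(episodes):
        state, done = env.reset(), False
        while not done:
            # epsilon-greedy choice of a disabling pattern for this state
            if random.random() < epsilon:
                pattern = random.choice(patterns)
            else:
                pattern = max(patterns, key=lambda p: Q[(state, p)])

            next_state, reward, done = env.step(state, disabled=pattern)
            # charge a cost per disabled event, as in the summary's setting
            reward -= disable_cost * len(pattern)

            best_next = max(Q[(next_state, p)] for p in patterns)
            Q[(state, pattern)] += alpha * (
                reward + gamma * best_next - Q[(state, pattern)])
            state = next_state

    # greedy supervisor: the learned disabling pattern for each state
    return {s: max(patterns, key=lambda p: Q[(s, p)]) for s in states}

In this analogy the discount factor gamma plays the role that the contraction property of the event-cost matrix plays in the language measure: both guarantee that the fixed point exists, just as $(I - \Pi)^{-1}$ is well defined when the row sums of $\Pi$ are less than one.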
ISSN: 1342-5668; 2185-811X
DOI: 10.5687/iscie.18.433