Mean Field LQG Control in Leader-Follower Stochastic Multi-Agent Systems: Likelihood Ratio Based Adaptation

We study large population leader-follower stochastic multi-agent systems where the agents have linear stochastic dynamics and are coupled via their quadratic cost functions. The cost of each leader is based on a trade-off between moving toward a certain reference trajectory which is unknown to the f...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on automatic control 2012-11, Vol.57 (11), p.2801-2816
Hauptverfasser:	Nourian, M., Caines, P. E., Malhame, R. P., Minyi Huang
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptation models Adaptive control Agents (artificial intelligence) Applied sciences Centroids Computer science control theory systems Control system synthesis Control systems Control theory. Systems Economic models Exact sciences and technology Followers Game theory Games Lead leader-follower collective behavior Leadership likelihood ratio based adaptation Mathematical analysis Mathematical model Mathematical models mean field (MF) stochastic control Modelling and identification Nash equilibria Noise measurement Operational research and scientific management Operational research. Management science stochastic optimal control Stochastic processes Stochasticity Studies System theory Trajectories Trajectory
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We study large population leader-follower stochastic multi-agent systems where the agents have linear stochastic dynamics and are coupled via their quadratic cost functions. The cost of each leader is based on a trade-off between moving toward a certain reference trajectory which is unknown to the followers and staying near their own centroid. On the other hand, followers react by tracking a convex combination of their own centroid and the centroid of the leaders. We approach this large population dynamic game problem by use of so-called Mean Field (MF) linear-quadratic-Gaussian (LQG) stochastic control theory. In this model, followers are adaptive in the sense that they use a likelihood ratio estimator (on a sample population of the leaders' trajectories) to identify the member of a given finite class of models which is generating the reference trajectory of the leaders. Under appropriate conditions, it is shown that the true reference trajectory model is identified by each follower in finite time with probability one as the leaders' population goes to infinity. Furthermore, we show that the resulting sets of mean field control laws for both leaders and adaptive followers possess an almost sure ε N -Nash equilibrium property for a system with population N where ε N goes to zero as N goes to infinity. Numerical experiments are presented illustrating the results.
ISSN:	0018-9286 1558-2523
DOI:	10.1109/TAC.2012.2195797