Generalized Global Bandit and Its Application in Cellular Coverage Optimization

Bibliographic details
Published in:	IEEE Journal of Selected Topics in Signal Processing 2018-02, Vol.12 (1), p.218-232
Main authors:	Shen, Cong; Zhou, Ruida; Tekin, Cem; van der Schaar, Mihaela
Format:	Article
Language:	English
Subjects:	Algorithm design and analysis; Algorithms; Computer simulation; Correlation; Cost analysis; coverage optimization; Greedy algorithms; Mathematical models; Multi-armed bandit; Multi-armed bandit problems; Numerical simulation; online learning; Optimization; Radio frequency; regret analysis; Signal processing algorithms; Switches

Description
Abstract:	Motivated by the engineering problem of cellular coverage optimization, we propose a novel multiarmed bandit model called generalized global bandit. We develop a series of greedy algorithms that can handle nonmonotonic but decomposable reward functions, multidimensional global parameters, and switching costs. The proposed algorithms are rigorously analyzed under the multiarmed bandit framework, where we show that they achieve bounded regret and hence are guaranteed to converge to the optimal arm in finite time. The algorithms are then applied to the cellular coverage optimization problem to achieve the optimal tradeoff between sufficient small cell coverage and limited macroleakage without prior knowledge of the deployment environment. The performance advantage of the new algorithms over existing bandit solutions is revealed analytically and further confirmed via numerical simulations. The key element behind the performance improvement is a more efficient "trial and error" mechanism, in which any trial helps improve the knowledge of all candidate power levels.
ISSN:	1932-4553 1941-0484
DOI:	10.1109/JSTSP.2018.2798164
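
The abstract's closing remark, that any trial improves the knowledge of all candidate power levels, can be illustrated with a toy sketch. The Python below is not the authors' algorithm. It assumes a single scalar global parameter theta in [0, 1], three hypothetical arms whose mean rewards are known, invertible linear functions of theta, and Gaussian reward noise. The names ARMS, estimate_theta, and greedy_global_bandit are invented for illustration. The nonmonotonic rewards, multidimensional parameters, and switching costs mentioned in the abstract are left out.

```python
import random

# Hypothetical arms (an assumption, not from the paper): each entry is
# (mean reward as a known function of theta, the inverse of that function).
ARMS = [
    (lambda t: 0.8 * t,       lambda m: m / 0.8),
    (lambda t: 1.0 - t,       lambda m: 1.0 - m),
    (lambda t: 0.5 + 0.2 * t, lambda m: (m - 0.5) / 0.2),
]


def clip01(x):
    """Keep an estimate inside the assumed parameter range [0, 1]."""
    return max(0.0, min(1.0, x))


def estimate_theta(counts, sums):
    """Combine per-arm estimates of theta, weighted by pull counts.

    Every pulled arm contributes, so a pull of any arm refines the shared
    estimate that is used to rank all arms.
    """
    total = sum(counts)
    estimate = 0.0
    for (_, inverse), n, s in zip(ARMS, counts, sums):
        if n > 0:
            estimate += (n / total) * clip01(inverse(s / n))
    return estimate


def greedy_global_bandit(theta_true, horizon, noise=0.05, seed=0):
    """Run a purely greedy policy and return the sequence of chosen arms."""
    rng = random.Random(seed)
    counts = [0] * len(ARMS)
    sums = [0.0] * len(ARMS)
    choices = []
    for _ in range(horizon):
        if sum(counts) == 0:
            arm = 0  # arbitrary first pull; nothing is known yet
        else:
            theta_hat = estimate_theta(counts, sums)
            # Greedy step: play the arm that looks best under theta_hat.
            arm = max(range(len(ARMS)), key=lambda k: ARMS[k][0](theta_hat))
        reward = ARMS[arm][0](theta_true) + rng.gauss(0.0, noise)
        counts[arm] += 1
        sums[arm] += reward
        choices.append(arm)
    return choices
```

With theta_true=0.3 the hypothetical arms have mean rewards 0.24, 0.7, and 0.56, so the second arm (index 1) is optimal. Calling greedy_global_bandit(0.3, 50, noise=0.0) pulls arm 0 once, recovers theta from that single observation, and then stays on arm 1. No separate exploration of each arm is needed because all arms share the same parameter.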