Dynamic Spectrum Access Using Stochastic Multi-User Bandits

Bibliographic details
Published in:	IEEE wireless communications letters 2021-05, Vol.10 (5), p.953-956
Main authors:	Bande, Meghana; Magesh, Akshayaa; Veeravalli, Venugopal V.
Format:	Article
Language:	English
Subjects:	Algorithms; Channel estimation; Channels; Clustering algorithms; Cognitive radio; Estimation; Heuristic algorithms; Interference; Multi-armed bandit problems; multi-armed bandits; Resource management; Stochastic processes; sub-linear regret; uncoordinated spectrum access
Online access:	Full text

Description
Summary:	A stochastic multi-user multi-armed bandit framework is used to develop algorithms for uncoordinated spectrum access. In contrast to prior work, it is assumed that rewards can be non-zero even under collisions, thus allowing for the number of users to be greater than the number of channels. The proposed algorithm consists of an estimation phase and an allocation phase. It is shown that if every user adopts the algorithm, the system-wide regret is order-optimal, of order O(log T), over a time horizon of duration T. The regret guarantees hold both when the number of users is greater than the number of channels and when it is less. The algorithm is extended to the dynamic case where the number of users in the system evolves over time, and is shown to lead to sub-linear regret.
ISSN:	2162-2337 2162-2345
DOI:	10.1109/LWC.2021.3051328
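
Illustration (not part of the catalog record or the article): the summary's reward model, in which a channel still yields a non-zero reward when several users collide on it, can be sketched in a few lines of Python. Everything below is an assumption made for illustration only: the table `mean_reward`, the helper names, the brute-force search, and the definition of regret as the gap in expected system reward to the best allocation. It is not the authors' algorithm, whose estimation and allocation phases are not specified in this record.

```python
from itertools import product


def system_reward(allocation, mean_reward):
    """Sum of per-user mean rewards for one allocation.

    allocation[i] is the channel chosen by user i.
    mean_reward[k][m - 1] is the (assumed) mean reward each user obtains
    on channel k when m users share it; it may be non-zero for m > 1.
    """
    counts = {}
    for ch in allocation:
        counts[ch] = counts.get(ch, 0) + 1
    return sum(mean_reward[ch][counts[ch] - 1] for ch in allocation)


def best_allocation(num_users, mean_reward):
    """Brute-force search over all assignments (tiny instances only)."""
    channels = range(len(mean_reward))
    return max(product(channels, repeat=num_users),
               key=lambda a: system_reward(a, mean_reward))


def cumulative_regret(played, mean_reward):
    """Gap in expected system reward to the best allocation, summed over rounds."""
    num_users = len(played[0])
    best = system_reward(best_allocation(num_users, mean_reward), mean_reward)
    return sum(best - system_reward(a, mean_reward) for a in played)


# Hypothetical instance: 2 channels, 3 users (more users than channels).
mean_reward = [
    [1.0, 0.5, 0.25],    # channel 0 shared by 1, 2, 3 users
    [0.75, 0.25, 0.125], # channel 1 shared by 1, 2, 3 users
]
alloc = best_allocation(3, mean_reward)
best = system_reward(alloc, mean_reward)
regret = cumulative_regret([(0, 0, 0), (0, 0, 1)], mean_reward)
```

In this made-up instance the best allocation places two users on channel 0 and one on channel 1, which is only possible to express because colliding users still earn a reward. A learning algorithm would not know `mean_reward` in advance and would have to estimate it, which is where regret accumulates.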