Complexity of Spectrum Activity and Benefits of Reinforcement Learning for Dynamic Channel Selection

We explore the question of when learning improves the performance of opportunistic dynamic channel selection by characterizing the primary user (PU) activity using the concept of Lempel-Ziv complexity. We evaluate the effectiveness of a reinforcement learning algorithm by testing it with real spectr...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE journal on selected areas in communications 2013-11, Vol.31 (11), p.2237-2248
Hauptverfasser:	Macaluso, Irene, Finn, Danny, Ozgul, Baris, DaSilva, Luiz A.
Format:	Artikel
Sprache:	eng
Schlagworte:	Complexity theory Computational modeling Dynamic spectrum access Entropy Learning (artificial intelligence) Lempel-Ziv complexity Machine learning algorithms Markov processes Mathematical model reinforcement learning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We explore the question of when learning improves the performance of opportunistic dynamic channel selection by characterizing the primary user (PU) activity using the concept of Lempel-Ziv complexity. We evaluate the effectiveness of a reinforcement learning algorithm by testing it with real spectrum occupancy data collected in the GSM, ISM, and DECT bands. Our results show that learning performance is highly correlated with the level of PU activity and the amount of structure in the use of spectrum. For low levels of PU activity and/or high complexity in its utilization of channels, reinforcement learning performs no better than simple random channel selection. We suggest that Lempel-Ziv complexity might be one of the features considered by a cognitive radio when deciding which channels to opportunistically explore.
ISSN:	0733-8716 1558-0008
DOI:	10.1109/JSAC.2013.131115