Optimizing Throughput Performance in Distributed MIMO Wi-Fi Networks Using Deep Reinforcement Learning

This paper explores the feasibility of leveraging deep reinforcement learning (DRL) to enable dynamic resource management in Wi-Fi networks implementing distributed multi-user MIMO (D-MIMO). D-MIMO is a technique by which a set of wireless access points are synchronized and grouped together to joint...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on cognitive communications and networking 2020-03, Vol.6 (1), p.135-150
Hauptverfasser:	Nurani Krishnan, Neelakantan, Torkildson, Eric, Mandayam, Narayan B., Raychaudhuri, Dipankar, Rantala, Enrico-Henrik, Doppler, Klaus
Format:	Artikel
Sprache:	eng
Schlagworte:	artificial intelligence Channel allocation Dynamic scheduling Machine learning MIMO communication MIMO systems Optimization Production scheduling Reinforcement learning Resource management Science & Technology Technology Telecommunications Throughput Training Wireless access points Wireless communication systems Wireless fidelity Wireless LAN Wireless networks
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper explores the feasibility of leveraging deep reinforcement learning (DRL) to enable dynamic resource management in Wi-Fi networks implementing distributed multi-user MIMO (D-MIMO). D-MIMO is a technique by which a set of wireless access points are synchronized and grouped together to jointly serve multiple users simultaneously. This paper addresses two dynamic resource management problems germane to D-MIMO Wi-Fi networks: (i) channel assignment of D-MIMO groups, and (ii) deciding how to cluster access points to form D-MIMO groups, in order to maximize user throughput performance. These problems are known to be NP-Hard and only heuristic solutions exist in literature. We construct a DRL framework through which a learning agent interacts with a D-MIMO Wi-Fi network, learns about the network environment, and successfully converges to policies which address the aforementioned problems. Through extensive simulations and on-line training based on D-MIMO Wi-Fi networks, this paper demonstrates the efficacy of DRL agents in achieving an improvement of 20% in user throughput performance compared to heuristic solutions, particularly when network conditions are dynamic. This work also showcases the effectiveness of DRL agents in meeting multiple network objectives simultaneously, for instance, maximizing throughput of users as well as fairness of throughput among them.
ISSN:	2332-7731 2332-7731
DOI:	10.1109/TCCN.2019.2942917