Resource allocation and congestion control in clustered M2M communication using Q‐learning

In this paper, we apply a Q‐learning algorithm to carry out slot assignment for machine type communication devices (MTCDs) in machine‐to‐machine communication. We first make use of a K‐means clustering algorithm to overcome the congestion problem in an machine‐to‐machine network where each MTCD is a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Transactions on emerging telecommunications technologies 2017-04, Vol.28 (4), p.n/a
Hauptverfasser: Hussain, Fatima, Anpalagan, Alagan, Khwaja, Ahmed Shaharyar, Naeem, Muhammad
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper, we apply a Q‐learning algorithm to carry out slot assignment for machine type communication devices (MTCDs) in machine‐to‐machine communication. We first make use of a K‐means clustering algorithm to overcome the congestion problem in an machine‐to‐machine network where each MTCD is associated with one controller. Subsequently, we formulate the slot selection problem as an optimisation problem. Then, we present a solution using the Q‐learning algorithm to select conflict‐free slot assignment in a random access network with MTCD controllers. The performance of the solution is dependent on parameters such as learning rate and reward. We thoroughly analyse the performance of the proposed algorithm considering different parameters related to its operation. The convergence time, that is, the time required to reach a solution, decreases with increasing value of learning rate, whereas the convergence probability increases. In addition, for smaller values of learning rate, the convergence time decreases with increasing reward values. We also compare with simple ALOHA and channel‐based scheduled allocation and show that the proposed Q‐learning‐based technique has a higher probability of assigning slots compared with these techniques. Copyright © 2016 John Wiley & Sons, Ltd. A self‐organized resource allocation scheme for time slots and frequency channels is proposed for a clustered M2M network based on independent learning. The advantage is the performance enhancement of MTCDs by selecting the conflict‐free slots in congested scenarios using Q‐learning algorithm in a distributed manner. Clustering is proposed to overcome the congestion and overload problem in M2M. The convergence capabilities are defined as convergence time, convergence probability and cumulative success rate and they are analysed with respect to different system parameters.
ISSN:2161-3915
2161-3915
DOI:10.1002/ett.3039