APPARATUS, SYSTEM, METHOD AND COMPUTER-IMPLEMENTED STORAGE MEDIA TO IMPLEMENT RADIO RESOURCE MANAGEMENT POLICIES USING MACHINE LEARNING

An apparatus of a transmitter computing node n (TX node n) of a wireless network, one or more computer readable media, a system, and a method. The apparatus includes one or more processors to: implement machine learning (ML) based training rounds, each training round including: determining a local a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Naderializadeh, Navid, Akdeniz, Mustafa Riza, Anand, Arjun, Balakrishnan, Ravikumar, Himayat, Nageen, Dhakal, Sagar, Eisen, Mark R
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:An apparatus of a transmitter computing node n (TX node n) of a wireless network, one or more computer readable media, a system, and a method. The apparatus includes one or more processors to: implement machine learning (ML) based training rounds, each training round including: determining a local action value function Qn(hn, an; θn) corresponding to a value of performing a radio resource management (RRM) action an at a receiving computing node n (RX node n) associated with TX node n using policy parameter θn and based on hn, hn including channel state information at RX node n; and determining, based on an overall action value function Qtot at time t, an estimated gradient of an overall loss at time t for overall policy parameter θt(∇Lt(θt)), wherein Qtot corresponds to a mixing of local action value functions Qi(hi, ai; θi) for all TX nodes i in the network at time t including TX node n; and determine, in response to a determination that ∇Lt(θt) is close to zero for various values of t during training, a trained local action value function Qn,trained to generate a trained action value relating to data communication between TX node n and RX node n.