Deep reinforcement learning applied to the k-server problem

•A novel algorithm for the k-server problem.•The algorithm is less susceptible to the curse of dimensionality problem.•A new perspective on intelligent transport systems. The reinforcement learning paradigm has been shown to be an effective approach in solving the k-server problem. However, this app...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Expert systems with applications 2019-11, Vol.135, p.212-218
Hauptverfasser:	Lins, Ramon Augusto Sousa, Dória, Adrião Duarte Neto, Melo, Jorge Dantas de
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Combinatorial optimization Competitive location Deep reinforcement learning Machine learning Multilayer perceptrons Neural networks Online problem Performance tests Servers The k-server problem
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•A novel algorithm for the k-server problem.•The algorithm is less susceptible to the curse of dimensionality problem.•A new perspective on intelligent transport systems. The reinforcement learning paradigm has been shown to be an effective approach in solving the k-server problem. However, this approach is based on the Q-learning algorithm, being subjected to the curse of dimensionality problem, since the action-value function (Q-function) grows exponentially with the increase in the number of states and actions. In this work, a new algorithm based on the deep reinforcement learning paradigm is proposed. For this, the Q-function is defined by a multilayer perceptron neural network that extracts the information of the environment from images that encode the dynamics of the problem. The applicability of the proposed algorithm is illustrated in a case study in which different nodes and servers problem configurations are considered. The agents behavior is analyzed during the training phase and its efficiency is evaluated from performance tests that quantify the quality of the generated server displacement policies. The results obtained provide a new algorithm promising view as an alternative solution to the k-server problem.
ISSN:	0957-4174 1873-6793
DOI:	10.1016/j.eswa.2019.06.015