Learning agents for the multi-mode project scheduling problem

Bibliographic details
Published in:	The Journal of the Operational Research Society 2011-02, Vol.62 (2), p.281-290
Main authors:	Wauters, T, Verbeeck, K, Vanden Berghe, G, De Causmaecker, P
Format:	Article
Language:	eng
Subjects:	Algorithms; Automata; Business and Management; Distance learning; Educational activities; Graph representations; Heuristic; Heuristics; Learning styles; Machine learning; Management; Nonrenewable resources; Operations research; Operations Research/Decision Theory; Optimization; Project management; Renewable resources; Schedules; Scheduling; Special Issue Paper; Special Issue Papers; Studies
Online access:	Full text

Description
Summary:	Intelligent optimization refers to the promising technique of integrating learning mechanisms into (meta-)heuristic search. In this paper, we use multi-agent reinforcement learning to build high-quality solutions for the multi-mode resource-constrained project scheduling problem (MRCPSP). We use a network of distributed reinforcement learning agents that cooperate to jointly learn a well-performing constructive heuristic. Each agent, being responsible for one activity, uses two simple learning devices, called learning automata, that learn to select a successor activity order and a mode, respectively. By coupling the reward signals for both learning tasks, we can clearly show the advantage of using reinforcement learning in search. We present some comparative results to show that our method can compete with the best-performing algorithms for the MRCPSP while using only simple learning schemes, without the burden of complex fine-tuning.
ISSN:	0160-5682 1476-9360
DOI:	10.1057/jors.2010.101
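
Illustrative sketch (not part of the catalog record): the summary describes one agent per activity, each holding two learning automata that receive a coupled reward. The Python below is a minimal sketch of that idea, not the authors' implementation. It assumes a linear reward-inaction update, which the summary does not specify, and the names LearningAutomaton, ActivityAgent, n_orders, n_modes and learning_rate are hypothetical.

```python
import random


class LearningAutomaton:
    """Keeps a probability vector over a fixed set of actions."""

    def __init__(self, n_actions, learning_rate=0.1, rng=None):
        if n_actions < 1:
            raise ValueError("n_actions must be at least 1")
        self.probs = [1.0 / n_actions] * n_actions  # start uniform
        self.learning_rate = learning_rate
        self.rng = rng or random.Random()

    def choose(self):
        """Sample an action index according to the current probabilities."""
        return self.rng.choices(range(len(self.probs)), weights=self.probs)[0]

    def update(self, action, reward):
        """Assumed linear reward-inaction rule, reward in [0, 1].

        The chosen action gains probability in proportion to the reward;
        all other actions shrink so the vector still sums to 1.
        A reward of 0 leaves the vector unchanged.
        """
        step = self.learning_rate * reward
        for i in range(len(self.probs)):
            if i == action:
                self.probs[i] += step * (1.0 - self.probs[i])
            else:
                self.probs[i] -= step * self.probs[i]


class ActivityAgent:
    """Hypothetical agent for one activity: one automaton picks an
    ordering option, the other picks an execution mode."""

    def __init__(self, n_orders, n_modes, learning_rate=0.1, rng=None):
        self.order_la = LearningAutomaton(n_orders, learning_rate, rng)
        self.mode_la = LearningAutomaton(n_modes, learning_rate, rng)
        self.last_choice = None

    def act(self):
        self.last_choice = (self.order_la.choose(), self.mode_la.choose())
        return self.last_choice

    def learn(self, reward):
        """Coupled reward: both automata receive the same signal."""
        order, mode = self.last_choice
        self.order_la.update(order, reward)
        self.mode_la.update(mode, reward)
```

In a full method the reward would come from the quality of the schedule built from all agents' choices; how that schedule is constructed and scored is outside what the summary states, so it is left out here.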