Ray double-layer scheduling method and device based on reinforcement learning and electronic equipment

The invention provides a Ray double-layer scheduling method and device based on reinforcement learning and electronic device.The Ray double-layer scheduling method based on reinforcement learning comprises the steps that a cluster task queue, resource node cluster information and resource node clust...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	ZHANG YONGJUN, GUAN YANXIA, LI YUAN, LIU XUNYUN, XU XINHAI, LIU YUNTAO
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The invention provides a Ray double-layer scheduling method and device based on reinforcement learning and electronic device.The Ray double-layer scheduling method based on reinforcement learning comprises the steps that a cluster task queue, resource node cluster information and resource node cluster task queue information are obtained, and a target decision action is determined based on a preset Ray double-layer scheduling model; wherein the preset Ray double-layer scheduling model comprises the step of determining the target decision action after reinforcement learning based on the resource node cluster information and the resource node cluster task queue information; and scheduling a to-be-scheduled task in a cluster task queue to the correspondingly allocated resource node based on the target decision action. By using the method provided by the invention, the purpose of determining the target decision-making action through autonomous learning is realized, so that the determined target decision-making act