ACTION INFORMATION LEARNING DEVICE, ACTION INFORMATION OPTIMIZATION SYSTEM AND COMPUTER READABLE MEDIUM


Bibliographic details
Main authors: NISHIMURA, Takuma, INAGUCHI, Yuuzou, TONG, Zheng
Format: Patent
Language: English
Description
Summary: The aim is to perform reinforcement learning that makes it possible to select action information that shortens cycle time while avoiding overheating. An action information learning device (300) includes: a state information acquisition means (310) for acquiring state information including an operation pattern of a spindle and a combination of parameters related to machining by a machine tool (100); an action information output means (320) for outputting action information including adjustment information for the operation pattern and the combination of parameters included in the state information; a reward calculation means (333) for acquiring judgment information, namely information on the temperature of the machine tool (100) and on the machining time of that machining, and for calculating a reward value for the reinforcement learning based on the acquired judgment information; and a value function update means (332) for updating a value function by performing the reinforcement learning based on the reward value, the state information, and the action information.
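The interplay of the reward calculation means (333) and the value function update means (332) might be sketched as follows. This is only an illustration under stated assumptions: the summary names neither the learning algorithm nor the reward formula, so tabular Q-learning, the reward values, the constants `ALPHA` and `GAMMA`, and all function, state, and action names below are hypothetical.

```python
from collections import defaultdict

# Hypothetical constants; the summary gives no numeric values.
ALPHA = 0.1  # learning rate (assumed)
GAMMA = 0.9  # discount factor (assumed)


def calculate_reward(overheated, machining_time, previous_time):
    """Assumed reward shape: penalise overheating, reward a shorter cycle."""
    if overheated:
        return -1.0
    if machining_time < previous_time:
        return 1.0
    if machining_time > previous_time:
        return -0.5
    return 0.0


def update_value_function(q, state, action, reward, next_state, actions):
    """One tabular Q-learning step (assumed algorithm, not from the source)."""
    best_next = max(q[(next_state, a)] for a in actions)
    target = reward + GAMMA * best_next
    q[(state, action)] += ALPHA * (target - q[(state, action)])
    return q[(state, action)]


# Illustrative state: (spindle operation pattern, one machining parameter).
q_table = defaultdict(float)
actions = ("raise_feed", "lower_feed")  # hypothetical adjustment actions
state = ("pattern_A", 100)
next_state = ("pattern_A", 110)

# Judgment information: no overheating, cycle time fell from 10.0 s to 9.5 s.
r = calculate_reward(overheated=False, machining_time=9.5, previous_time=10.0)
new_q = update_value_function(q_table, state, "raise_feed", r, next_state, actions)
```

In this sketch an overheating event always outweighs a time saving, which reflects the stated goal of shortening the cycle time only while overheating is avoided; the actual weighting used in the patent is not given in the summary.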