Method for training aircraft control agent
An example includes a method (200) for training an agent (115) to control an aircraft (10A). The method includes: selecting (202), by the agent (115), first actions (Aa) for the aircraft to perform within a first environment (112A) respectively during first time intervals (Ta) based on first states...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | An example includes a method (200) for training an agent (115) to control an aircraft (10A). The method includes: selecting (202), by the agent (115), first actions (Aa) for the aircraft to perform within a first environment (112A) respectively during first time intervals (Ta) based on first states (Sa) of the first environment (112A) during the first time intervals (Ta), updating (204) the agent (115) based on first rewards (Ra) that correspond respectively to the first states (Sa), selecting (206), by the agent (115), second actions (AP) for the aircraft (10A) to perform within a second environment (112B) respectively during second time intervals (T) based on second states (SP) of the second environment (112B) during the second time intervals (T), and updating (208) the agent (115) based on second rewards (RP) that correspond respectively to the second states (SP). At least one first rule of the first environment (112A) is different from at least one rule of the second environment (112B). z zo |
---|