TUNABLE AGENT BEHAVIORS THROUGH CONTINUOUS REWARD WEIGHT-BASED GOAL SPACES
A single policy can be trained to handle user-selected parameters across a predetermined range for each component of an artificially intelligent agent within a domain. The agent can be trained across a number of weights within the desired range for each component. These weights determine how m...
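The abstract is truncated, so the patent's actual training procedure is not visible here. As a rough illustration of the general idea only, the sketch below assumes a linear weighted sum of per-component rewards, uniform sampling of each weight from its allowed range once per training episode, and a policy input that has the sampled weights appended to the state. The component names (`speed`, `safety`), the ranges, and all function names are hypothetical and do not come from the record.

```python
import random

# Hypothetical reward components and the allowed weight range for each.
WEIGHT_RANGES = {"speed": (0.0, 1.0), "safety": (0.0, 1.0)}


def sample_weights(ranges, rng):
    """Draw one weight per component, uniformly within its range (assumed scheme)."""
    return {name: rng.uniform(lo, hi) for name, (lo, hi) in ranges.items()}


def weighted_reward(component_rewards, weights):
    """Combine per-component rewards into one scalar (assumed linear form)."""
    return sum(weights[name] * component_rewards[name] for name in weights)


def build_observation(state, weights, ranges):
    """Append the weights to the state in a fixed (sorted-name) order,
    so that a single policy can condition on the selected goal."""
    return list(state) + [weights[name] for name in sorted(ranges)]


if __name__ == "__main__":
    rng = random.Random(0)
    weights = sample_weights(WEIGHT_RANGES, rng)  # resampled each episode
    obs = build_observation([1.0, 2.0], weights, WEIGHT_RANGES)
    reward = weighted_reward({"speed": 2.0, "safety": -1.0}, weights)
    print(obs, reward)
```

Under these assumptions, a user would fix the weights to chosen values within the trained ranges at deployment time, and the same policy would receive them through `build_observation`.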
Main Authors: | , , , , , , , |
---|---|
Format: | Patent |
Language: | eng ; fre |