METHODS FOR TRAINING AN ARTIFICIAL INTELLIGENT AGENT WITH CURRICULUM AND SKILLS

A method for training an agent uses a mixture of scenarios designed to teach specific skills helpful in a larger domain, such as mixing general racing and very specific tactical racing scenarios. Aspects of the methods can include one or more of the following: (1) training the agent to be very good...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	MACALPINE, Patrick, THOMURE, Michael D, BARRETT, Samuel, WURMAN, Peter, WALSH, Thomas J, KOMPELLA, Varun
Format:	Patent
Sprache:	eng ; fre
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A method for training an agent uses a mixture of scenarios designed to teach specific skills helpful in a larger domain, such as mixing general racing and very specific tactical racing scenarios. Aspects of the methods can include one or more of the following: (1) training the agent to be very good at time trials by having one or more cars spread out on the track; (2) running the agent in various racing scenarios with a variable number of opponents starting in different configurations around the track; (3) varying the opponents by using game-provided agents, agents trained according to aspects of the present invention, or agents controlled to follow specific driving lines; (4) setting up specific short scenarios with opponents in various racing situations with specific success criteria; and (5) having a dynamic curriculum based on how the agent performs on a variety of evaluation scenarios. Un procédé d'entraînement d'un agent utilise un mélange de scénarios conçus pour enseigner des compétences spécifiques utiles dans un domaine plus large, par exemple en mélangeant des scénarios de courses générales et de courses tactiques très spécifiques. Des aspects des procédés peuvent consister : (1) à entraîner l'agent à être très bon en contre-la-montre en ayant une ou plusieurs voitures réparties sur la piste ; et/ou (2) à faire fonctionner l'agent dans divers scénarios de course comportant un nombre variable d'adversaires commençant dans différentes configurations autour de la piste ; et/ou (3) à faire varier les adversaires à l'aide d'agents fournis par un jeu, d'agents entraînés selon des aspects de la présente invention, ou d'agents commandés pour suivre des lignes de conduite spécifiques ; et/ou (4) à établir des scénarios courts spécifiques comportant des adversaires dans diverses situations de course à critères de réussite spécifiques ; et/ou (5) à présenter un programme dynamique basé sur la manière dont l'agent se comporte sur toute une série de scénarios d'évaluation.