Training and/or utilizing machine learning models for use in natural language-based robot control

Techniques are disclosed that enable training of a target condition policy based on multiple data sets, where each data set describes a robotic task in a different manner. For example, the plurality of data sets can include: a target image data set in which a task is captured in a target image; a na...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	SEMANET PIERRE, LYNCH CORY
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CHAMBERS PROVIDED WITH MANIPULATION DEVICES HAND TOOLS MANIPULATORS PERFORMING OPERATIONS PORTABLE POWER-DRIVEN TOOLS TRANSPORTING
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Techniques are disclosed that enable training of a target condition policy based on multiple data sets, where each data set describes a robotic task in a different manner. For example, the plurality of data sets can include: a target image data set in which a task is captured in a target image; a natural language instruction dataset in which the task is described in a natural language instruction; and a task ID data set, wherein the task is described by a task ID. In various embodiments, each of the plurality of data sets has a corresponding encoder, where the encoders are trained to generate a shared potential spatial representation of a corresponding task description. Additional or alternative techniques are disclosed that enable the use of a target conditional policy network to control a robot. For example, the robot can be controlled using the target conditional policy network based on free-form natural language input describing robot task (s). 公开了能够基于多个数据集来训练目标条件策略的技术，其中，每个数据集以不同方式描述机器人任务。例如，所述多个数据集能够包括：