RECURRENT NEURAL NETWORKS

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for environment simulation. In one aspect, a system comprises a recurrent neural network configured to, at each of a plurality of time steps, receive a preceding action for a preceding time step, updat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: RACANIERE SEBASTIEN HENRI ANDRE, WIERSTRA DANIEL PIETER, MOHAMED SHAKIR, CHIAPPA SILVIA
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for environment simulation. In one aspect, a system comprises a recurrent neural network configured to, at each of a plurality of time steps, receive a preceding action for a preceding time step, update a preceding initial hidden state of the recurrent neural network from the preceding time step using the preceding action, update a preceding cell state of the recurrent neural network from the preceding time step using at least the initial hidden state for the time step, and determine a final hidden state for the time step using the cell state for the time step. The system further comprises a decoder neural network configured to receive the final hidden state for the time step and process thefinal hidden state to generate a predicted observation characterizing a predicted state of the environment at the time step. 用于环境模拟的方法、系统和装置,包括在计算机存储介质上编码的计算机程序。在一个方面,一种系统包括递归神经网络,该递归神经网络被配置为在多个时间步骤中的每个处:接收先前的时间步骤的先前的动作,使用先前的