TRAINING OF LARGE NEURAL NETWORKS

Bibliographic details
Main authors: Lepikhin, Dmitry; So, David Richard; Feinberg, Vladimir; Petrov, Slav; Sohn, Jin Young; Clark, Jonathan Hudson; Krikun, Maxim; Moreira, Erica Ann; Firat, Orhan; Xiao, Kefan; Xu, Yuanzhong; Shakeri, Siamak; Cheng, Yong; Nado, Zachary Alexander; Johnson Premkumar, Melvin Jose; Garcia, Xavier; Zhang, Yujing; Wu, Yonghui; Ni, Eric Jun Jie; Roy, Aurko; Mishra, Gaurav; Huang, Yanping; Dai, Andrew M; Du, Nan; Anil, Rohan
Format: Patent
Language: English
Description
Summary: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform any one or more of a variety of machine learning tasks. For example, the neural network can be configured as a generative neural network, e.g., an autoregressive generative neural network.