Compute optimization mechanism for deep neural networks

Bibliographic details
Main authors: Satish, Nadathur Rajagopalan; Ashbaugh, Ben J; Galoppo Von Borries, Nicolas C; Akhbari, Farshad; Ray, Joydeep; Srinivasa, Narayan; Maiyuran, Subramaniam; Schluessler, Travis T; Feit, John H; Gottschlich, Justin E; Boles, Jeffery S; Vaidyanathan, Karthik; Surti, Prasoonkumar; Nurvitadhi, Eriko; Burke, Devan; Hurd, Linda L; Appu, Abhishek R; Chen, Feng; Baghsorkhi, Sara S; Lake, Adam T; Lin, Tsung-Han; Fu, Wenyin; Koker, Altug; Kim, Dukhwan; Sinha, Kamal; Vembu, Balaji; Barik, Rajkishore; Mastronarde, Josh B
Format: Patent
Language: English
Description
Summary: Embodiments provide mechanisms to facilitate compute operations for deep neural networks. One embodiment comprises a graphics processing unit comprising one or more multiprocessors, at least one of the one or more multiprocessors including a register file to store a plurality of different types of operands and a plurality of processing cores. The plurality of processing cores includes a first set of processing cores of a first type and a second set of processing cores of a second type. The first set of processing cores are associated with a first memory channel and the second set of processing cores are associated with a second memory channel.
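
The arrangement described in the summary can be pictured as a small data model. The following is only an illustrative sketch, not the patented design: every name in it (CoreType, ProcessingCore, Multiprocessor, build_multiprocessor), the channel indices 0 and 1, and the operand type labels "fp32" and "int8" are assumptions introduced here, since the summary names neither the core types nor the operand types.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List


class CoreType(Enum):
    # Hypothetical labels: the summary only says "first type" and "second type".
    FIRST = "first"
    SECOND = "second"


@dataclass(frozen=True)
class ProcessingCore:
    core_id: int
    core_type: CoreType
    memory_channel: int  # index of the memory channel the core is associated with


@dataclass
class Multiprocessor:
    # Register file storing operands of several types, keyed by an assumed type name.
    register_file: Dict[str, list] = field(default_factory=dict)
    cores: List[ProcessingCore] = field(default_factory=list)

    def cores_on_channel(self, channel: int) -> List[ProcessingCore]:
        """Return the cores associated with the given memory channel."""
        return [c for c in self.cores if c.memory_channel == channel]


def build_multiprocessor(n_first: int, n_second: int) -> Multiprocessor:
    """Place first-type cores on channel 0 and second-type cores on channel 1."""
    cores = [ProcessingCore(i, CoreType.FIRST, 0) for i in range(n_first)]
    cores += [
        ProcessingCore(n_first + i, CoreType.SECOND, 1) for i in range(n_second)
    ]
    # "fp32" and "int8" are placeholder operand types, not taken from the patent.
    return Multiprocessor(register_file={"fp32": [], "int8": []}, cores=cores)
```

Calling build_multiprocessor(4, 2) would give one multiprocessor with six cores, where the four first-type cores share one memory channel and the two second-type cores share the other.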