At-Scale Sparse Deep Neural Network Inference with Efficient GPU Implementation

High Performance Extreme Computing (2020)

This paper presents GPU performance optimization and scaling results for inference models of the Sparse Deep Neural Network Challenge 2020. Demands for network quality have increased rapidly, pushing the size and thus the memory requirements of many neural n...

Bibliographic Details
Main Authors: Hidayetoglu, Mert; Pearson, Carl; Mailthody, Vikram Sharma; Ebrahimi, Eiman; Xiong, Jinjun; Nagi, Rakesh; Hwu, Wen-Mei
Format: Article
Language: English