An Overview of Efficient Interconnection Networks for Deep Neural Network Accelerators

Bibliographic Details
Published in: IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2020-09, Vol. 10 (3), p. 268-282
Main Authors: Nabavinejad, Seyed Morteza, Baharloo, Mohammad, Chen, Kun-Chih, Palesi, Maurizio, Kogel, Tim, Ebrahimi, Masoumeh
Format: Article
Language: English
Online Access: Full text
Description
Summary: Deep Neural Networks (DNNs) have shown significant advantages in many domains, such as pattern recognition, prediction, and control optimization. The demand for edge computing in the Internet-of-Things (IoT) era has motivated many kinds of computing platforms to accelerate DNN operations. However, due to massively parallel processing, the performance of current large-scale artificial neural networks is often limited by huge communication overheads and storage requirements. As a result, efficient interconnection and data-movement mechanisms for future on-chip artificial intelligence (AI) accelerators are worthy of study. Currently, a large body of research aims to find an efficient on-chip interconnection to achieve low-power and high-bandwidth DNN computing. This paper provides a comprehensive investigation of recent advances in efficient on-chip interconnection and the design methodology of DNN accelerators. First, we provide an overview of the different interconnection methods used in DNN accelerators. Then, interconnection methods for non-ASIC DNN accelerators are discussed. Furthermore, with a flexible interconnection, a DNN accelerator can support different computing flows, which increases computing flexibility. With this motivation, reconfigurable DNN computing with flexible on-chip interconnection is also investigated. Finally, we investigate emerging interconnection technologies (e.g., in-/near-memory processing) for DNN accelerator design. This paper systematically investigates the interconnection networks in modern DNN accelerator designs. With this article, readers are able to: 1) understand interconnection design for DNN accelerators; 2) evaluate DNNs with different on-chip interconnections; and 3) become familiar with the trade-offs among different interconnections.
ISSN: 2156-3357
2156-3365
DOI: 10.1109/JETCAS.2020.3022920