STICKER-IM: A 65 nm Computing-in-Memory NN Processor Using Block-Wise Sparsity Optimization and Inter/Intra-Macro Data Reuse

Computing-in-memory (CIM) is a promising architecture for energy-efficient neural network (NN) processors. Several CIM macros have demonstrated high energy efficiency, while CIM-based system-on-a-chip is not well explored. This work presents a CIM NN processor, named STICKER-IM, which is implemented...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE journal of solid-state circuits 2022-08, Vol.57 (8), p.2560-2573
Hauptverfasser: Yue, Jinshan, Liu, Yongpan, Yuan, Zhe, Feng, Xiaoyu, He, Yifan, Sun, Wenyu, Zhang, Zhixiao, Si, Xin, Liu, Ruhui, Wang, Zi, Chang, Meng-Fan, Dou, Chunmeng, Li, Xueqing, Liu, Ming, Yang, Huazhong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Computing-in-memory (CIM) is a promising architecture for energy-efficient neural network (NN) processors. Several CIM macros have demonstrated high energy efficiency, while CIM-based system-on-a-chip is not well explored. This work presents a CIM NN processor, named STICKER-IM, which is implemented with sophisticated system integration. Three key innovations are proposed. First, a CIM-friendly block-wise sparsity (BWS) architecture is designed, enabling both activation-sparsity-aware acceleration and weight-sparsity-aware power-saving. Second, an adaptive kernel-/channel-order (KCO) mapping and intra-/inter-macro scheduling strategy is proposed to improve macro utilization and data reuse. Third, an efficient BWS-optimized CIM (BWS-CIM) macro with adaptive power-OFF ADCs is implemented. The STICKER-IM chip was fabricated in 65-nm CMOS technology. Experimental results show 5.8-158-TOPS/W average system energy efficiency on the sparse NN models. The macro/system-level energy efficiency is 4.23\times / 3.06\times higher compared with the state-of-the-art CIM macros and processors.
ISSN:0018-9200
1558-173X
DOI:10.1109/JSSC.2022.3148273