Implementation method for parallel acceleration of ResNet based on general neural network processor

The invention discloses an implementation method for parallel acceleration of ResNet based on a general neural network processor. The method comprises the following steps: loading a data set and a weight into a register from a central cache region, carrying out the operation of a convolutional layer...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	WANG RUNZHE, YANG-GONG YIFAN, CHUANG XIAOMING, ZHENG HANXUN
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The invention discloses an implementation method for parallel acceleration of ResNet based on a general neural network processor. The method comprises the following steps: loading a data set and a weight into a register from a central cache region, carrying out the operation of a convolutional layer, and loading a weight matrix into a matrix multiplication unit from the register; sending the data set matrix to a matrix multiplication unit in a streaming data mode, writing an operation result back to a register, and repeating the process until all data are processed; using a vector compression unit to complete batch standardization operation; completing the operation of linear rectification by using an SIMD operation unit ; completing the operation of a pooling layer by using the SIMD operation unit and a vector compression unit; completing operation of a full connection layer by using a matrix multiplication unit, and writing a result back to the register; and writing the result in the register back to the ce