Accelerator execution method and electronic equipment

A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: SANG YONGQI, XIE GANGFENG, LI JIA, GE JIANMING, QIU XIAOGANG, YAO FEI, YANG JINGWEI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in a second set of threads dot-product operates a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. According to the method, the matrix is decomposed, and the threads are distributed according to the rows, so that the multiple threads can process multiple rows of the matrix tensor in parallel, and the processing efficiency of matrix multiplication is improved. Besides, programming personnel know the row and column structure of the matrix tensor and the thread condition in the accelerator during programming,