Multi-head attention mechanism fusion calculation distribution method based on acceleration processor

The invention relates to the field of data processing, and discloses a multi-head attention mechanism fusion calculation distribution method based on an acceleration processor, and the method comprises the steps: obtaining slave core information, to-be-processed data in a memory, and a calculation d...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GAO WEI, XU NILIN, YAN XIACHAO
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention relates to the field of data processing, and discloses a multi-head attention mechanism fusion calculation distribution method based on an acceleration processor, and the method comprises the steps: obtaining slave core information, to-be-processed data in a memory, and a calculation demand of the to-be-processed data; based on the calculation demand and the slave core information, carrying out fusion association on operators of the slave cores to obtain fusion operators and calculation logic corresponding to each fusion operator; and calling the interfaces corresponding to the fusion operators in sequence to start the slave core, so that the slave core calculates the to-be-processed data by using the corresponding operators according to the calculation logic of each fusion operator in sequence to obtain a calculation result. The mode of fusing the operator combination is beneficial to full use of hardware resources, and compared with the situation that only a single operator is processed at the