Multi-head attention mechanism fusion calculation distribution method based on acceleration processor
The invention relates to the field of data processing, and discloses a multi-head attention mechanism fusion calculation distribution method based on an acceleration processor, and the method comprises the steps: obtaining slave core information, to-be-processed data in a memory, and a calculation d...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention relates to the field of data processing, and discloses a multi-head attention mechanism fusion calculation distribution method based on an acceleration processor, and the method comprises the steps: obtaining slave core information, to-be-processed data in a memory, and a calculation demand of the to-be-processed data; based on the calculation demand and the slave core information, carrying out fusion association on operators of the slave cores to obtain fusion operators and calculation logic corresponding to each fusion operator; and calling the interfaces corresponding to the fusion operators in sequence to start the slave core, so that the slave core calculates the to-be-processed data by using the corresponding operators according to the calculation logic of each fusion operator in sequence to obtain a calculation result. The mode of fusing the operator combination is beneficial to full use of hardware resources, and compared with the situation that only a single operator is processed at the |
---|