DATA PROCESSING METHOD AND APPARATUS FOR NEURAL NETWORK
Disclosed are a data processing method and apparatus for a neural network, which method and apparatus relate to the field of artificial intelligence. The method comprises: according to the data amount of input data, a first feature of an internal memory in a chip that runs a neural network, and a se...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng ; fre |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Disclosed are a data processing method and apparatus for a neural network, which method and apparatus relate to the field of artificial intelligence. The method comprises: according to the data amount of input data, a first feature of an internal memory in a chip that runs a neural network, and a second feature of multiple layers in the neural network, dynamically segmenting the input data, and configuring different batch sizes for the layers in the neural network. By means of configuring a rational batch size for each layer in a neural network, during a neural network inference procedure, an internal memory can be fully utilized to store inter-layer data of the neural network, thereby improving the utilization rate of the internal memory, and ensuring the computational efficiency of hardware that runs the neural network.
Sont divulgués ici un procédé et un appareil de traitement de données d'un réseau neuronal qui se rapportent au domaine de l'intelligence artificielle. Le procédé consiste : en fonction de l |
---|