Method and device for compressing neural network, equipment and medium
The embodiment of the invention provides a neural network compression method and device, equipment and a storage medium. The method includes: training a neural network by using training data to determine a plurality of auxiliary parameters, the plurality of auxiliary parameters corresponding to a pl...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The embodiment of the invention provides a neural network compression method and device, equipment and a storage medium. The method includes: training a neural network by using training data to determine a plurality of auxiliary parameters, the plurality of auxiliary parameters corresponding to a plurality of output channels included in a convolutional layer of the neural network; determining a plurality of clipping parameters corresponding to the plurality of output channels based on the plurality of auxiliary parameters and the number of iterations of training, the clipping parameters indicating whether the corresponding output channels are to be clipped; and clipping at least one output channel of the plurality of output channels based on the plurality of clipping parameters. Based on the mode, the output channel in the neural network can be effectively cut, and then the neural network is compressed.
根据本公开的实施例,提供了一种压缩神经网络的方法、装置、设备和存储介质。该方法包括:通过利用训练数据来训练神经网络,来确定多个辅助参数,多个辅助参数与神经网络的卷积层所包括的多个输出通道相对应;基于多个辅助参数和训 |
---|