METHOD AND APPARATUS FOR COMPRESSING A NEURAL NETWORK
The invention relates to a method for compressing a neural network (10), wherein a trained neural network (10) is obtained, wherein at least one piece of structure information from the trained neural network (10) is extracted or obtained, wherein the trained neural network (10) is divided into subse...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | eng ; fre ; ger |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention relates to a method for compressing a neural network (10), wherein a trained neural network (10) is obtained, wherein at least one piece of structure information from the trained neural network (10) is extracted or obtained, wherein the trained neural network (10) is divided into subsets (I, II, III, IV, V) on the basis of the at least one piece of structure information for the purpose of compression, wherein one compression method is selected and used for each subset (I, II, III, IV, V) on the basis of at least one property of the respective subset (I, II, III, IV, V), and wherein the compressed neural network (11) is provided. The invention also relates to an apparatus (1) for compressing a neural network (10), to a computer program and to a data carrier signal. |
---|