METHOD AND APPARATUS FOR COMPRESSING A NEURAL NETWORK

Bibliographic details
Main authors: SCHLICHT, Peter; MA, Yuan; HÜGER, Fabian; VARGHESE, Serin
Format: Patent
Languages: English; French; German
Description
Summary: The invention relates to a method for compressing a neural network (10), wherein a trained neural network (10) is obtained, wherein at least one piece of structure information is extracted or obtained from the trained neural network (10), wherein the trained neural network (10) is divided into subsets (I, II, III, IV, V) on the basis of the at least one piece of structure information for the purpose of compression, wherein one compression method is selected and used for each subset (I, II, III, IV, V) on the basis of at least one property of the respective subset (I, II, III, IV, V), and wherein the compressed neural network (11) is provided. The invention also relates to an apparatus (1) for compressing a neural network (10), to a computer program and to a data carrier signal.
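
The steps named in the summary (divide the trained network into subsets using structure information, then select one compression method per subset from a property of that subset) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `Layer` type, the use of the layer kind as structure information, the parameter count as the deciding property, and the two stand-in compression methods (magnitude pruning and uniform quantization).

```python
from dataclasses import dataclass
from itertools import groupby
from typing import Callable, List


@dataclass
class Layer:
    """Hypothetical stand-in for one layer of a trained network."""
    name: str
    kind: str             # assumed structure information, e.g. "conv" or "dense"
    weights: List[float]


def split_into_subsets(layers: List[Layer]) -> List[List[Layer]]:
    """Divide the network into subsets of consecutive layers of the same kind."""
    return [list(group) for _, group in groupby(layers, key=lambda l: l.kind)]


def prune_small(weights: List[float], threshold: float = 0.1) -> List[float]:
    """Assumed method A: magnitude pruning, zeroing weights below the threshold."""
    return [w if abs(w) >= threshold else 0.0 for w in weights]


def quantize(weights: List[float], step: float = 0.25) -> List[float]:
    """Assumed method B: uniform quantization to multiples of `step`."""
    return [round(w / step) * step for w in weights]


def select_method(subset: List[Layer], size_limit: int = 4) -> Callable[[List[float]], List[float]]:
    """Pick one method per subset from a property of it (here: parameter count)."""
    n_params = sum(len(layer.weights) for layer in subset)
    return prune_small if n_params > size_limit else quantize


def compress(layers: List[Layer]) -> List[Layer]:
    """Apply the selected method to every layer of each subset."""
    compressed = []
    for subset in split_into_subsets(layers):
        method = select_method(subset)
        for layer in subset:
            compressed.append(Layer(layer.name, layer.kind, method(layer.weights)))
    return compressed
```

Calling `compress` on a list of `Layer` objects returns a new list in the same order, with each subset compressed by the method chosen for it. A real system would likely derive the subsets and the selection criterion from richer information than this sketch assumes.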