Understanding Convolutional Neural Networks With Information Theory: An Initial Exploration

Bibliographic Details
Published in: IEEE Transactions on Neural Networks and Learning Systems, 2021-01, Vol. 32 (1), p. 435-442
Main authors: Yu, Shujian; Wickstrom, Kristoffer; Jenssen, Robert; Principe, Jose
Format: Article
Language: English
Abstract: A novel functional estimator for Rényi's α-entropy and its multivariate extension was recently proposed in terms of the normalized eigenspectrum of a Hermitian matrix of the projected data in a reproducing kernel Hilbert space (RKHS). However, these estimators are recent, and their utility and possible applications remain mostly unknown to practitioners. In this brief, we first show that this estimator enables straightforward measurement of information flow in realistic convolutional neural networks (CNNs) without any approximation. Then, we introduce the partial information decomposition (PID) framework and develop three quantities to analyze the synergy and redundancy in convolutional layer representations. Our results validate two fundamental data processing inequalities and reveal more inner properties concerning CNN training.
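
For orientation, the matrix-based estimator family the abstract refers to works roughly as follows: from n samples one builds a Gram matrix K with a suitable kernel, normalizes it to A = K / tr(K) so that the eigenvalues of the Hermitian matrix A sum to one, and evaluates S_alpha(A) = (1/(1-alpha)) log2 sum_i lambda_i(A)^alpha. The multivariate extension replaces A with the trace-normalized Hadamard product of the per-variable Gram matrices, giving joint entropies and hence mutual information such as I(X; T) between the input and a layer representation. Below is a minimal NumPy sketch of this estimator family, not the authors' implementation; the Gaussian kernel, its width sigma, alpha = 1.01, and the toy data are illustrative assumptions.

Example (Python):

    import numpy as np

    def gram_matrix(X, sigma=1.0):
        # Pairwise squared distances, then a Gaussian (RBF) kernel.
        # The kernel choice and sigma are illustrative assumptions.
        sq = np.sum(X**2, axis=1)
        d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
        K = np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma**2))
        # Trace-normalize so the eigenvalues of A sum to one.
        return K / np.trace(K)

    def renyi_entropy(A, alpha=1.01):
        # S_alpha(A) = (1/(1-alpha)) * log2(sum_i lambda_i(A)^alpha)
        lam = np.linalg.eigvalsh(A)        # eigenspectrum of the Hermitian matrix
        lam = np.clip(lam, 0.0, None)      # clip tiny negatives from round-off
        return np.log2(np.sum(lam**alpha)) / (1.0 - alpha)

    def joint_entropy(A, B, alpha=1.01):
        # Multivariate extension: Hadamard product of Gram matrices, re-normalized.
        AB = A * B
        return renyi_entropy(AB / np.trace(AB), alpha)

    def mutual_information(A, B, alpha=1.01):
        # I_alpha(X; T) = S_alpha(A) + S_alpha(B) - S_alpha(A, B)
        return renyi_entropy(A, alpha) + renyi_entropy(B, alpha) - joint_entropy(A, B, alpha)

    # Toy usage: input samples X and a hypothetical layer representation T.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(64, 10))                # 64 samples, 10 features
    T = np.tanh(X @ rng.normal(size=(10, 5)))    # stand-in for a layer output
    A, B = gram_matrix(X), gram_matrix(T)
    print(mutual_information(A, B))

With mutual information computable layer by layer directly from forward-pass activations, quantities such as I(X; T1) and I(X; T2) for successive layers T1 -> T2 can be compared against the data processing inequality I(X; T1) >= I(X; T2), which is the kind of information-flow measurement the abstract describes.
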
ISSN: 2162-237X (print), 2162-2388 (electronic)
DOI: 10.1109/TNNLS.2020.2968509