Robust feature space separation for deep convolutional neural network training

Bibliographic details
Published in:	Discover Artificial Intelligence 2021-12, Vol.1 (1), p.1-11, Article 12
Main authors:	Sekmen, Ali; Parlaktuna, Mustafa; Abdul-Malek, Ayad; Erdemir, Erdem; Koku, Ahmet Bugra
Format:	Article
Language:	English
Subjects:	Algorithms; Artificial Intelligence; Classification; Computational linguistics; Computer Science; Datasets; Deep Convolutional Neural Networks; Deep learning; Diabetic retinopathy; Engineering; Language processing; Natural language interfaces; Network topologies; Neural networks; Robust Deep Learning; Subspace Separation
Online access:	Full text

Description
Summary:	This paper introduces two deep convolutional neural network training techniques that lead to more robust feature subspace separation than traditional training. Assume that the dataset has $M$ labels. The first method creates $M$ deep convolutional neural networks $\{\mathrm{DCNN}_i\}_{i=1}^{M}$. Each network $\mathrm{DCNN}_i$ is composed of a convolutional neural network ($\mathrm{CNN}_i$) and a fully connected neural network ($\mathrm{FCNN}_i$). During training, a set of projection matrices $\{P_i\}_{i=1}^{M}$ is created and adaptively updated as representations of the feature subspaces $\{S_i\}_{i=1}^{M}$. A rejection value is computed for each training sample based on its projections onto the feature subspaces. Each $\mathrm{FCNN}_i$ acts as a binary classifier with a cost function whose main parameter is the rejection value. A threshold value $t_i$ is determined for the $i$-th network $\mathrm{DCNN}_i$. A testing strategy utilizing $\{t_i\}_{i=1}^{M}$ is also introduced. The second method creates a single DCNN and computes a cost function whose parameters depend on subspace separation, measured by the geodesic distance on the Grassmannian manifold between each subspace $S_i$ and the sum of all remaining subspaces $\{S_j\}_{j=1,\, j \neq i}^{M}$. The proposed methods are tested using multiple network topologies. It is shown that the first method works better for smaller networks, while the second method performs better for complex architectures.
ISSN:	2731-0809
DOI:	10.1007/s44163-021-00013-1
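
The summary refers to projection matrices for feature subspaces, rejection values, and the geodesic distance on the Grassmannian manifold. The following is a minimal NumPy sketch of those quantities, not the authors' implementation. All function names are hypothetical, and the summary does not define the rejection value, so it is assumed here to be the relative norm of the component of a feature vector lying outside the subspace. The geodesic distance is taken to be the 2-norm of the principal angles, which is one common definition. The two subspaces may have different dimensions, in which case only min(k, l) principal angles are used.

```python
import numpy as np


def orthonormal_basis(features, dim):
    """Best-fit `dim`-dimensional subspace of the columns of `features`.

    `features` has shape (d, n): one feature vector per column.
    Returns a (d, dim) matrix with orthonormal columns.
    """
    u, _, _ = np.linalg.svd(features, full_matrices=False)
    return u[:, :dim]


def projection_matrix(basis):
    """Orthogonal projector P = B B^T onto the span of an orthonormal basis B."""
    return basis @ basis.T


def rejection_value(x, projector):
    """ASSUMED definition: ||x - P x|| / ||x||, in [0, 1].

    0 means x lies inside the subspace, 1 means x is orthogonal to it.
    """
    residual = x - projector @ x
    return np.linalg.norm(residual) / np.linalg.norm(x)


def subspace_sum(bases, tol=1e-10):
    """Orthonormal basis of the sum (joint span) of several subspaces."""
    stacked = np.hstack(bases)
    u, s, _ = np.linalg.svd(stacked, full_matrices=False)
    return u[:, s > tol]  # keep directions with non-negligible singular value


def geodesic_distance(basis_a, basis_b):
    """2-norm of the principal angles between two subspaces.

    The singular values of A^T B are the cosines of the principal angles.
    """
    cosines = np.linalg.svd(basis_a.T @ basis_b, compute_uv=False)
    angles = np.arccos(np.clip(cosines, -1.0, 1.0))
    return np.linalg.norm(angles)


if __name__ == "__main__":
    # Toy example in R^3: S_1 = span(e1, e2), S_2 = span(e1, e3).
    s1 = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
    s2 = np.array([[1.0, 0.0], [0.0, 0.0], [0.0, 1.0]])
    p1 = projection_matrix(s1)
    print("rejection of e3 from S_1:", rejection_value(np.array([0.0, 0.0, 1.0]), p1))
    print("geodesic distance S_1 to S_2:", geodesic_distance(s1, s2))
```

In the second method as summarized, a separation term for class i would compare `S_i` against `subspace_sum` of the remaining class bases; how that distance enters the cost function is not stated in the summary and is left open here.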