Adaptive convolution kernel for artificial neural networks

Many deep neural networks are built by using stacked convolutional layers of fixed and single size (often 3 × 3) kernels. This paper describes a method for learning the size of convolutional kernels to provide varying size kernels in a single layer. The method utilizes a differentiable, and therefor...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of visual communication and image representation 2021-02, Vol.75, p.103015, Article 103015
Hauptverfasser:	Tek, F. Boray, Çam, İlker, Karlı, Deniz
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptive convolution Image classification Multi-scale convolution Residual networks
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Many deep neural networks are built by using stacked convolutional layers of fixed and single size (often 3 × 3) kernels. This paper describes a method for learning the size of convolutional kernels to provide varying size kernels in a single layer. The method utilizes a differentiable, and therefore backpropagation-trainable Gaussian envelope which can grow or shrink in a base grid. Our experiments compared the proposed adaptive layers to ordinary convolution layers in a simple two-layer network, a deeper residual network, and a U-Net architecture. The results in the popular image classification datasets such as MNIST, MNIST-CLUTTERED, CIFAR-10, Fashion, and “Faces in the Wild” showed that the adaptive kernels can provide statistically significant improvements on ordinary convolution kernels. A segmentation experiment in the Oxford-Pets dataset demonstrated that replacing ordinary convolution layers in a U-shaped network with 7 × 7 adaptive layers can improve its learning performance and ability to generalize. •The kernel size hyperparameter of convolutional layers is made adaptive (learned).•Allows kernels of varying size in a single convolutional layer.•Improves the performance of the RESNET model in image classification.•Improves the performance of the UNET model in image segmentation.
ISSN:	1047-3203 1095-9076
DOI:	10.1016/j.jvcir.2020.103015