Image classification using convolutional neural network with wavelet domain inputs

Commonly used convolutional neural networks (CNNs) usually compress high‐resolution input images. Although it reduces the computation requirements into a reasonable range, the downsampling operation causes information loss, which affects the accuracy of image classification. How to adopt high‐resolu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IET image processing 2022-06, Vol.16 (8), p.2037-2048
Hauptverfasser: Wang, Luyuan, Sun, Yankui
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Commonly used convolutional neural networks (CNNs) usually compress high‐resolution input images. Although it reduces the computation requirements into a reasonable range, the downsampling operation causes information loss, which affects the accuracy of image classification. How to adopt high‐resolution image inputs to improve the quality of input information and thus improve the classification accuracy without changing the overall structure of the pre‐defined CNN model or increasing the model parameters is an important issue. Here, a CNN model with wavelet domain inputs is proposed to provide a solving scheme. Specifically, the proposed method applies wavelet packet transform or dual‐tree complex wavelet transform to extract information from input images with higher resolutions in the image pre‐processing stage. Some subband image channels are selected as the inputs of conventional CNNs where the first several convolutional layers are removed, so that the networks directly learn in the wavelet domain. Experiment results on the Caltech‐256 dataset and the Describable Textures Dataset with the ResNet‐50 show that the classification accuracy of our method can have a maximum improvement of 2.15% and 10.26%, respectively. These validate the effectiveness of our proposed scheme. This code is publicly available at https://github.com/BeBeBerr/wavelet‐cnn.
ISSN:1751-9659
1751-9667
DOI:10.1049/ipr2.12466