Lightweight image classifier using dilated and depthwise separable convolutions

The image classification based on cloud computing suffers from difficult deployment as the network depth and data volume increase. Due to the depth of the model and the convolution process of each layer will produce a great amount of calculation, the GPU and storage performance of the device are ext...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of Cloud Computing 2020-09, Vol.9 (1), p.1-12, Article 55
Hauptverfasser:	Sun, Wei, Zhang, Xiaorui, He, Xiaozheng
Format:	Artikel
Sprache:	eng
Schlagworte:	Classification accuracy Cloud computing Computer Communication Networks Computer Science Computer System Implementation Computer Systems Organization and Communication Networks Depthwise separable convolution Dilated convolution Information Systems Applications (incl.Internet) Lightweight neural network Neural networks Security and privacy issues for artificial intelligence in edge-cloud computing Software Engineering/Programming and Operating Systems Special Purpose and Application-Based Systems
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The image classification based on cloud computing suffers from difficult deployment as the network depth and data volume increase. Due to the depth of the model and the convolution process of each layer will produce a great amount of calculation, the GPU and storage performance of the device are extremely demanding, and the GPU and storage devices equipped on the embedded and mobile terminals cannot support large models. So it is necessary to compress the model so that the model can be deployed on these devices. Meanwhile, traditional compression based methods often miss many global features during the compression process, resulting in low classification accuracy. To solve the problem, this paper proposes a lightweight neural network model based on dilated convolution and depthwise separable convolution with twenty-nine layers for image classification. The proposed model employs the dilated convolution to expand the receptive field during the convolution process while maintaining the number of convolution parameters, which can extract more high-level global semantic features to improve the classification accuracy. Also, the depthwise separable convolution is applied to reduce the network parameters and computational complexity in convolution operations, which reduces the size of the network. The proposed model introduces three hyperparameters: width multiplier, image resolution, and dilated rate, to compress the network on the premise of ensuring accuracy. The experimental results show that compared with GoogleNet, the network proposed in this paper improves the classification accuracy by nearly 1%, and the number of parameters is reduced by 3.7 million.
ISSN:	2192-113X 2192-113X
DOI:	10.1186/s13677-020-00203-9