A hybrid deep convolutional neural network for accurate land cover classification

•Spatial attention mechanism improves delineation of various-sized objects in CNNs.•Cascading residual dilated convolution reinforces feature multi-scale inference.•Deep supervision through intermediary loss enhances the network training process.•Different components in the hybrid network can improv...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	International journal of applied earth observation and geoinformation 2021-12, Vol.103, p.102515, Article 102515
Hauptverfasser:	Wambugu, Naftaly, Chen, Yiping, Xiao, Zhenlong, Wei, Mingqiang, Aminu Bello, Saifullahi, Marcato Junior, José, Li, Jonathan
Format:	Artikel
Sprache:	eng
Schlagworte:	Deep learning Deep supervision Dilated convolution Remote Sensing Science & Technology Spatial attention Technology very high spatial resolution (VHSR)
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•Spatial attention mechanism improves delineation of various-sized objects in CNNs.•Cascading residual dilated convolution reinforces feature multi-scale inference.•Deep supervision through intermediary loss enhances the network training process.•Different components in the hybrid network can improve land cover classification. Land cover classification provides updated information regarding the Earth's resources, which is vital for agricultural investigation, urban management, and disaster monitoring. Current advances in sensor technology on satellite and aerial remote sensing (RS) devices have improved the spatial-spectral, radiometric, and temporal resolutions of images over time. These improvements offer invaluable chances of understanding land cover information. However, land cover classification from RS images is an intricate task because of the high intra-class disparities, low inter-class similarities, and image variation types. We propose a cascaded residual dilated network (CRD-Net) for land cover classification using very high spatial resolution (VHSR) images to address these challenges. The proposed hybrid network follows the encoder-decoder concept with a spatial attention block to guide the network on learnable discriminate features coupled with an intermediary loss to enhance the training process. Moreover, a cascaded residual dilated module increases the network's receptive field to enrich multi-contextual features further, thus boosting the resultant feature descriptor. Extensive experimental results demonstrate that the proposed CRD-Net outperformed state-of-the-art methods, achieving an overall accuracy (OA) of 90.73% and 90.51% on the ISPRS Potsdam land cover dataset and ISPRS Vaihingen dataset, respectively.
ISSN:	1569-8432 1872-826X
DOI:	10.1016/j.jag.2021.102515