Contextual deconvolution network for semantic segmentation

•A Contextual Deconvolution Network (CDN) is proposed to enhance the representation power of the decoder network.•A channel contextual module is proposed to model the channel interdependencies of features, which imposes a global semantic control.•A spatial contextual module is introduced to model th...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition 2020-05, Vol.101, p.107152, Article 107152
Hauptverfasser: Fu, Jun, Liu, Jing, Li, Yong, Bao, Yongjun, Yan, Weipeng, Fang, Zhiwei, Lu, Hanqing
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A Contextual Deconvolution Network (CDN) is proposed to enhance the representation power of the decoder network.•A channel contextual module is proposed to model the channel interdependencies of features, which imposes a global semantic control.•A spatial contextual module is introduced to model the spatial interdependencies, making the features more expressive on some local regions.•The proposed method achieves competitive performance on PASCAL VOC 2012, ADE20K, PASCAL-Context and Cityscapes dataset, and new state-of-the-art performance on PASCAL-Context dataset. In this paper, we propose a Contextual Deconvolution Network (CDN) and focus on context association in decoder network. Specifically, in upsampling path, we introduce two types of contextual modules to model the interdependencies of features in channel and spatial dimensions respectively. The channel contextual module captures image-level semantic information by aggregating the feature maps across spatial dimensions, and clarifies global ambiguity of features. Meanwhile, the spatial contextual module obtains patch-level semantic context by learning a spatial weight map, and enhance the feature discrimination. We embed the two contextual modules into individual components of the decoder network, thus improving the representation power and gaining more precise segment results. Thorough evaluations are performed on four challenging datasets, i.e., PASCAL VOC 2012, ADE20K, PASCAL-Context and Cityscapes dataset. Our approach achieves competitive performance with state-of-the-art models on PASCAL VOC 2012,ADE20K and Cityscapes dataset, and new state-of-the-art performance on PASCAL-Context dataset.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2019.107152