Context propagation embedding network for weakly supervised semantic segmentation

Published in: Multimedia Tools and Applications, 2020-12, Vol. 79 (45-46), pp. 33925-33942
Authors: Xu, Yajun; Mao, Zhendong; Chen, Zhineng; Wen, Xin; Li, Yangyang
Format: Article
Language: English
Online access: Full text
Abstract: Weakly supervised semantic segmentation with image-level labels is significant because it enables practical applications with only lightweight manual annotation. Recent studies first infer visual cues, i.e., the discriminative regions corresponding to each object category in an image, using deep convolutional classification networks, and then expand these cues to generate initial segmentation masks. Despite remarkable progress, segmentation performance remains unsatisfactory because the inferred visual cues are incomplete; using such low-quality cues as priors limits the achievable segmentation quality. To overcome this problem, we propose a novel context propagation embedding network (CPENet) that generates high-quality visual cues by learning the semantic relationship between each region and its surrounding neighbors and selectively propagating discriminative information to non-discriminative but object-related regions. Our method provides reliable initial segmentation masks for training a subsequent segmentation network that produces the final results. In addition, we refine the convolutional block attention module (CBAM) [30] to hierarchically extract more category-aware features by capturing global contextual information, further promoting the propagation process. Experiments on a benchmark dataset demonstrate that the proposed method outperforms the state of the art.
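
The abstract names two ingredients without giving implementation details: a CBAM-style attention block and a propagation step that spreads discriminative evidence along learned semantic affinities. The PyTorch sketch below illustrates both under stated assumptions: ChannelAttention and SpatialAttention follow the published CBAM design, while propagate_cues (its name, signature, and cosine-affinity scheme) is a hypothetical illustration of affinity-based cue propagation, not the authors' CPENet implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ChannelAttention(nn.Module):
    # CBAM-style channel attention: global avg- and max-pooled descriptors
    # pass through a shared MLP and gate the channels.
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
        )

    def forward(self, x):
        b, c, _, _ = x.shape
        avg = self.mlp(F.adaptive_avg_pool2d(x, 1).view(b, c))
        mx = self.mlp(F.adaptive_max_pool2d(x, 1).view(b, c))
        return x * torch.sigmoid(avg + mx).view(b, c, 1, 1)

class SpatialAttention(nn.Module):
    # CBAM-style spatial attention: channel-pooled maps are convolved into
    # a single gate over spatial locations.
    def __init__(self, kernel_size=7):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, x):
        pooled = torch.cat([x.mean(dim=1, keepdim=True),
                            x.max(dim=1, keepdim=True).values], dim=1)
        return x * torch.sigmoid(self.conv(pooled))

def propagate_cues(cam, features):
    # Hypothetical affinity-based propagation: each location aggregates CAM
    # scores from feature-similar locations, spreading discriminative
    # evidence to object-related but non-discriminative regions.
    # cam: (B, K, H, W) class activation maps; features: (B, C, H, W).
    b, k, h, w = cam.shape
    f = F.normalize(features.flatten(2), dim=1)              # (B, C, HW)
    affinity = torch.relu(torch.bmm(f.transpose(1, 2), f))   # (B, HW, HW)
    affinity = affinity / affinity.sum(-1, keepdim=True).clamp_min(1e-6)
    refined = torch.bmm(cam.flatten(2), affinity.transpose(1, 2))
    return refined.view(b, k, h, w)

Row-normalizing the non-negative affinity matrix makes the propagation a weighted average over similar locations, so the refined scores stay within the range of the input activation maps.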
ISSN: 1380-7501, 1573-7721
DOI: 10.1007/s11042-020-08787-9