Polynomial approximation based spectral dual graph convolution for scene parsing and segmentation

Detailed description

Bibliographic details
Published in:	Neurocomputing (Amsterdam), 2021-05, Vol. 438, p. 133-144
Main authors:	Sun, Zitang; Wang, Ruojing; Luo, Zhengbo
Format:	Article
Language:	English
Subjects:	Computer Science; Computer Science, Artificial Intelligence; Graph convolution; Science & Technology; Semantic segmentation; Signal processing; Technology
Online access:	Full text

Description
Summary:	• Semantic segmentation is divided into two directions: detail localization and semantic classification.
• Graph structure is more suitable for global modeling and for capturing context information.
• A polynomial approximates the mapping function to achieve the desired frequency-biased graph convolution.

Semantic segmentation requires both a large receptive field and accurate spatial information. Although existing methods based on the fully convolutional network (FCN) have greatly improved accuracy, they still do not produce satisfactory results on complex scene parsing and tiny object identification. The convolution operation in the FCN suffers from a restricted receptive field, while global modeling is fundamental to dense prediction tasks. In this work, we apply graph convolution to the semantic segmentation task and propose a spectral dual graph convolution module to address these problems. Moreover, the semantic segmentation task can be divided into two directions: one is to obtain a large receptive field and take global context information into account; the other is to extract spatial and contour cues, such as sharply changing curves and tiny objects. From a spectral-domain perspective, we assume that low-frequency information is critical to the former task, while high-frequency information is vital to the latter. Accordingly, high-frequency-biased and low-frequency-biased graph convolutions are proposed to process the two kinds of information separately. Experiments on Cityscapes, COCO Stuff, PASCAL Context, and PASCAL VOC demonstrate the effectiveness of our methods on semantic segmentation. The proposed method achieves comparable performance with lower computational and memory overhead.
ISSN:	0925-2312 1872-8286
DOI:	10.1016/j.neucom.2021.01.002
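
Illustrative note: the summary describes graph convolutions biased toward low or high graph frequencies through a polynomial of the graph operator. The record does not give the authors' polynomial, so the following is only a minimal sketch under stated assumptions: it uses the symmetric normalized Laplacian (eigenvalues in [0, 2]) and two simple, hypothetical responses, (1 - λ/2)^K for the low-frequency branch and (λ/2)^K for the high-frequency branch. The function names and the ring-graph example are likewise assumptions chosen for illustration, not the paper's module.

```python
import numpy as np


def normalized_laplacian(adj):
    """Symmetric normalized Laplacian L = I - D^{-1/2} A D^{-1/2}."""
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1)
    d_inv_sqrt = np.zeros_like(deg)
    nonzero = deg > 0
    d_inv_sqrt[nonzero] = deg[nonzero] ** -0.5
    return np.eye(adj.shape[0]) - d_inv_sqrt[:, None] * adj * d_inv_sqrt[None, :]


def poly_filter(lap, x, order, band):
    """Apply an assumed polynomial filter of the Laplacian to signal x.

    band="low"  uses (I - L/2)^order, which attenuates high graph frequencies.
    band="high" uses (L/2)^order, which attenuates low graph frequencies.
    The polynomial is applied by repeated products, so no eigendecomposition
    is needed.
    """
    if band == "low":
        op = np.eye(lap.shape[0]) - lap / 2.0
    elif band == "high":
        op = lap / 2.0
    else:
        raise ValueError("band must be 'low' or 'high'")
    out = np.asarray(x, dtype=float)
    for _ in range(order):
        out = op @ out
    return out


def ring_adjacency(n):
    """Adjacency matrix of an undirected ring with n nodes (toy graph)."""
    adj = np.zeros((n, n))
    idx = np.arange(n)
    adj[idx, (idx + 1) % n] = 1.0
    adj[(idx + 1) % n, idx] = 1.0
    return adj


lap = normalized_laplacian(ring_adjacency(8))
smooth = np.ones(8)                   # lowest graph frequency on the ring
sharp = np.array([1.0, -1.0] * 4)     # highest graph frequency on the ring

low_smooth = poly_filter(lap, smooth, order=3, band="low")
low_sharp = poly_filter(lap, sharp, order=3, band="low")
high_smooth = poly_filter(lap, smooth, order=3, band="high")
high_sharp = poly_filter(lap, sharp, order=3, band="high")
```

On this toy ring, the low-frequency branch leaves the constant signal unchanged and suppresses the alternating one, while the high-frequency branch does the opposite. This mirrors the split the summary draws between global context and sharp, fine-grained detail.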