FGMNet: Feature grouping mechanism network for RGB-D indoor scene semantic segmentation

Semantic segmentation is a basic and long-standing research area. Depth images can enrich RGB (red-green-blue) images with their rich geometric information, so as to achieve accurate semantic segmentation. However, redundant information exists in RGB and depth images, and its handling has become an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Digital signal processing 2024-06, Vol.149, p.104480, Article 104480
Hauptverfasser: Zhang, Yuming, Zhou, Wujie, Ye, Lv, Yu, Lu, Luo, Ting
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Semantic segmentation is a basic and long-standing research area. Depth images can enrich RGB (red-green-blue) images with their rich geometric information, so as to achieve accurate semantic segmentation. However, redundant information exists in RGB and depth images, and its handling has become an important problem. Filter group convolutions are widely used because they can eliminate redundant information and reduce computational complexity and parameter cost. Similarly, we propose a feature grouping mechanism network (FGMNet) using an attention mechanism and contextual information extraction for indoor scene semantic segmentation. First, modules of pyramid feature grouping attention and feature augmentation highlight the most useful information obtained by combining RGB and depth features. The enhanced features are then fed into a feature grouping contextual module. Results from extensive experiments on well-known indoor scene semantic segmentation datasets, NYUDv2 and SUN RGB-D, indicate that our FGMNet outperforms the most advanced existing methods in RGB-D semantic segmentation.
ISSN:1051-2004
1095-4333
DOI:10.1016/j.dsp.2024.104480