Pixel representations, sampling, and label correction for semantic part detection

Semantic part detection within an object is of importance in the field of computer vision. This study proposes a novel approach to semantic part detection that starts by employing a convolutional neural network to concatenate a selection of feature maps from the network into a long vector for pixel...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Machine vision and applications 2024, Vol.35 (1), p.10, Article 10
Hauptverfasser:	Huang, Jiao-Chuan, Lin, You-Lin, Fang, Wen-Chieh
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Classifiers Communications Engineering Comparative analysis Computer Science Computer vision Feature maps Image Processing and Computer Vision Labeling Labels Localization Networks Neural networks Object recognition Pattern Recognition Pixels Representations Sampling Semantics Short Paper Training
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Semantic part detection within an object is of importance in the field of computer vision. This study proposes a novel approach to semantic part detection that starts by employing a convolutional neural network to concatenate a selection of feature maps from the network into a long vector for pixel representation. Using this dedicated pixel representation, we implement a range of techniques, such as Poisson disk sampling for pixel sampling and Poisson matting for pixel label correction. These techniques efficiently facilitate the training of a practical pixel classifier for part detection. Our experimental exploration investigated various factors that affect the model’s performance, including training data labeling (with or without the aid of Poisson matting), hypercolumn representation dimensionality, neural network architecture, post-processing techniques, and pixel classifier selection. In addition, we conducted a comparative analysis of our approach with established object detection methods.
ISSN:	0932-8092 1432-1769
DOI:	10.1007/s00138-023-01493-0