SPBA-Net point cloud object detection with sparse attention and box aligning

Object detection in point clouds is essential for various applications, including autonomous navigation, household robots, and augmented/virtual reality. However, during voxelization and Bird’s Eye View transformation, local point cloud data often remains sparse due to non-target areas and noise poi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2024-11, Vol.14 (1), p.28420-10, Article 28420
Hauptverfasser: Sha, Haojie, Gao, Qingrui, Zeng, Hao, Li, Kai, Li, Wang, Zhang, Xuande, Wang, Xiaohui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Object detection in point clouds is essential for various applications, including autonomous navigation, household robots, and augmented/virtual reality. However, during voxelization and Bird’s Eye View transformation, local point cloud data often remains sparse due to non-target areas and noise points, posing a significant challenge for feature extraction. In this paper, we propose a novel mechanism named Keypoint Guided Sparse Attention (KGSA) to enhance the semantic information of point clouds by calculating Euclidean distances between selected keypoints and others. Additionally, we introduce Instance-wise Box Aligning, a method for expanding predicted boxes and clustering the points within them to achieve precise alignment between predicted bounding boxes and ground-truth targets. Experimental results demonstrate the superiority of our proposed SPBA-Net in 3D object detection on point clouds compared to other state-of-the-art methods.The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-024-77097-z