Point Transformer for Shape Classification and Retrieval of Urban Roof Point Clouds

The success of deep learning methods led to significant breakthroughs in 3-D point cloud processing tasks with applications in remote sensing. Existing methods utilize convolutions that have some limitations, as they assume a uniform input distribution and cannot learn long-range dependences. Recent...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE geoscience and remote sensing letters 2022, Vol.19, p.1-5
Hauptverfasser:	Shajahan, Dimple A., Varma T., Mukund, Muthuganapathy, Ramanathan
Format:	Artikel
Sprache:	eng
Schlagworte:	Benchmark testing Benchmarks Classification Data models Datasets Deep learning Feature extraction Machine learning Mean average precision (MAP) Methods Performance enhancement Performance evaluation Point Transformer (PT) Remote sensing Retrieval Robustness Routing self-attention Shape shape classification shape retrieval Three dimensional models Three-dimensional displays Transformers
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The success of deep learning methods led to significant breakthroughs in 3-D point cloud processing tasks with applications in remote sensing. Existing methods utilize convolutions that have some limitations, as they assume a uniform input distribution and cannot learn long-range dependences. Recent works have shown that adding attention in conjunction with these methods improves performance. This raises a question: can attention layers completely replace convolutions? This letter proposes a fully attentional model-Point Transformer (PT) for deriving a rich point cloud representation. The model's shape classification and retrieval performance are evaluated on a large-scale urban data set-RoofN3D and a standard benchmark data set ModelNet40. Extensive experiments are conducted to test the model's robustness to unseen point corruptions for analyzing its effectiveness on real data sets. The proposed method outperforms other state-of-the-art models in the RoofN3D data set, gives competitive results in the ModelNet40 benchmark, and shows high robustness to various unseen point corruptions. Furthermore, the model is highly memory and space-efficient when compared to other methods.
ISSN:	1545-598X 1558-0571
DOI:	10.1109/LGRS.2021.3061422