Ring-Masked Attention Network for Rotation-Invariant Template-Matching

To solve the rotational changes in matching localization of an underwater terrain image, this letter proposes the ring-masked attention network (RMANet), a model-driven deep network for rotational template-matching tasks. Since traditional convolutional neural networks cannot effectively encode rota...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE signal processing letters 2023-01, Vol.30, p.1-5
Hauptverfasser:	Zhang, Feng, Bian, HongYu, Lv, Zheng, Zhai, YuFeng
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Computational modeling Convolutional neural networks Feature extraction Invariants Location awareness ring-projection transform Rotation rotational equivariance rotational invariance Template matching Training Transforms Underwater vehicles
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	To solve the rotational changes in matching localization of an underwater terrain image, this letter proposes the ring-masked attention network (RMANet), a model-driven deep network for rotational template-matching tasks. Since traditional convolutional neural networks cannot effectively encode rotational changes, we introduce a rotation-equivariant network to extract the rotation-equivariant features. This network determines the rotation of an image at the pixel level. Based on the rotation-equivariant features, we propose the ring-masked attention module (RMAM), which combines the idea of the ring projection transform with an attention mechanism to extract the rotation-invariant features that are independent of the orientation. The overall model combines the rotation-equivariant network with RMAM into an end-to-end network that can exploit both the feature-representation capability of the learning-based model and domain knowledge. Our experimental results show that, compared with popular approaches targeting rotational matching tasks, RMANet achieves performance gains in terms of both matching accuracy and running speed.
ISSN:	1070-9908 1558-2361
DOI:	10.1109/LSP.2023.3252406