MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition

Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or ni...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SN computer science 2024-08, Vol.5 (6), p.656, Article 656
Hauptverfasser: Quach, Minh-Duc, Vo, Duc-Minh, Pham, Hoang-Anh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.
ISSN:2661-8907
2662-995X
2661-8907
DOI:10.1007/s42979-024-03011-z