EfficientNet-EA for Visual Location Recognition in Natural Scenes

Bibliographic Details
Published in: IEEE Robotics and Automation Letters, 2025-01, Vol. 10 (1), p. 596-603
Main authors: Zhang, Heng; Chen, Yanchao; Liu, Yanli
Format: Article
Language: English
Description
Summary: In natural scenes, visual location recognition often loses accuracy because of variations in weather, lighting, and camera viewpoint, and because of occlusions caused by dynamic objects. This paper introduces an algorithm based on EfficientNet-EA to address these challenges. The algorithm appends an Efficient Feature Aggregation (EA) layer to the end of EfficientNet and is trained with MultiSimilarityLoss. This design strengthens the model's feature extraction, improving both efficiency and accuracy. During training, the model identifies and exploits hard negative samples and challenging positive samples, which improves training effectiveness and generalization across diverse conditions. Experimental results show that EfficientNet-EA achieves a recall@10 of 98.6% on Pitts30k-test. The model also shows some improvement in recognition rates under weather variations, illumination changes, viewpoint shifts, and occlusion by dynamic objects.
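Note: The summary mentions training with MultiSimilarityLoss and mining of hard negative and challenging positive samples. The following is a minimal pure-Python sketch of the general multi-similarity loss with pair mining, not the authors' implementation. The function name `multi_similarity_loss`, the hyperparameter values (`alpha`, `beta`, `lam`, `eps`), and the toy similarity matrices are illustrative assumptions that do not come from this record.

```python
import math

def multi_similarity_loss(sim, labels, alpha=2.0, beta=50.0, lam=0.5, eps=0.1):
    """Simplified multi-similarity loss over one batch.

    sim    -- square list of lists; sim[i][j] is the similarity of items i and j
    labels -- place label of each item (same label = same place)
    alpha, beta, lam, eps -- assumed hyperparameters, chosen for illustration
    """
    n = len(labels)
    total = 0.0
    for i in range(n):
        pos = [sim[i][j] for j in range(n) if j != i and labels[j] == labels[i]]
        neg = [sim[i][j] for j in range(n) if labels[j] != labels[i]]
        if not pos or not neg:
            continue  # this anchor has no positive or no negative pair
        # Pair mining: keep negatives that are nearly as similar as the
        # weakest positive, and positives that are nearly as dissimilar
        # as the strongest negative.
        hard_neg = [s for s in neg if s > min(pos) - eps]
        hard_pos = [s for s in pos if s < max(neg) + eps]
        # Soft weighting of the mined pairs; an empty list contributes log(1) = 0.
        pos_term = math.log(1.0 + sum(math.exp(-alpha * (s - lam)) for s in hard_pos)) / alpha
        neg_term = math.log(1.0 + sum(math.exp(beta * (s - lam)) for s in hard_neg)) / beta
        total += pos_term + neg_term
    return total / n

# Toy batch: items 0 and 1 show one place, items 2 and 3 another.
labels = [0, 0, 1, 1]
well_separated = [[1.0, 0.9, 0.1, 0.1],
                  [0.9, 1.0, 0.1, 0.1],
                  [0.1, 0.1, 1.0, 0.9],
                  [0.1, 0.1, 0.9, 1.0]]
confused = [[1.0, 0.4, 0.6, 0.6],
            [0.4, 1.0, 0.6, 0.6],
            [0.6, 0.6, 1.0, 0.4],
            [0.6, 0.6, 0.4, 1.0]]
print(multi_similarity_loss(well_separated, labels))
print(multi_similarity_loss(confused, labels))
```

In this sketch, a batch whose positives are already far more similar than its negatives yields no mined pairs and therefore no loss, while a batch with confusable places produces a positive loss. This is one way to read the summary's statement that training concentrates on hard samples.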
ISSN: 2377-3766
DOI: 10.1109/LRA.2024.3511379