MCCG: A ConvNeXt-based Multiple-Classifier Method for Cross-view Geo-localization

Bibliographic Details
Published in: IEEE Transactions on Circuits and Systems for Video Technology, 2024-03, Vol. 34 (3), p. 1-1
Authors: Shen, Tianrui; Wei, Yingmei; Kang, Lai; Wan, Shanshan; Yang, Yee-Hong
Format: Article
Language: English
Abstract: The key to cross-view geo-localization is to match images of the same target taken from different viewpoints, e.g., from drones and satellites. The problem is challenging because the appearance of objects changes with viewpoint. Most existing methods focus on extracting global features or on segmenting feature maps, and thereby lose information contained in the images. To address these issues, we propose a new ConvNeXt-based method called MCCG, which stands for Multiple Classifier for Cross-view Geo-localization. The proposed method captures rich discriminative information through cross-dimension interaction and acquires multiple feature representations, yielding a comprehensive feature representation. In addition, the multiple feature representations exploit more contextual information, which makes the model more robust to position shifting and scale variations. Extensive experiments on the widely used public benchmarks University-1652 and SUES-200 demonstrate that the proposed method outperforms existing methods by over 3%, achieving state-of-the-art performance in both drone-view target localization and drone navigation. Our code and model are available at https://github.com/mode-str/crossview.
ISSN: 1051-8215, 1558-2205
DOI: 10.1109/TCSVT.2023.3296074
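
The abstract describes the method only at a high level: a ConvNeXt backbone, multiple classifiers, and multiple feature representations. The following is a minimal PyTorch sketch of that general idea, not the authors' released implementation (which is at the GitHub link above). The class name MultiClassifierGeoNet, the num_parts parameter, and the head design are illustrative assumptions; consult the released code for the actual architecture.

    import torch
    import torch.nn as nn
    from torchvision.models import convnext_tiny

    class MultiClassifierGeoNet(nn.Module):
        """Sketch: a shared ConvNeXt backbone whose pooled feature vector is
        split into channel chunks, each supervised by its own classifier head,
        so the chunks learn complementary feature representations."""

        def __init__(self, num_classes, num_parts=2, feat_dim=768):
            super().__init__()
            # Shared ConvNeXt-Tiny feature extractor; its final stage
            # outputs 768 channels.
            self.backbone = convnext_tiny(weights=None).features
            self.pool = nn.AdaptiveAvgPool2d(1)
            assert feat_dim % num_parts == 0
            part_dim = feat_dim // num_parts
            # One lightweight classifier per channel chunk.
            self.heads = nn.ModuleList([
                nn.Sequential(
                    nn.Linear(part_dim, 512),
                    nn.BatchNorm1d(512),
                    nn.ReLU(inplace=True),
                    nn.Linear(512, num_classes),
                )
                for _ in range(num_parts)
            ])
            self.num_parts = num_parts

        def forward(self, x):
            f = self.pool(self.backbone(x)).flatten(1)   # (B, feat_dim)
            chunks = f.chunk(self.num_parts, dim=1)      # split channel dimension
            return [head(c) for head, c in zip(self.heads, chunks)]

    # Each head would get its own cross-entropy loss during training; at
    # retrieval time, the pre-classifier features can serve as descriptors
    # for matching drone views against satellite views.
    model = MultiClassifierGeoNet(num_classes=701)       # e.g., the 701 training classes of University-1652
    logits = model(torch.randn(2, 3, 256, 256))
    print([tuple(t.shape) for t in logits])              # [(2, 701), (2, 701)]

Supervising each chunk with a separate classifier encourages the chunks to encode complementary cues, which is in the spirit of the robustness to position shifting and scale variation that the abstract attributes to the multiple feature representations.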