A Light-Weighted Hypergraph Neural Network for Multimodal Remote Sensing Image Retrieval
With the continuous maturity of remote sensing technology, the obtained remote sensing images' quality and quantity have surpassed any previous period. In this context, the content-based remote sensing image retrieval (CBRSIR) task attracts a lot of attention and research interest. Nowadays, th...
Gespeichert in:
Veröffentlicht in: | IEEE journal of selected topics in applied earth observations and remote sensing 2023, Vol.16, p.2690-2702 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | With the continuous maturity of remote sensing technology, the obtained remote sensing images' quality and quantity have surpassed any previous period. In this context, the content-based remote sensing image retrieval (CBRSIR) task attracts a lot of attention and research interest. Nowadays, the previous CBRSIR works mainly face the following problems. First of all, few works can realize one to many cross-modal image retrieval task (such as using optical image to retrieve SAR, optical images at the same time); second, research works mainly focus on small-area, target-level retrieval, and few on semantic-level retrieval of the whole image; last but not the least, most of the existing networks are characterized by massive parameters and huge computing need, which cannot be applied to resource-constrained edge devices with power and storage limit. For the sake of alleviating these bottlenecks, this article introduces a novel light-weighted nonlocal semantic fusion network based on hypergraph structure for CBRSIR (abbreviated as HGNLSF-Net). Specifically, in the framework, using the topological characteristics of hypergraph, the relationship among multiple nodes can be modeled, so as to understand the global features on remote sensing images better with fewer parameters and less computation. In addition, since the nonlocal semantics often involves a lot of noise, the hard-link module is constructed to filter noise. A series of experimental results on typical CBRSIR dataset, i.e., Multi-modal Multi-temporal Remote Sensing Image Retrieval Dataset (MMRSIRD), well show that with fewer parameters, the proposed HGNLSF-Net outperforms other methods and achieves optimal retrieval performance. |
---|---|
ISSN: | 1939-1404 2151-1535 |
DOI: | 10.1109/JSTARS.2023.3252670 |