Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification

The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich sa...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on geoscience and remote sensing 2022, p.1-1
Hauptverfasser: Ma, Siteng, Hou, Biao, Guo, Xianpeng, Li, Zhihao, Wu, Zitong, Wang, Shuang, Jiao, Licheng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1
container_issue
container_start_page 1
container_title IEEE transactions on geoscience and remote sensing
container_volume
creator Ma, Siteng
Hou, Biao
Guo, Xianpeng
Li, Zhihao
Wu, Zitong
Wang, Shuang
Jiao, Licheng
description The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.
doi_str_mv 10.1109/TGRS.2022.3216831
format Article
fullrecord <record><control><sourceid>ieee_RIE</sourceid><recordid>TN_cdi_ieee_primary_9933708</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9933708</ieee_id><sourcerecordid>9933708</sourcerecordid><originalsourceid>FETCH-ieee_primary_99337083</originalsourceid><addsrcrecordid>eNp9jDkOwjAURF2AxHoARPMvkOAFQlwCYiuQEKGPrPATGSV2ZJsitycFNdVo3jwNIQtGY8aoXD3PjyzmlPNYcJakgg3ImDKZRDyVfEQm3r8pZesN245JefvUQUeFsy2cPl5bA1lwKmDVwV55fEFP7s4GG7oWYee9rkyDJkBpHTywsQEhQ-O1qeDaqKpvBRqEQ616t9SFCv3pjAxLVXuc_3JKlqfj83CJNCLmrdONcl0upRBbmor_6xdy7EYf</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</title><source>IEEE Electronic Library (IEL)</source><creator>Ma, Siteng ; Hou, Biao ; Guo, Xianpeng ; Li, Zhihao ; Wu, Zitong ; Wang, Shuang ; Jiao, Licheng</creator><creatorcontrib>Ma, Siteng ; Hou, Biao ; Guo, Xianpeng ; Li, Zhihao ; Wu, Zitong ; Wang, Shuang ; Jiao, Licheng</creatorcontrib><description>The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.</description><identifier>ISSN: 0196-2892</identifier><identifier>DOI: 10.1109/TGRS.2022.3216831</identifier><identifier>CODEN: IGRSD2</identifier><language>eng</language><publisher>IEEE</publisher><subject>clustering idea ; Codes ; fusion strategy ; interpretability ; multiple views ; Predictive models ; prototype assignment ; Prototypes ; Remote sensing ; Self-supervised learning ; Task analysis ; Training</subject><ispartof>IEEE transactions on geoscience and remote sensing, 2022, p.1-1</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0001-7119-3215 ; 0000-0002-0449-9465 ; 0000-0003-4940-1211 ; 0000-0002-1996-186X ; 0000-0003-3733-2570 ; 0000-0003-3354-9617 ; 0000-0001-9678-0213</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9933708$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,4010,27900,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9933708$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ma, Siteng</creatorcontrib><creatorcontrib>Hou, Biao</creatorcontrib><creatorcontrib>Guo, Xianpeng</creatorcontrib><creatorcontrib>Li, Zhihao</creatorcontrib><creatorcontrib>Wu, Zitong</creatorcontrib><creatorcontrib>Wang, Shuang</creatorcontrib><creatorcontrib>Jiao, Licheng</creatorcontrib><title>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</title><title>IEEE transactions on geoscience and remote sensing</title><addtitle>TGRS</addtitle><description>The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.</description><subject>clustering idea</subject><subject>Codes</subject><subject>fusion strategy</subject><subject>interpretability</subject><subject>multiple views</subject><subject>Predictive models</subject><subject>prototype assignment</subject><subject>Prototypes</subject><subject>Remote sensing</subject><subject>Self-supervised learning</subject><subject>Task analysis</subject><subject>Training</subject><issn>0196-2892</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNp9jDkOwjAURF2AxHoARPMvkOAFQlwCYiuQEKGPrPATGSV2ZJsitycFNdVo3jwNIQtGY8aoXD3PjyzmlPNYcJakgg3ImDKZRDyVfEQm3r8pZesN245JefvUQUeFsy2cPl5bA1lwKmDVwV55fEFP7s4GG7oWYee9rkyDJkBpHTywsQEhQ-O1qeDaqKpvBRqEQ616t9SFCv3pjAxLVXuc_3JKlqfj83CJNCLmrdONcl0upRBbmor_6xdy7EYf</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Ma, Siteng</creator><creator>Hou, Biao</creator><creator>Guo, Xianpeng</creator><creator>Li, Zhihao</creator><creator>Wu, Zitong</creator><creator>Wang, Shuang</creator><creator>Jiao, Licheng</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><orcidid>https://orcid.org/0000-0001-7119-3215</orcidid><orcidid>https://orcid.org/0000-0002-0449-9465</orcidid><orcidid>https://orcid.org/0000-0003-4940-1211</orcidid><orcidid>https://orcid.org/0000-0002-1996-186X</orcidid><orcidid>https://orcid.org/0000-0003-3733-2570</orcidid><orcidid>https://orcid.org/0000-0003-3354-9617</orcidid><orcidid>https://orcid.org/0000-0001-9678-0213</orcidid></search><sort><creationdate>2022</creationdate><title>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</title><author>Ma, Siteng ; Hou, Biao ; Guo, Xianpeng ; Li, Zhihao ; Wu, Zitong ; Wang, Shuang ; Jiao, Licheng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_99337083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>clustering idea</topic><topic>Codes</topic><topic>fusion strategy</topic><topic>interpretability</topic><topic>multiple views</topic><topic>Predictive models</topic><topic>prototype assignment</topic><topic>Prototypes</topic><topic>Remote sensing</topic><topic>Self-supervised learning</topic><topic>Task analysis</topic><topic>Training</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ma, Siteng</creatorcontrib><creatorcontrib>Hou, Biao</creatorcontrib><creatorcontrib>Guo, Xianpeng</creatorcontrib><creatorcontrib>Li, Zhihao</creatorcontrib><creatorcontrib>Wu, Zitong</creatorcontrib><creatorcontrib>Wang, Shuang</creatorcontrib><creatorcontrib>Jiao, Licheng</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><jtitle>IEEE transactions on geoscience and remote sensing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ma, Siteng</au><au>Hou, Biao</au><au>Guo, Xianpeng</au><au>Li, Zhihao</au><au>Wu, Zitong</au><au>Wang, Shuang</au><au>Jiao, Licheng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</atitle><jtitle>IEEE transactions on geoscience and remote sensing</jtitle><stitle>TGRS</stitle><date>2022</date><risdate>2022</risdate><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>0196-2892</issn><coden>IGRSD2</coden><abstract>The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.</abstract><pub>IEEE</pub><doi>10.1109/TGRS.2022.3216831</doi><orcidid>https://orcid.org/0000-0001-7119-3215</orcidid><orcidid>https://orcid.org/0000-0002-0449-9465</orcidid><orcidid>https://orcid.org/0000-0003-4940-1211</orcidid><orcidid>https://orcid.org/0000-0002-1996-186X</orcidid><orcidid>https://orcid.org/0000-0003-3733-2570</orcidid><orcidid>https://orcid.org/0000-0003-3354-9617</orcidid><orcidid>https://orcid.org/0000-0001-9678-0213</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0196-2892
ispartof IEEE transactions on geoscience and remote sensing, 2022, p.1-1
issn 0196-2892
language eng
recordid cdi_ieee_primary_9933708
source IEEE Electronic Library (IEL)
subjects clustering idea
Codes
fusion strategy
interpretability
multiple views
Predictive models
prototype assignment
Prototypes
Remote sensing
Self-supervised learning
Task analysis
Training
title Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T11%3A10%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multi-crop%20Fusion%20Strategy%20Based%20on%20Prototype%20Assignment%20for%20Remote%20Sensing%20Image%20Scene%20Classification&rft.jtitle=IEEE%20transactions%20on%20geoscience%20and%20remote%20sensing&rft.au=Ma,%20Siteng&rft.date=2022&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=0196-2892&rft.coden=IGRSD2&rft_id=info:doi/10.1109/TGRS.2022.3216831&rft_dat=%3Cieee_RIE%3E9933708%3C/ieee_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9933708&rfr_iscdi=true