Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification

The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich sa...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on geoscience and remote sensing 2022, p.1-1
Hauptverfasser:	Ma, Siteng, Hou, Biao, Guo, Xianpeng, Li, Zhihao, Wu, Zitong, Wang, Shuang, Jiao, Licheng
Format:	Artikel
Sprache:	eng
Schlagworte:	clustering idea Codes fusion strategy interpretability multiple views Predictive models prototype assignment Prototypes Remote sensing Self-supervised learning Task analysis Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1
container_issue
container_start_page	1
container_title	IEEE transactions on geoscience and remote sensing
container_volume
creator	Ma, Siteng Hou, Biao Guo, Xianpeng Li, Zhihao Wu, Zitong Wang, Shuang Jiao, Licheng
description	The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.
doi_str_mv	10.1109/TGRS.2022.3216831
format	Article
fullrecord	<record><control><sourceid>ieee_RIE</sourceid><recordid>TN_cdi_ieee_primary_9933708</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9933708</ieee_id><sourcerecordid>9933708</sourcerecordid><originalsourceid>FETCH-ieee_primary_99337083</originalsourceid><addsrcrecordid>eNp9jDkOwjAURF2AxHoARPMvkOAFQlwCYiuQEKGPrPATGSV2ZJsitycFNdVo3jwNIQtGY8aoXD3PjyzmlPNYcJakgg3ImDKZRDyVfEQm3r8pZesN245JefvUQUeFsy2cPl5bA1lwKmDVwV55fEFP7s4GG7oWYee9rkyDJkBpHTywsQEhQ-O1qeDaqKpvBRqEQ616t9SFCv3pjAxLVXuc_3JKlqfj83CJNCLmrdONcl0upRBbmor_6xdy7EYf</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</title><source>IEEE Electronic Library (IEL)</source><creator>Ma, Siteng ; Hou, Biao ; Guo, Xianpeng ; Li, Zhihao ; Wu, Zitong ; Wang, Shuang ; Jiao, Licheng</creator><creatorcontrib>Ma, Siteng ; Hou, Biao ; Guo, Xianpeng ; Li, Zhihao ; Wu, Zitong ; Wang, Shuang ; Jiao, Licheng</creatorcontrib><description>The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.</description><identifier>ISSN: 0196-2892</identifier><identifier>DOI: 10.1109/TGRS.2022.3216831</identifier><identifier>CODEN: IGRSD2</identifier><language>eng</language><publisher>IEEE</publisher><subject>clustering idea ; Codes ; fusion strategy ; interpretability ; multiple views ; Predictive models ; prototype assignment ; Prototypes ; Remote sensing ; Self-supervised learning ; Task analysis ; Training</subject><ispartof>IEEE transactions on geoscience and remote sensing, 2022, p.1-1</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0001-7119-3215 ; 0000-0002-0449-9465 ; 0000-0003-4940-1211 ; 0000-0002-1996-186X ; 0000-0003-3733-2570 ; 0000-0003-3354-9617 ; 0000-0001-9678-0213</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9933708$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,4010,27900,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9933708$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ma, Siteng</creatorcontrib><creatorcontrib>Hou, Biao</creatorcontrib><creatorcontrib>Guo, Xianpeng</creatorcontrib><creatorcontrib>Li, Zhihao</creatorcontrib><creatorcontrib>Wu, Zitong</creatorcontrib><creatorcontrib>Wang, Shuang</creatorcontrib><creatorcontrib>Jiao, Licheng</creatorcontrib><title>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</title><title>IEEE transactions on geoscience and remote sensing</title><addtitle>TGRS</addtitle><description>The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.</description><subject>clustering idea</subject><subject>Codes</subject><subject>fusion strategy</subject><subject>interpretability</subject><subject>multiple views</subject><subject>Predictive models</subject><subject>prototype assignment</subject><subject>Prototypes</subject><subject>Remote sensing</subject><subject>Self-supervised learning</subject><subject>Task analysis</subject><subject>Training</subject><issn>0196-2892</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNp9jDkOwjAURF2AxHoARPMvkOAFQlwCYiuQEKGPrPATGSV2ZJsitycFNdVo3jwNIQtGY8aoXD3PjyzmlPNYcJakgg3ImDKZRDyVfEQm3r8pZesN245JefvUQUeFsy2cPl5bA1lwKmDVwV55fEFP7s4GG7oWYee9rkyDJkBpHTywsQEhQ-O1qeDaqKpvBRqEQ616t9SFCv3pjAxLVXuc_3JKlqfj83CJNCLmrdONcl0upRBbmor_6xdy7EYf</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Ma, Siteng</creator><creator>Hou, Biao</creator><creator>Guo, Xianpeng</creator><creator>Li, Zhihao</creator><creator>Wu, Zitong</creator><creator>Wang, Shuang</creator><creator>Jiao, Licheng</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><orcidid>https://orcid.org/0000-0001-7119-3215</orcidid><orcidid>https://orcid.org/0000-0002-0449-9465</orcidid><orcidid>https://orcid.org/0000-0003-4940-1211</orcidid><orcidid>https://orcid.org/0000-0002-1996-186X</orcidid><orcidid>https://orcid.org/0000-0003-3733-2570</orcidid><orcidid>https://orcid.org/0000-0003-3354-9617</orcidid><orcidid>https://orcid.org/0000-0001-9678-0213</orcidid></search><sort><creationdate>2022</creationdate><title>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</title><author>Ma, Siteng ; Hou, Biao ; Guo, Xianpeng ; Li, Zhihao ; Wu, Zitong ; Wang, Shuang ; Jiao, Licheng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_99337083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>clustering idea</topic><topic>Codes</topic><topic>fusion strategy</topic><topic>interpretability</topic><topic>multiple views</topic><topic>Predictive models</topic><topic>prototype assignment</topic><topic>Prototypes</topic><topic>Remote sensing</topic><topic>Self-supervised learning</topic><topic>Task analysis</topic><topic>Training</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ma, Siteng</creatorcontrib><creatorcontrib>Hou, Biao</creatorcontrib><creatorcontrib>Guo, Xianpeng</creatorcontrib><creatorcontrib>Li, Zhihao</creatorcontrib><creatorcontrib>Wu, Zitong</creatorcontrib><creatorcontrib>Wang, Shuang</creatorcontrib><creatorcontrib>Jiao, Licheng</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><jtitle>IEEE transactions on geoscience and remote sensing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ma, Siteng</au><au>Hou, Biao</au><au>Guo, Xianpeng</au><au>Li, Zhihao</au><au>Wu, Zitong</au><au>Wang, Shuang</au><au>Jiao, Licheng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification</atitle><jtitle>IEEE transactions on geoscience and remote sensing</jtitle><stitle>TGRS</stitle><date>2022</date><risdate>2022</risdate><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>0196-2892</issn><coden>IGRSD2</coden><abstract>The gap between self-supervised visual representation learning and supervised learning is gradually closing. Self-supervised learning does not rely on a large amount of labeled data and reduces the loss of human labeled information. Compared with natural images, remote sensing images require rich samples and human annotation by experts. Moreover, many algorithms have poor interpretability and unconvincing results. Therefore, this paper proposes a self-supervised method based on prototype assignment by designing a pretext task so that the network maps features to prototypes in the process of learning, swaps the code corresponding to the obtained features, combines them with another data-enhancing feature, and then optimizes the network. The prototype is introduced to explain the clustering idea embodied in the whole process. Considering the existence of the scene information-rich characteristic of remote sensing images, we introduce multiple views with different resolutions to capture more detailed information on the images. Finally, if the data enhancement method is not powerful enough, the network can easily fall into an overfitting state, which prevents the network from learning subtle differences and detailed information. To address this shortcoming, we propose a fusion strategy to flatten the decision boundary of the framework so that the model can also learn the soft similarity between sample pairs. We name the whole framework MFPC. In extensive experiments conducted on three common remote sensing image datasets (i.e., UCMerced, AID, and NWPU45), MFPC achieves a maximum improvement of 4.3% over some existing self-supervised algorithms, indicating that it can achieve good results.</abstract><pub>IEEE</pub><doi>10.1109/TGRS.2022.3216831</doi><orcidid>https://orcid.org/0000-0001-7119-3215</orcidid><orcidid>https://orcid.org/0000-0002-0449-9465</orcidid><orcidid>https://orcid.org/0000-0003-4940-1211</orcidid><orcidid>https://orcid.org/0000-0002-1996-186X</orcidid><orcidid>https://orcid.org/0000-0003-3733-2570</orcidid><orcidid>https://orcid.org/0000-0003-3354-9617</orcidid><orcidid>https://orcid.org/0000-0001-9678-0213</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 0196-2892
ispartof	IEEE transactions on geoscience and remote sensing, 2022, p.1-1
issn	0196-2892
language	eng
recordid	cdi_ieee_primary_9933708
source	IEEE Electronic Library (IEL)
subjects	clustering idea Codes fusion strategy interpretability multiple views Predictive models prototype assignment Prototypes Remote sensing Self-supervised learning Task analysis Training
title	Multi-crop Fusion Strategy Based on Prototype Assignment for Remote Sensing Image Scene Classification
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T11%3A10%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multi-crop%20Fusion%20Strategy%20Based%20on%20Prototype%20Assignment%20for%20Remote%20Sensing%20Image%20Scene%20Classification&rft.jtitle=IEEE%20transactions%20on%20geoscience%20and%20remote%20sensing&rft.au=Ma,%20Siteng&rft.date=2022&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=0196-2892&rft.coden=IGRSD2&rft_id=info:doi/10.1109/TGRS.2022.3216831&rft_dat=%3Cieee_RIE%3E9933708%3C/ieee_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9933708&rfr_iscdi=true