MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition

Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or ni...

Full description

Bibliographic details
Published in:	SN computer science 2024-08, Vol.5 (6), p.656, Article 656
Main authors:	Quach, Minh-Duc; Vo, Duc-Minh; Pham, Hoang-Anh
Format:	Article
Language:	eng
Subjects:	Computer Imaging; Computer Science; Computer Systems Organization and Communication Networks; Computing time; Data Structures and Information Theory; Deep learning; Image retrieval; Information Systems and Communication Service; Methods; Neural networks; Original Research; Pattern Recognition and Graphics; Robotics; Software Engineering/Programming and Operating Systems; Vision
Online access:	Full text

container_end_page
container_issue	6
container_start_page	656
container_title	SN computer science
container_volume	5
creator	Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh
description	Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.
doi_str_mv	10.1007/s42979-024-03011-z
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3068970302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3068970302</sourcerecordid><originalsourceid>FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWGq_gKeA52j-Z9dbKVaFVkutXkOaTdYt292adEH76Y2uoCcvMwPz3szjB8A5wZcEY3UVOc1VjjDlCDNMCDocgQGVkqAsx-r4z3wKRjFuMMZUYM6lGICH-ROaV-8vi-U1nHf1vkLRmtrBqTP7LjiYdlVTwvFuF1pjX6FvA5y1TYlWLmzhojbWwaWzbdlU-6ptzsCJN3V0o58-BM_Tm9XkDs0eb-8n4xmyJBMHZDJKjeCOcZavFREpn1lbpQpR5DIV7Lk0HDuqTFZY5oUiPhOFc1J6z9YFG4KL_m6K9da5uNebtgtNeqkZllmuEgiaVLRX2dDGGJzXu1BtTfjQBOsvdLpHpxM6_Y1OH5KJ9aaYxE3pwu_pf1yfARFwsQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3068970302</pqid></control><display><type>article</type><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><source>SpringerLink Journals</source><creator>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</creator><creatorcontrib>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</creatorcontrib><description>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. 
We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</description><identifier>ISSN: 2661-8907</identifier><identifier>ISSN: 2662-995X</identifier><identifier>EISSN: 2661-8907</identifier><identifier>DOI: 10.1007/s42979-024-03011-z</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Computing time ; Data Structures and Information Theory ; Deep learning ; Image retrieval ; Information Systems and Communication Service ; Methods ; Neural networks ; Original Research ; Pattern Recognition and Graphics ; Robotics ; Software Engineering/Programming and Operating Systems ; Vision</subject><ispartof>SN computer science, 2024-08, Vol.5 (6), p.656, Article 656</ispartof><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024. Springer Nature or its licensor (e.g. 
a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</cites><orcidid>0000-0002-5806-5910</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s42979-024-03011-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s42979-024-03011-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Quach, Minh-Duc</creatorcontrib><creatorcontrib>Vo, Duc-Minh</creatorcontrib><creatorcontrib>Pham, Hoang-Anh</creatorcontrib><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><title>SN computer science</title><addtitle>SN COMPUT. SCI</addtitle><description>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. 
These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</description><subject>Computer Imaging</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Computing time</subject><subject>Data Structures and Information Theory</subject><subject>Deep learning</subject><subject>Image retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Original Research</subject><subject>Pattern Recognition and Graphics</subject><subject>Robotics</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Vision</subject><issn>2661-8907</issn><issn>2662-995X</issn><issn>2661-8907</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LAzEQxYMoWGq_gKeA52j-Z9dbKVaFVkutXkOaTdYt292adEH76Y2uoCcvMwPz3szjB8A5wZcEY3UVOc1VjjDlCDNMCDocgQGVkqAsx-r4z3wKRjFuMMZUYM6lGICH-ROaV-8vi-U1nHf1vkLRmtrBqTP7LjiYdlVTwvFuF1pjX6FvA5y1TYlWLmzhojbWwaWzbdlU-6ptzsCJN3V0o58-BM_Tm9XkDs0eb-8n4xmyJBMHZDJKjeCOcZavFREpn1lbpQpR5DIV7Lk0HDuqTFZY5oUiPhOFc1J6z9YFG4KL_m6K9da5uNebtgtNeqkZllmuEgiaVLRX2dDGGJzXu1BtTfjQBOsvdLpHpxM6_Y1OH5KJ9aaYxE3pwu_pf1yfARFwsQ</recordid><startdate>20240801</startdate><enddate>20240801</enddate><creator>Quach, Minh-Duc</creator><creator>Vo, Duc-Minh</creator><creator>Pham, Hoang-Anh</creator><general>Springer Nature 
Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><orcidid>https://orcid.org/0000-0002-5806-5910</orcidid></search><sort><creationdate>20240801</creationdate><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><author>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Imaging</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Computing time</topic><topic>Data Structures and Information Theory</topic><topic>Deep learning</topic><topic>Image retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Original Research</topic><topic>Pattern Recognition and Graphics</topic><topic>Robotics</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Vision</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Quach, Minh-Duc</creatorcontrib><creatorcontrib>Vo, Duc-Minh</creatorcontrib><creatorcontrib>Pham, Hoang-Anh</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>SN computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Quach, Minh-Duc</au><au>Vo, Duc-Minh</au><au>Pham, Hoang-Anh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</atitle><jtitle>SN computer science</jtitle><stitle>SN COMPUT. 
SCI</stitle><date>2024-08-01</date><risdate>2024</risdate><volume>5</volume><issue>6</issue><spage>656</spage><pages>656-</pages><artnum>656</artnum><issn>2661-8907</issn><issn>2662-995X</issn><eissn>2661-8907</eissn><abstract>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s42979-024-03011-z</doi><orcidid>https://orcid.org/0000-0002-5806-5910</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 2661-8907
ispartof	SN computer science, 2024-08, Vol.5 (6), p.656, Article 656
issn	2661-8907 2662-995X 2661-8907
language	eng
recordid	cdi_proquest_journals_3068970302
source	SpringerLink Journals
subjects	Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Computing time ; Data Structures and Information Theory ; Deep learning ; Image retrieval ; Information Systems and Communication Service ; Methods ; Neural networks ; Original Research ; Pattern Recognition and Graphics ; Robotics ; Software Engineering/Programming and Operating Systems ; Vision
title	MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T23%3A35%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MS-MixVPR:%20Multi-scale%20Feature%20Mixing%20Approach%20for%20Long-Term%20Place%20Recognition&rft.jtitle=SN%20computer%20science&rft.au=Quach,%20Minh-Duc&rft.date=2024-08-01&rft.volume=5&rft.issue=6&rft.spage=656&rft.pages=656-&rft.artnum=656&rft.issn=2661-8907&rft.eissn=2661-8907&rft_id=info:doi/10.1007/s42979-024-03011-z&rft_dat=%3Cproquest_cross%3E3068970302%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3068970302&rft_id=info:pmid/&rfr_iscdi=true