MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition

Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or ni...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SN computer science 2024-08, Vol.5 (6), p.656, Article 656
Hauptverfasser: Quach, Minh-Duc, Vo, Duc-Minh, Pham, Hoang-Anh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 6
container_start_page 656
container_title SN computer science
container_volume 5
creator Quach, Minh-Duc
Vo, Duc-Minh
Pham, Hoang-Anh
description Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.
doi_str_mv 10.1007/s42979-024-03011-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3068970302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3068970302</sourcerecordid><originalsourceid>FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWGq_gKeA52j-Z9dbKVaFVkutXkOaTdYt292adEH76Y2uoCcvMwPz3szjB8A5wZcEY3UVOc1VjjDlCDNMCDocgQGVkqAsx-r4z3wKRjFuMMZUYM6lGICH-ROaV-8vi-U1nHf1vkLRmtrBqTP7LjiYdlVTwvFuF1pjX6FvA5y1TYlWLmzhojbWwaWzbdlU-6ptzsCJN3V0o58-BM_Tm9XkDs0eb-8n4xmyJBMHZDJKjeCOcZavFREpn1lbpQpR5DIV7Lk0HDuqTFZY5oUiPhOFc1J6z9YFG4KL_m6K9da5uNebtgtNeqkZllmuEgiaVLRX2dDGGJzXu1BtTfjQBOsvdLpHpxM6_Y1OH5KJ9aaYxE3pwu_pf1yfARFwsQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3068970302</pqid></control><display><type>article</type><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><source>SpringerLink Journals</source><creator>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</creator><creatorcontrib>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</creatorcontrib><description>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</description><identifier>ISSN: 2661-8907</identifier><identifier>ISSN: 2662-995X</identifier><identifier>EISSN: 2661-8907</identifier><identifier>DOI: 10.1007/s42979-024-03011-z</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Computing time ; Data Structures and Information Theory ; Deep learning ; Image retrieval ; Information Systems and Communication Service ; Methods ; Neural networks ; Original Research ; Pattern Recognition and Graphics ; Robotics ; Software Engineering/Programming and Operating Systems ; Vision</subject><ispartof>SN computer science, 2024-08, Vol.5 (6), p.656, Article 656</ispartof><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</cites><orcidid>0000-0002-5806-5910</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s42979-024-03011-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s42979-024-03011-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Quach, Minh-Duc</creatorcontrib><creatorcontrib>Vo, Duc-Minh</creatorcontrib><creatorcontrib>Pham, Hoang-Anh</creatorcontrib><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><title>SN computer science</title><addtitle>SN COMPUT. SCI</addtitle><description>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</description><subject>Computer Imaging</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Computing time</subject><subject>Data Structures and Information Theory</subject><subject>Deep learning</subject><subject>Image retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Original Research</subject><subject>Pattern Recognition and Graphics</subject><subject>Robotics</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Vision</subject><issn>2661-8907</issn><issn>2662-995X</issn><issn>2661-8907</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LAzEQxYMoWGq_gKeA52j-Z9dbKVaFVkutXkOaTdYt292adEH76Y2uoCcvMwPz3szjB8A5wZcEY3UVOc1VjjDlCDNMCDocgQGVkqAsx-r4z3wKRjFuMMZUYM6lGICH-ROaV-8vi-U1nHf1vkLRmtrBqTP7LjiYdlVTwvFuF1pjX6FvA5y1TYlWLmzhojbWwaWzbdlU-6ptzsCJN3V0o58-BM_Tm9XkDs0eb-8n4xmyJBMHZDJKjeCOcZavFREpn1lbpQpR5DIV7Lk0HDuqTFZY5oUiPhOFc1J6z9YFG4KL_m6K9da5uNebtgtNeqkZllmuEgiaVLRX2dDGGJzXu1BtTfjQBOsvdLpHpxM6_Y1OH5KJ9aaYxE3pwu_pf1yfARFwsQ</recordid><startdate>20240801</startdate><enddate>20240801</enddate><creator>Quach, Minh-Duc</creator><creator>Vo, Duc-Minh</creator><creator>Pham, Hoang-Anh</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><orcidid>https://orcid.org/0000-0002-5806-5910</orcidid></search><sort><creationdate>20240801</creationdate><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><author>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Imaging</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Computing time</topic><topic>Data Structures and Information Theory</topic><topic>Deep learning</topic><topic>Image retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Original Research</topic><topic>Pattern Recognition and Graphics</topic><topic>Robotics</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Vision</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Quach, Minh-Duc</creatorcontrib><creatorcontrib>Vo, Duc-Minh</creatorcontrib><creatorcontrib>Pham, Hoang-Anh</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>SN computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Quach, Minh-Duc</au><au>Vo, Duc-Minh</au><au>Pham, Hoang-Anh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</atitle><jtitle>SN computer science</jtitle><stitle>SN COMPUT. SCI</stitle><date>2024-08-01</date><risdate>2024</risdate><volume>5</volume><issue>6</issue><spage>656</spage><pages>656-</pages><artnum>656</artnum><issn>2661-8907</issn><issn>2662-995X</issn><eissn>2661-8907</eissn><abstract>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s42979-024-03011-z</doi><orcidid>https://orcid.org/0000-0002-5806-5910</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2661-8907
ispartof SN computer science, 2024-08, Vol.5 (6), p.656, Article 656
issn 2661-8907
2662-995X
2661-8907
language eng
recordid cdi_proquest_journals_3068970302
source SpringerLink Journals
subjects Computer Imaging
Computer Science
Computer Systems Organization and Communication Networks
Computing time
Data Structures and Information Theory
Deep learning
Image retrieval
Information Systems and Communication Service
Methods
Neural networks
Original Research
Pattern Recognition and Graphics
Robotics
Software Engineering/Programming and Operating Systems
Vision
title MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T23%3A35%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MS-MixVPR:%20Multi-scale%20Feature%20Mixing%20Approach%20for%20Long-Term%20Place%20Recognition&rft.jtitle=SN%20computer%20science&rft.au=Quach,%20Minh-Duc&rft.date=2024-08-01&rft.volume=5&rft.issue=6&rft.spage=656&rft.pages=656-&rft.artnum=656&rft.issn=2661-8907&rft.eissn=2661-8907&rft_id=info:doi/10.1007/s42979-024-03011-z&rft_dat=%3Cproquest_cross%3E3068970302%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3068970302&rft_id=info:pmid/&rfr_iscdi=true