MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition
Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or ni...
Gespeichert in:
Veröffentlicht in: | SN computer science 2024-08, Vol.5 (6), p.656, Article 656 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 6 |
container_start_page | 656 |
container_title | SN computer science |
container_volume | 5 |
creator | Quach, Minh-Duc Vo, Duc-Minh Pham, Hoang-Anh |
description | Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained. |
doi_str_mv | 10.1007/s42979-024-03011-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3068970302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3068970302</sourcerecordid><originalsourceid>FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWGq_gKeA52j-Z9dbKVaFVkutXkOaTdYt292adEH76Y2uoCcvMwPz3szjB8A5wZcEY3UVOc1VjjDlCDNMCDocgQGVkqAsx-r4z3wKRjFuMMZUYM6lGICH-ROaV-8vi-U1nHf1vkLRmtrBqTP7LjiYdlVTwvFuF1pjX6FvA5y1TYlWLmzhojbWwaWzbdlU-6ptzsCJN3V0o58-BM_Tm9XkDs0eb-8n4xmyJBMHZDJKjeCOcZavFREpn1lbpQpR5DIV7Lk0HDuqTFZY5oUiPhOFc1J6z9YFG4KL_m6K9da5uNebtgtNeqkZllmuEgiaVLRX2dDGGJzXu1BtTfjQBOsvdLpHpxM6_Y1OH5KJ9aaYxE3pwu_pf1yfARFwsQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3068970302</pqid></control><display><type>article</type><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><source>SpringerLink Journals</source><creator>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</creator><creatorcontrib>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</creatorcontrib><description>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</description><identifier>ISSN: 2661-8907</identifier><identifier>ISSN: 2662-995X</identifier><identifier>EISSN: 2661-8907</identifier><identifier>DOI: 10.1007/s42979-024-03011-z</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Computing time ; Data Structures and Information Theory ; Deep learning ; Image retrieval ; Information Systems and Communication Service ; Methods ; Neural networks ; Original Research ; Pattern Recognition and Graphics ; Robotics ; Software Engineering/Programming and Operating Systems ; Vision</subject><ispartof>SN computer science, 2024-08, Vol.5 (6), p.656, Article 656</ispartof><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</cites><orcidid>0000-0002-5806-5910</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s42979-024-03011-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s42979-024-03011-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Quach, Minh-Duc</creatorcontrib><creatorcontrib>Vo, Duc-Minh</creatorcontrib><creatorcontrib>Pham, Hoang-Anh</creatorcontrib><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><title>SN computer science</title><addtitle>SN COMPUT. SCI</addtitle><description>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</description><subject>Computer Imaging</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Computing time</subject><subject>Data Structures and Information Theory</subject><subject>Deep learning</subject><subject>Image retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Original Research</subject><subject>Pattern Recognition and Graphics</subject><subject>Robotics</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Vision</subject><issn>2661-8907</issn><issn>2662-995X</issn><issn>2661-8907</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LAzEQxYMoWGq_gKeA52j-Z9dbKVaFVkutXkOaTdYt292adEH76Y2uoCcvMwPz3szjB8A5wZcEY3UVOc1VjjDlCDNMCDocgQGVkqAsx-r4z3wKRjFuMMZUYM6lGICH-ROaV-8vi-U1nHf1vkLRmtrBqTP7LjiYdlVTwvFuF1pjX6FvA5y1TYlWLmzhojbWwaWzbdlU-6ptzsCJN3V0o58-BM_Tm9XkDs0eb-8n4xmyJBMHZDJKjeCOcZavFREpn1lbpQpR5DIV7Lk0HDuqTFZY5oUiPhOFc1J6z9YFG4KL_m6K9da5uNebtgtNeqkZllmuEgiaVLRX2dDGGJzXu1BtTfjQBOsvdLpHpxM6_Y1OH5KJ9aaYxE3pwu_pf1yfARFwsQ</recordid><startdate>20240801</startdate><enddate>20240801</enddate><creator>Quach, Minh-Duc</creator><creator>Vo, Duc-Minh</creator><creator>Pham, Hoang-Anh</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><orcidid>https://orcid.org/0000-0002-5806-5910</orcidid></search><sort><creationdate>20240801</creationdate><title>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</title><author>Quach, Minh-Duc ; Vo, Duc-Minh ; Pham, Hoang-Anh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c185z-a822a54e3439b715661abc77d5d96d5d0f46a40e27a8dc3f571f85dee66ff3bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Imaging</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Computing time</topic><topic>Data Structures and Information Theory</topic><topic>Deep learning</topic><topic>Image retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Original Research</topic><topic>Pattern Recognition and Graphics</topic><topic>Robotics</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Vision</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Quach, Minh-Duc</creatorcontrib><creatorcontrib>Vo, Duc-Minh</creatorcontrib><creatorcontrib>Pham, Hoang-Anh</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>SN computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Quach, Minh-Duc</au><au>Vo, Duc-Minh</au><au>Pham, Hoang-Anh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition</atitle><jtitle>SN computer science</jtitle><stitle>SN COMPUT. SCI</stitle><date>2024-08-01</date><risdate>2024</risdate><volume>5</volume><issue>6</issue><spage>656</spage><pages>656-</pages><artnum>656</artnum><issn>2661-8907</issn><issn>2662-995X</issn><eissn>2661-8907</eissn><abstract>Visual place recognition (VPR) is a crucial task in robotics and autonomous systems, enabling robots to localize themselves in complex and dynamic environments. Due to significant differences in appearance that arise from changes in environmental factors like season, weather, and lighting (day or night), VPR is particularly challenging in outdoor settings. This paper presents a novel method to address this challenge called MS-MixVPR, which is proposed based on an existing work, MixVPR. The proposed MS-MixVPR extracts global features from different layers of pre-trained CNN backbones using MixVPR’s Feature Mixer blocks. These visual cues are combined further to create a compact, holistic representation that is highly robust to changes in environmental conditions. We evaluate the proposed MS-MixVPR on four challenging real-world benchmark datasets, including Nordland, SPEDTest, MSLS, and Pittsburgh30k. The experimental results show that our MS-MixVPR outperforms several current state-of-the-art methods while maintaining low computational time. Consequently, our approach is suitable for real-world applications that are often resource-constrained.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s42979-024-03011-z</doi><orcidid>https://orcid.org/0000-0002-5806-5910</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2661-8907 |
ispartof | SN computer science, 2024-08, Vol.5 (6), p.656, Article 656 |
issn | 2661-8907 2662-995X 2661-8907 |
language | eng |
recordid | cdi_proquest_journals_3068970302 |
source | SpringerLink Journals |
subjects | Computer Imaging Computer Science Computer Systems Organization and Communication Networks Computing time Data Structures and Information Theory Deep learning Image retrieval Information Systems and Communication Service Methods Neural networks Original Research Pattern Recognition and Graphics Robotics Software Engineering/Programming and Operating Systems Vision |
title | MS-MixVPR: Multi-scale Feature Mixing Approach for Long-Term Place Recognition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T23%3A35%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MS-MixVPR:%20Multi-scale%20Feature%20Mixing%20Approach%20for%20Long-Term%20Place%20Recognition&rft.jtitle=SN%20computer%20science&rft.au=Quach,%20Minh-Duc&rft.date=2024-08-01&rft.volume=5&rft.issue=6&rft.spage=656&rft.pages=656-&rft.artnum=656&rft.issn=2661-8907&rft.eissn=2661-8907&rft_id=info:doi/10.1007/s42979-024-03011-z&rft_dat=%3Cproquest_cross%3E3068970302%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3068970302&rft_id=info:pmid/&rfr_iscdi=true |