Insights on Evaluation of Camera Re-localization Using Relative Pose Regression

We consider the problem of relative pose regression in visual relocalization. Recently, several promising approaches have emerged in this area. We claim that even though they demonstrate on the same datasets using the same split to train and test, a faithful comparison between them was not available...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Shalev, Amir, Achrack, Omer, Fulkerson, Brian, Bobrovsky, Ben-Zion
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Artificial Intelligence Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Shalev, Amir Achrack, Omer Fulkerson, Brian Bobrovsky, Ben-Zion
description	We consider the problem of relative pose regression in visual relocalization. Recently, several promising approaches have emerged in this area. We claim that even though they demonstrate on the same datasets using the same split to train and test, a faithful comparison between them was not available since on currently used evaluation metric, some approaches might perform favorably, while in reality performing worse. We reveal a tradeoff between accuracy and the 3D volume of the regressed subspace. We believe that unlike other relocalization approaches, in the case of relative pose regression, the regressed subspace 3D volume is less dependent on the scene and more affect by the method used to score the overlap, which determined how closely sampled viewpoints are. We propose three new metrics to remedy the issue mentioned above. The proposed metrics incorporate statistics about the regression subspace volume. We also propose a new pose regression network that serves as a new baseline for this task. We compare the performance of our trained model on Microsoft 7-Scenes and Cambridge Landmarks datasets both with the standard metrics and the newly proposed metrics and adjust the overlap score to reveal the tradeoff between the subspace and performance. The results show that the proposed metrics are more robust to different overlap threshold than the conventional approaches. Finally, we show that our network generalizes well, specifically, training on a single scene leads to little loss of performance on the other scenes.
doi_str_mv	10.48550/arxiv.2009.11342
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2009_11342</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2009_11342</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-f330254026eed3c5328fa1187b28a19b033a8ba22a17391b4967db0b81f1b2753</originalsourceid><addsrcrecordid>eNotj99KwzAUxnPjhUwfwCvzAq3JOU2TXkqZOhhsyLwuJ1tSA1krySzq01s3r75_8MGPsTspysooJR4ofYWpBCGaUkqs4JptVkMO_fsp83Hgy4niJ53CbEfPWzq6RPzVFXHcUww_l-Uth6Gf2zjHyfHtmN2c-uRynucbduUpZnf7rwu2e1ru2pdivXletY_rgmoNhUcUoCoBtXMH3CsE40lKoy0Yko0ViGQsAZDU2EhbNbU-WGGN9NKCVrhg95fbM1H3kcKR0nf3R9adyfAXpu5IRw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Insights on Evaluation of Camera Re-localization Using Relative Pose Regression</title><source>arXiv.org</source><creator>Shalev, Amir ; Achrack, Omer ; Fulkerson, Brian ; Bobrovsky, Ben-Zion</creator><creatorcontrib>Shalev, Amir ; Achrack, Omer ; Fulkerson, Brian ; Bobrovsky, Ben-Zion</creatorcontrib><description>We consider the problem of relative pose regression in visual relocalization. Recently, several promising approaches have emerged in this area. We claim that even though they demonstrate on the same datasets using the same split to train and test, a faithful comparison between them was not available since on currently used evaluation metric, some approaches might perform favorably, while in reality performing worse. We reveal a tradeoff between accuracy and the 3D volume of the regressed subspace. We believe that unlike other relocalization approaches, in the case of relative pose regression, the regressed subspace 3D volume is less dependent on the scene and more affect by the method used to score the overlap, which determined how closely sampled viewpoints are. We propose three new metrics to remedy the issue mentioned above. The proposed metrics incorporate statistics about the regression subspace volume. We also propose a new pose regression network that serves as a new baseline for this task. We compare the performance of our trained model on Microsoft 7-Scenes and Cambridge Landmarks datasets both with the standard metrics and the newly proposed metrics and adjust the overlap score to reveal the tradeoff between the subspace and performance. The results show that the proposed metrics are more robust to different overlap threshold than the conventional approaches. Finally, we show that our network generalizes well, specifically, training on a single scene leads to little loss of performance on the other scenes.</description><identifier>DOI: 10.48550/arxiv.2009.11342</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2020-09</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2009.11342$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2009.11342$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Shalev, Amir</creatorcontrib><creatorcontrib>Achrack, Omer</creatorcontrib><creatorcontrib>Fulkerson, Brian</creatorcontrib><creatorcontrib>Bobrovsky, Ben-Zion</creatorcontrib><title>Insights on Evaluation of Camera Re-localization Using Relative Pose Regression</title><description>We consider the problem of relative pose regression in visual relocalization. Recently, several promising approaches have emerged in this area. We claim that even though they demonstrate on the same datasets using the same split to train and test, a faithful comparison between them was not available since on currently used evaluation metric, some approaches might perform favorably, while in reality performing worse. We reveal a tradeoff between accuracy and the 3D volume of the regressed subspace. We believe that unlike other relocalization approaches, in the case of relative pose regression, the regressed subspace 3D volume is less dependent on the scene and more affect by the method used to score the overlap, which determined how closely sampled viewpoints are. We propose three new metrics to remedy the issue mentioned above. The proposed metrics incorporate statistics about the regression subspace volume. We also propose a new pose regression network that serves as a new baseline for this task. We compare the performance of our trained model on Microsoft 7-Scenes and Cambridge Landmarks datasets both with the standard metrics and the newly proposed metrics and adjust the overlap score to reveal the tradeoff between the subspace and performance. The results show that the proposed metrics are more robust to different overlap threshold than the conventional approaches. Finally, we show that our network generalizes well, specifically, training on a single scene leads to little loss of performance on the other scenes.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj99KwzAUxnPjhUwfwCvzAq3JOU2TXkqZOhhsyLwuJ1tSA1krySzq01s3r75_8MGPsTspysooJR4ofYWpBCGaUkqs4JptVkMO_fsp83Hgy4niJ53CbEfPWzq6RPzVFXHcUww_l-Uth6Gf2zjHyfHtmN2c-uRynucbduUpZnf7rwu2e1ru2pdivXletY_rgmoNhUcUoCoBtXMH3CsE40lKoy0Yko0ViGQsAZDU2EhbNbU-WGGN9NKCVrhg95fbM1H3kcKR0nf3R9adyfAXpu5IRw</recordid><startdate>20200923</startdate><enddate>20200923</enddate><creator>Shalev, Amir</creator><creator>Achrack, Omer</creator><creator>Fulkerson, Brian</creator><creator>Bobrovsky, Ben-Zion</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20200923</creationdate><title>Insights on Evaluation of Camera Re-localization Using Relative Pose Regression</title><author>Shalev, Amir ; Achrack, Omer ; Fulkerson, Brian ; Bobrovsky, Ben-Zion</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-f330254026eed3c5328fa1187b28a19b033a8ba22a17391b4967db0b81f1b2753</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Shalev, Amir</creatorcontrib><creatorcontrib>Achrack, Omer</creatorcontrib><creatorcontrib>Fulkerson, Brian</creatorcontrib><creatorcontrib>Bobrovsky, Ben-Zion</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shalev, Amir</au><au>Achrack, Omer</au><au>Fulkerson, Brian</au><au>Bobrovsky, Ben-Zion</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Insights on Evaluation of Camera Re-localization Using Relative Pose Regression</atitle><date>2020-09-23</date><risdate>2020</risdate><abstract>We consider the problem of relative pose regression in visual relocalization. Recently, several promising approaches have emerged in this area. We claim that even though they demonstrate on the same datasets using the same split to train and test, a faithful comparison between them was not available since on currently used evaluation metric, some approaches might perform favorably, while in reality performing worse. We reveal a tradeoff between accuracy and the 3D volume of the regressed subspace. We believe that unlike other relocalization approaches, in the case of relative pose regression, the regressed subspace 3D volume is less dependent on the scene and more affect by the method used to score the overlap, which determined how closely sampled viewpoints are. We propose three new metrics to remedy the issue mentioned above. The proposed metrics incorporate statistics about the regression subspace volume. We also propose a new pose regression network that serves as a new baseline for this task. We compare the performance of our trained model on Microsoft 7-Scenes and Cambridge Landmarks datasets both with the standard metrics and the newly proposed metrics and adjust the overlap score to reveal the tradeoff between the subspace and performance. The results show that the proposed metrics are more robust to different overlap threshold than the conventional approaches. Finally, we show that our network generalizes well, specifically, training on a single scene leads to little loss of performance on the other scenes.</abstract><doi>10.48550/arxiv.2009.11342</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2009.11342
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2009_11342
source	arXiv.org
subjects	Computer Science - Artificial Intelligence Computer Science - Computer Vision and Pattern Recognition
title	Insights on Evaluation of Camera Re-localization Using Relative Pose Regression
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T00%3A26%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Insights%20on%20Evaluation%20of%20Camera%20Re-localization%20Using%20Relative%20Pose%20Regression&rft.au=Shalev,%20Amir&rft.date=2020-09-23&rft_id=info:doi/10.48550/arxiv.2009.11342&rft_dat=%3Carxiv_GOX%3E2009_11342%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true