DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration
Pose registration is critical in vision and robotics. This paper focuses on the challenging task of initialization-free pose registration up to 7DoF for homogeneous and heterogeneous measurements. While recent learning-based methods show promise using differentiable solvers, they either rely on heur...
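Main authors: | Chen, Zexi; Liao, Yiyi; Du, Haozhe; Zhang, Haodong; Xu, Xuecheng; Lu, Haojian; Xiong, Rong; Wang, Yue |
---|---|
Format: | Article |
Language: | eng |
Subjects: | Computer Science - Computer Vision and Pattern Recognition; Computer Science - Robotics |
Online access: | Order full text |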
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Chen, Zexi ; Liao, Yiyi ; Du, Haozhe ; Zhang, Haodong ; Xu, Xuecheng ; Lu, Haojian ; Xiong, Rong ; Wang, Yue |
description | Pose registration is critical in vision and robotics. This paper focuses on
the challenging task of initialization-free pose registration up to 7DoF for
homogeneous and heterogeneous measurements. While recent learning-based methods
show promise using differentiable solvers, they either rely on heuristically
defined correspondences or are prone to local minima. We present a
differentiable phase correlation (DPC) solver that is globally convergent and
correspondence-free. When combined with simple feature extraction networks, our
general framework DPCN++ allows for versatile pose registration with arbitrary
initialization. Specifically, the feature extraction networks first learn dense
feature grids from a pair of homogeneous/heterogeneous measurements. These
feature grids are then transformed into a translation- and scale-invariant
spectrum representation based on the Fourier transform and spherical radial
aggregation, decoupling translation and scale from rotation. Next, the
rotation, scale, and translation are independently and efficiently estimated in
the spectrum step-by-step using the DPC solver. The entire pipeline is
differentiable and trained end-to-end. We evaluate DPCN++ on a wide range of
registration tasks taking different input modalities, including 2D bird's-eye
view images, 3D object and scene measurements, and medical images. Experimental
results demonstrate that DPCN++ outperforms both classical and learning-based
baselines, especially on partially observed and heterogeneous measurements. |
doi_str_mv | 10.48550/arxiv.2206.05707 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2206_05707</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2206_05707</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-6476698ae88f59425451b276f393e1e0561615dc6232b2f782c76d8a976302ed3</originalsourceid><addsrcrecordid>eNotj0tLw0AURmfjQqo_wJWzL4nzyNyZuJPUR6HUUEq3YdLc0cHYkZvg49_bRlcffBwOHMaupMgLZ4y48fQdP3OlBOTCWGHPWb2oq_V8fssXMQQkPIzRtz3y-tUPyKtEhL0fYzrwNY5fid54SMR3SMPxPXHpiG3wJQ4jTdwFOwu-H_Dyf2ds-3C_rZ6y1fPjsrpbZR6szaCwAKXz6FwwZaFMYWSrLARdapQoDEiQptuD0qpVwTq1t9A5X1rQQmGnZ-z6TzslNR8U3z39NKe0ZkrTv06bR6Y</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration</title><source>arXiv.org</source><creator>Chen, Zexi ; Liao, Yiyi ; Du, Haozhe ; Zhang, Haodong ; Xu, Xuecheng ; Lu, Haojian ; Xiong, Rong ; Wang, Yue</creator><creatorcontrib>Chen, Zexi ; Liao, Yiyi ; Du, Haozhe ; Zhang, Haodong ; Xu, Xuecheng ; Lu, Haojian ; Xiong, Rong ; Wang, Yue</creatorcontrib><description>Pose registration is critical in vision and robotics. This paper focuses on
the challenging task of initialization-free pose registration up to 7DoF for
homogeneous and heterogeneous measurements. While recent learning-based methods
show promise using differentiable solvers, they either rely on heuristically
defined correspondences or are prone to local minima. We present a
differentiable phase correlation (DPC) solver that is globally convergent and
correspondence-free. When combined with simple feature extraction networks, our
general framework DPCN++ allows for versatile pose registration with arbitrary
initialization. Specifically, the feature extraction networks first learn dense
feature grids from a pair of homogeneous/heterogeneous measurements. These
feature grids are then transformed into a translation and scale invariant
spectrum representation based on Fourier transform and spherical radial
aggregation, decoupling translation and scale from rotation. Next, the
rotation, scale, and translation are independently and efficiently estimated in
the spectrum step-by-step using the DPC solver. The entire pipeline is
differentiable and trained end-to-end. We evaluate DPCN++ on a wide range of
registration tasks taking different input modalities, including 2D bird's-eye
view images, 3D object and scene measurements, and medical images. Experimental
results demonstrate that DPCN++ outperforms both classical and learning-based
baselines, especially on partially observed and heterogeneous measurements.</description><identifier>DOI: 10.48550/arxiv.2206.05707</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Robotics</subject><creationdate>2022-06</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2206.05707$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2206.05707$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Chen, Zexi</creatorcontrib><creatorcontrib>Liao, Yiyi</creatorcontrib><creatorcontrib>Du, Haozhe</creatorcontrib><creatorcontrib>Zhang, Haodong</creatorcontrib><creatorcontrib>Xu, Xuecheng</creatorcontrib><creatorcontrib>Lu, Haojian</creatorcontrib><creatorcontrib>Xiong, Rong</creatorcontrib><creatorcontrib>Wang, Yue</creatorcontrib><title>DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration</title><description>Pose registration is critical in vision and robotics. This paper focuses on
the challenging task of initialization-free pose registration up to 7DoF for
homogeneous and heterogeneous measurements. While recent learning-based methods
show promise using differentiable solvers, they either rely on heuristically
defined correspondences or are prone to local minima. We present a
differentiable phase correlation (DPC) solver that is globally convergent and
correspondence-free. When combined with simple feature extraction networks, our
general framework DPCN++ allows for versatile pose registration with arbitrary
initialization. Specifically, the feature extraction networks first learn dense
feature grids from a pair of homogeneous/heterogeneous measurements. These
feature grids are then transformed into a translation and scale invariant
spectrum representation based on Fourier transform and spherical radial
aggregation, decoupling translation and scale from rotation. Next, the
rotation, scale, and translation are independently and efficiently estimated in
the spectrum step-by-step using the DPC solver. The entire pipeline is
differentiable and trained end-to-end. We evaluate DPCN++ on a wide range of
registration tasks taking different input modalities, including 2D bird's-eye
view images, 3D object and scene measurements, and medical images. Experimental
results demonstrate that DPCN++ outperforms both classical and learning-based
baselines, especially on partially observed and heterogeneous measurements.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj0tLw0AURmfjQqo_wJWzL4nzyNyZuJPUR6HUUEq3YdLc0cHYkZvg49_bRlcffBwOHMaupMgLZ4y48fQdP3OlBOTCWGHPWb2oq_V8fssXMQQkPIzRtz3y-tUPyKtEhL0fYzrwNY5fid54SMR3SMPxPXHpiG3wJQ4jTdwFOwu-H_Dyf2ds-3C_rZ6y1fPjsrpbZR6szaCwAKXz6FwwZaFMYWSrLARdapQoDEiQptuD0qpVwTq1t9A5X1rQQmGnZ-z6TzslNR8U3z39NKe0ZkrTv06bR6Y</recordid><startdate>20220612</startdate><enddate>20220612</enddate><creator>Chen, Zexi</creator><creator>Liao, Yiyi</creator><creator>Du, Haozhe</creator><creator>Zhang, Haodong</creator><creator>Xu, Xuecheng</creator><creator>Lu, Haojian</creator><creator>Xiong, Rong</creator><creator>Wang, Yue</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20220612</creationdate><title>DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration</title><author>Chen, Zexi ; Liao, Yiyi ; Du, Haozhe ; Zhang, Haodong ; Xu, Xuecheng ; Lu, Haojian ; Xiong, Rong ; Wang, Yue</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-6476698ae88f59425451b276f393e1e0561615dc6232b2f782c76d8a976302ed3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Chen, Zexi</creatorcontrib><creatorcontrib>Liao, Yiyi</creatorcontrib><creatorcontrib>Du, Haozhe</creatorcontrib><creatorcontrib>Zhang, Haodong</creatorcontrib><creatorcontrib>Xu, Xuecheng</creatorcontrib><creatorcontrib>Lu, Haojian</creatorcontrib><creatorcontrib>Xiong, 
Rong</creatorcontrib><creatorcontrib>Wang, Yue</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Chen, Zexi</au><au>Liao, Yiyi</au><au>Du, Haozhe</au><au>Zhang, Haodong</au><au>Xu, Xuecheng</au><au>Lu, Haojian</au><au>Xiong, Rong</au><au>Wang, Yue</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration</atitle><date>2022-06-12</date><risdate>2022</risdate><abstract>Pose registration is critical in vision and robotics. This paper focuses on
the challenging task of initialization-free pose registration up to 7DoF for
homogeneous and heterogeneous measurements. While recent learning-based methods
show promise using differentiable solvers, they either rely on heuristically
defined correspondences or are prone to local minima. We present a
differentiable phase correlation (DPC) solver that is globally convergent and
correspondence-free. When combined with simple feature extraction networks, our
general framework DPCN++ allows for versatile pose registration with arbitrary
initialization. Specifically, the feature extraction networks first learn dense
feature grids from a pair of homogeneous/heterogeneous measurements. These
feature grids are then transformed into a translation and scale invariant
spectrum representation based on Fourier transform and spherical radial
aggregation, decoupling translation and scale from rotation. Next, the
rotation, scale, and translation are independently and efficiently estimated in
the spectrum step-by-step using the DPC solver. The entire pipeline is
differentiable and trained end-to-end. We evaluate DPCN++ on a wide range of
registration tasks taking different input modalities, including 2D bird's-eye
view images, 3D object and scene measurements, and medical images. Experimental
results demonstrate that DPCN++ outperforms both classical and learning-based
baselines, especially on partially observed and heterogeneous measurements.</abstract><doi>10.48550/arxiv.2206.05707</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2206.05707 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2206_05707 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Robotics |
title | DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T08%3A38%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DPCN++:%20Differentiable%20Phase%20Correlation%20Network%20for%20Versatile%20Pose%20Registration&rft.au=Chen,%20Zexi&rft.date=2022-06-12&rft_id=info:doi/10.48550/arxiv.2206.05707&rft_dat=%3Carxiv_GOX%3E2206_05707%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
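
The description above mentions a translation-invariant spectrum and a phase correlation solver. As a rough illustration of the underlying idea only, the following is a minimal sketch of classical 2D phase correlation in NumPy. It is not the authors' DPC solver: it uses a hard `argmax` (which is not differentiable), handles only integer circular shifts, and the function name `phase_correlation` and the test images are assumptions made for this example.

```python
import numpy as np


def phase_correlation(a, b, eps=1e-12):
    """Estimate the circular shift (dy, dx) with b == np.roll(a, (dy, dx), axis=(0, 1)).

    Classical phase correlation; illustrative only, not the DPC solver
    described in the abstract.
    """
    fa = np.fft.fft2(a)
    fb = np.fft.fft2(b)
    # Normalized cross-power spectrum: keeps only the phase difference.
    cross = fb * np.conj(fa)
    cross = cross / (np.abs(cross) + eps)
    # The inverse transform of a pure phase ramp is a peak at the shift.
    corr = np.fft.ifft2(cross).real
    peak = np.unravel_index(np.argmax(corr), corr.shape)
    # Map peak indices in [0, n) to signed shifts in (-n/2, n/2].
    shift = tuple(int(p) if p <= n // 2 else int(p) - n
                  for p, n in zip(peak, corr.shape))
    return shift, corr


# Hypothetical test data: a random image and a circularly shifted copy.
rng = np.random.default_rng(0)
a = rng.random((64, 48))
b = np.roll(a, (5, -7), axis=(0, 1))

# The Fourier magnitude is unchanged by translation, which is why the
# magnitude spectrum can serve as a translation-invariant representation.
magnitudes_match = np.allclose(np.abs(np.fft.fft2(a)), np.abs(np.fft.fft2(b)))

shift, corr = phase_correlation(a, b)
print(shift, magnitudes_match)
```

A differentiable variant would typically replace the hard `argmax` with a soft expectation over the correlation surface; whether DPCN++ does exactly that is not stated in this record.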