DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction

We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Shim, Jaehyeok, Joo, Kyungdon
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Shim, Jaehyeok Joo, Kyungdon
description	We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.
doi_str_mv	10.48550/arxiv.2403.05005
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2403_05005</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2403_05005</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-40b2f1fc126be257118c20e31d2037be37df184c73302b3c1cbeffb1f8c3cd6d3</originalsourceid><addsrcrecordid>eNotz71OwzAYhWEvDKjlApjwDSTY_uI4YqsafiIiVaq8R_6tLKV25LgI7h4oLOfdjvQgdE9J3XSck0eVP8NHzRoCNeGE8Fv03g9SHp5wf1EzVtHiIRZ3yqo4i8efjQXLtKQ5nYJbsU8ZD-dlDiYUDD0-OpPiWvLFlJDiFt14Na_u7r8bJF-e5f6tGg-vw343VqoVvGqIZp56Q1mrHeOC0s4w4oBaRkBoB8J62jVGABCmwVCjnfea-s6Asa2FDXr4u71qpiWHs8pf069quqrgG9ORRyw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</title><source>arXiv.org</source><creator>Shim, Jaehyeok ; Joo, Kyungdon</creator><creatorcontrib>Shim, Jaehyeok ; Joo, Kyungdon</creatorcontrib><description>We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.</description><identifier>DOI: 10.48550/arxiv.2403.05005</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-03</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2403.05005$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2403.05005$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Shim, Jaehyeok</creatorcontrib><creatorcontrib>Joo, Kyungdon</creatorcontrib><title>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</title><description>We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAYhWEvDKjlApjwDSTY_uI4YqsafiIiVaq8R_6tLKV25LgI7h4oLOfdjvQgdE9J3XSck0eVP8NHzRoCNeGE8Fv03g9SHp5wf1EzVtHiIRZ3yqo4i8efjQXLtKQ5nYJbsU8ZD-dlDiYUDD0-OpPiWvLFlJDiFt14Na_u7r8bJF-e5f6tGg-vw343VqoVvGqIZp56Q1mrHeOC0s4w4oBaRkBoB8J62jVGABCmwVCjnfea-s6Asa2FDXr4u71qpiWHs8pf069quqrgG9ORRyw</recordid><startdate>20240307</startdate><enddate>20240307</enddate><creator>Shim, Jaehyeok</creator><creator>Joo, Kyungdon</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240307</creationdate><title>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</title><author>Shim, Jaehyeok ; Joo, Kyungdon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-40b2f1fc126be257118c20e31d2037be37df184c73302b3c1cbeffb1f8c3cd6d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Shim, Jaehyeok</creatorcontrib><creatorcontrib>Joo, Kyungdon</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shim, Jaehyeok</au><au>Joo, Kyungdon</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</atitle><date>2024-03-07</date><risdate>2024</risdate><abstract>We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.</abstract><doi>10.48550/arxiv.2403.05005</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2403.05005
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2403_05005
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T07%3A11%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DITTO:%20Dual%20and%20Integrated%20Latent%20Topologies%20for%20Implicit%203D%20Reconstruction&rft.au=Shim,%20Jaehyeok&rft.date=2024-03-07&rft_id=info:doi/10.48550/arxiv.2403.05005&rft_dat=%3Carxiv_GOX%3E2403_05005%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true