DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction

We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Shim, Jaehyeok, Joo, Kyungdon
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Shim, Jaehyeok
Joo, Kyungdon
description We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.
doi_str_mv 10.48550/arxiv.2403.05005
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2403_05005</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2403_05005</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-40b2f1fc126be257118c20e31d2037be37df184c73302b3c1cbeffb1f8c3cd6d3</originalsourceid><addsrcrecordid>eNotz71OwzAYhWEvDKjlApjwDSTY_uI4YqsafiIiVaq8R_6tLKV25LgI7h4oLOfdjvQgdE9J3XSck0eVP8NHzRoCNeGE8Fv03g9SHp5wf1EzVtHiIRZ3yqo4i8efjQXLtKQ5nYJbsU8ZD-dlDiYUDD0-OpPiWvLFlJDiFt14Na_u7r8bJF-e5f6tGg-vw343VqoVvGqIZp56Q1mrHeOC0s4w4oBaRkBoB8J62jVGABCmwVCjnfea-s6Asa2FDXr4u71qpiWHs8pf069quqrgG9ORRyw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</title><source>arXiv.org</source><creator>Shim, Jaehyeok ; Joo, Kyungdon</creator><creatorcontrib>Shim, Jaehyeok ; Joo, Kyungdon</creatorcontrib><description>We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.</description><identifier>DOI: 10.48550/arxiv.2403.05005</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-03</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2403.05005$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2403.05005$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Shim, Jaehyeok</creatorcontrib><creatorcontrib>Joo, Kyungdon</creatorcontrib><title>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</title><description>We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAYhWEvDKjlApjwDSTY_uI4YqsafiIiVaq8R_6tLKV25LgI7h4oLOfdjvQgdE9J3XSck0eVP8NHzRoCNeGE8Fv03g9SHp5wf1EzVtHiIRZ3yqo4i8efjQXLtKQ5nYJbsU8ZD-dlDiYUDD0-OpPiWvLFlJDiFt14Na_u7r8bJF-e5f6tGg-vw343VqoVvGqIZp56Q1mrHeOC0s4w4oBaRkBoB8J62jVGABCmwVCjnfea-s6Asa2FDXr4u71qpiWHs8pf069quqrgG9ORRyw</recordid><startdate>20240307</startdate><enddate>20240307</enddate><creator>Shim, Jaehyeok</creator><creator>Joo, Kyungdon</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240307</creationdate><title>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</title><author>Shim, Jaehyeok ; Joo, Kyungdon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-40b2f1fc126be257118c20e31d2037be37df184c73302b3c1cbeffb1f8c3cd6d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Shim, Jaehyeok</creatorcontrib><creatorcontrib>Joo, Kyungdon</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shim, Jaehyeok</au><au>Joo, Kyungdon</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction</atitle><date>2024-03-07</date><risdate>2024</risdate><abstract>We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.</abstract><doi>10.48550/arxiv.2403.05005</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2403.05005
ispartof
issn
language eng
recordid cdi_arxiv_primary_2403_05005
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T07%3A11%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DITTO:%20Dual%20and%20Integrated%20Latent%20Topologies%20for%20Implicit%203D%20Reconstruction&rft.au=Shim,%20Jaehyeok&rft.date=2024-03-07&rft_id=info:doi/10.48550/arxiv.2403.05005&rft_dat=%3Carxiv_GOX%3E2403_05005%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true