Oral-3Dv2: 3D Oral Reconstruction from Panoramic X-Ray Imaging with Implicit Neural Representation

3D reconstruction of medical imaging from 2D images has become an increasingly interesting topic with the development of deep learning models in recent years. Previous studies in 3D reconstruction from limited X-ray images mainly rely on learning from paired 2D and 3D images, where the reconstructio...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Song, Weinan, Zheng, Haoxin, Tu, Dezhan, Liang, Chengwen, He, Lei
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Song, Weinan Zheng, Haoxin Tu, Dezhan Liang, Chengwen He, Lei
description	3D reconstruction of medical imaging from 2D images has become an increasingly interesting topic with the development of deep learning models in recent years. Previous studies in 3D reconstruction from limited X-ray images mainly rely on learning from paired 2D and 3D images, where the reconstruction quality relies on the scale and variation of collected data. This has brought significant challenges in the collection of training data, as only a tiny fraction of patients take two types of radiation examinations in the same period. Although simulation from higher-dimension images could solve this problem, the variance between real and simulated data could bring great uncertainty at the same time. In oral reconstruction, the situation becomes more challenging as only a single panoramic X-ray image is available, where models need to infer the curved shape by prior individual knowledge. To overcome these limitations, we propose Oral-3Dv2 to solve this cross-dimension translation problem in dental healthcare by learning solely on projection information, i.e., the projection image and trajectory of the X-ray tube. Our model learns to represent the 3D oral structure in an implicit way by mapping 2D coordinates into density values of voxels in the 3D space. To improve efficiency and effectiveness, we utilize a multi-head model that predicts a bunch of voxel values in 3D space simultaneously from a 2D coordinate in the axial plane and the dynamic sampling strategy to refine details of the density distribution in the reconstruction result. Extensive experiments in simulated and real data show that our model significantly outperforms existing state-of-the-art models without learning from paired images or prior individual knowledge. To the best of our knowledge, this is the first work of a non-adversarial-learning-based model in 3D radiology reconstruction from a single panoramic X-ray image.
doi_str_mv	10.48550/arxiv.2303.12123
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2303_12123</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2303_12123</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-1e9e26ea88b39eaf40a18e8795d419f7901d1b17bfe347a139013e7062f228d53</originalsourceid><addsrcrecordid>eNotj71ugzAURr10qNI-QKf6BaC2L2DTrUr6EylqqihDN3SB69QSGGRI2rx9Q5Lp0_mGIx3GHqSIE5Om4gnDnzvECgTEUkkFt6xcB2wiWBzUM4cFn4hvqOr8MIZ9NbrOcxu6ln-h7wK2ruLf0QaPfNnizvkd_3Xjzwn6xlVu5J-0vwj6QAP5ESfBHbux2Ax0f90Z2769bucf0Wr9vpy_rCLMNESSclIZoTEl5IQ2ESgNGZ2ndSJzq3Mha1lKXVqCRKOE0wGkRaasUqZOYcYeL9pzZNEH12I4FlNscY6Ff4MLTo8</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Oral-3Dv2: 3D Oral Reconstruction from Panoramic X-Ray Imaging with Implicit Neural Representation</title><source>arXiv.org</source><creator>Song, Weinan ; Zheng, Haoxin ; Tu, Dezhan ; Liang, Chengwen ; He, Lei</creator><creatorcontrib>Song, Weinan ; Zheng, Haoxin ; Tu, Dezhan ; Liang, Chengwen ; He, Lei</creatorcontrib><description>3D reconstruction of medical imaging from 2D images has become an increasingly interesting topic with the development of deep learning models in recent years. Previous studies in 3D reconstruction from limited X-ray images mainly rely on learning from paired 2D and 3D images, where the reconstruction quality relies on the scale and variation of collected data. This has brought significant challenges in the collection of training data, as only a tiny fraction of patients take two types of radiation examinations in the same period. Although simulation from higher-dimension images could solve this problem, the variance between real and simulated data could bring great uncertainty at the same time. In oral reconstruction, the situation becomes more challenging as only a single panoramic X-ray image is available, where models need to infer the curved shape by prior individual knowledge. To overcome these limitations, we propose Oral-3Dv2 to solve this cross-dimension translation problem in dental healthcare by learning solely on projection information, i.e., the projection image and trajectory of the X-ray tube. Our model learns to represent the 3D oral structure in an implicit way by mapping 2D coordinates into density values of voxels in the 3D space. To improve efficiency and effectiveness, we utilize a multi-head model that predicts a bunch of voxel values in 3D space simultaneously from a 2D coordinate in the axial plane and the dynamic sampling strategy to refine details of the density distribution in the reconstruction result. Extensive experiments in simulated and real data show that our model significantly outperforms existing state-of-the-art models without learning from paired images or prior individual knowledge. To the best of our knowledge, this is the first work of a non-adversarial-learning-based model in 3D radiology reconstruction from a single panoramic X-ray image.</description><identifier>DOI: 10.48550/arxiv.2303.12123</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-03</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2303.12123$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2303.12123$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Song, Weinan</creatorcontrib><creatorcontrib>Zheng, Haoxin</creatorcontrib><creatorcontrib>Tu, Dezhan</creatorcontrib><creatorcontrib>Liang, Chengwen</creatorcontrib><creatorcontrib>He, Lei</creatorcontrib><title>Oral-3Dv2: 3D Oral Reconstruction from Panoramic X-Ray Imaging with Implicit Neural Representation</title><description>3D reconstruction of medical imaging from 2D images has become an increasingly interesting topic with the development of deep learning models in recent years. Previous studies in 3D reconstruction from limited X-ray images mainly rely on learning from paired 2D and 3D images, where the reconstruction quality relies on the scale and variation of collected data. This has brought significant challenges in the collection of training data, as only a tiny fraction of patients take two types of radiation examinations in the same period. Although simulation from higher-dimension images could solve this problem, the variance between real and simulated data could bring great uncertainty at the same time. In oral reconstruction, the situation becomes more challenging as only a single panoramic X-ray image is available, where models need to infer the curved shape by prior individual knowledge. To overcome these limitations, we propose Oral-3Dv2 to solve this cross-dimension translation problem in dental healthcare by learning solely on projection information, i.e., the projection image and trajectory of the X-ray tube. Our model learns to represent the 3D oral structure in an implicit way by mapping 2D coordinates into density values of voxels in the 3D space. To improve efficiency and effectiveness, we utilize a multi-head model that predicts a bunch of voxel values in 3D space simultaneously from a 2D coordinate in the axial plane and the dynamic sampling strategy to refine details of the density distribution in the reconstruction result. Extensive experiments in simulated and real data show that our model significantly outperforms existing state-of-the-art models without learning from paired images or prior individual knowledge. To the best of our knowledge, this is the first work of a non-adversarial-learning-based model in 3D radiology reconstruction from a single panoramic X-ray image.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj71ugzAURr10qNI-QKf6BaC2L2DTrUr6EylqqihDN3SB69QSGGRI2rx9Q5Lp0_mGIx3GHqSIE5Om4gnDnzvECgTEUkkFt6xcB2wiWBzUM4cFn4hvqOr8MIZ9NbrOcxu6ln-h7wK2ruLf0QaPfNnizvkd_3Xjzwn6xlVu5J-0vwj6QAP5ESfBHbux2Ax0f90Z2769bucf0Wr9vpy_rCLMNESSclIZoTEl5IQ2ESgNGZ2ndSJzq3Mha1lKXVqCRKOE0wGkRaasUqZOYcYeL9pzZNEH12I4FlNscY6Ff4MLTo8</recordid><startdate>20230321</startdate><enddate>20230321</enddate><creator>Song, Weinan</creator><creator>Zheng, Haoxin</creator><creator>Tu, Dezhan</creator><creator>Liang, Chengwen</creator><creator>He, Lei</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230321</creationdate><title>Oral-3Dv2: 3D Oral Reconstruction from Panoramic X-Ray Imaging with Implicit Neural Representation</title><author>Song, Weinan ; Zheng, Haoxin ; Tu, Dezhan ; Liang, Chengwen ; He, Lei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-1e9e26ea88b39eaf40a18e8795d419f7901d1b17bfe347a139013e7062f228d53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Song, Weinan</creatorcontrib><creatorcontrib>Zheng, Haoxin</creatorcontrib><creatorcontrib>Tu, Dezhan</creatorcontrib><creatorcontrib>Liang, Chengwen</creatorcontrib><creatorcontrib>He, Lei</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Song, Weinan</au><au>Zheng, Haoxin</au><au>Tu, Dezhan</au><au>Liang, Chengwen</au><au>He, Lei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Oral-3Dv2: 3D Oral Reconstruction from Panoramic X-Ray Imaging with Implicit Neural Representation</atitle><date>2023-03-21</date><risdate>2023</risdate><abstract>3D reconstruction of medical imaging from 2D images has become an increasingly interesting topic with the development of deep learning models in recent years. Previous studies in 3D reconstruction from limited X-ray images mainly rely on learning from paired 2D and 3D images, where the reconstruction quality relies on the scale and variation of collected data. This has brought significant challenges in the collection of training data, as only a tiny fraction of patients take two types of radiation examinations in the same period. Although simulation from higher-dimension images could solve this problem, the variance between real and simulated data could bring great uncertainty at the same time. In oral reconstruction, the situation becomes more challenging as only a single panoramic X-ray image is available, where models need to infer the curved shape by prior individual knowledge. To overcome these limitations, we propose Oral-3Dv2 to solve this cross-dimension translation problem in dental healthcare by learning solely on projection information, i.e., the projection image and trajectory of the X-ray tube. Our model learns to represent the 3D oral structure in an implicit way by mapping 2D coordinates into density values of voxels in the 3D space. To improve efficiency and effectiveness, we utilize a multi-head model that predicts a bunch of voxel values in 3D space simultaneously from a 2D coordinate in the axial plane and the dynamic sampling strategy to refine details of the density distribution in the reconstruction result. Extensive experiments in simulated and real data show that our model significantly outperforms existing state-of-the-art models without learning from paired images or prior individual knowledge. To the best of our knowledge, this is the first work of a non-adversarial-learning-based model in 3D radiology reconstruction from a single panoramic X-ray image.</abstract><doi>10.48550/arxiv.2303.12123</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2303.12123
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2303_12123
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	Oral-3Dv2: 3D Oral Reconstruction from Panoramic X-Ray Imaging with Implicit Neural Representation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T20%3A44%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Oral-3Dv2:%203D%20Oral%20Reconstruction%20from%20Panoramic%20X-Ray%20Imaging%20with%20Implicit%20Neural%20Representation&rft.au=Song,%20Weinan&rft.date=2023-03-21&rft_id=info:doi/10.48550/arxiv.2303.12123&rft_dat=%3Carxiv_GOX%3E2303_12123%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true