Denoising Diffusions in Latent Space for Medical Image Segmentation

Diffusion models (DPMs) have demonstrated remarkable performance in image generation, often times outperforming other generative models. Since their introduction, the powerful noise-to-image denoising pipeline has been extended to various discriminative tasks, including image segmentation. In case o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2024-07
Hauptverfasser: Fahim Ahmed Zaman, Mathews, Jacob, Chang, Amanda, Liu, Kan, Sonka, Milan, Wu, Xiaodong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Fahim Ahmed Zaman
Mathews, Jacob
Chang, Amanda
Liu, Kan
Sonka, Milan
Wu, Xiaodong
description Diffusion models (DPMs) have demonstrated remarkable performance in image generation, often times outperforming other generative models. Since their introduction, the powerful noise-to-image denoising pipeline has been extended to various discriminative tasks, including image segmentation. In case of medical imaging, often times the images are large 3D scans, where segmenting one image using DPMs become extremely inefficient due to large memory consumption and time consuming iterative sampling process. In this work, we propose a novel conditional generative modeling framework (LDSeg) that performs diffusion in latent space for medical image segmentation. Our proposed framework leverages the learned inherent low-dimensional latent distribution of the target object shapes and source image embeddings. The conditional diffusion in latent space not only ensures accurate n-D image segmentation for multi-label objects, but also mitigates the major underlying problems of the traditional DPM based segmentation: (1) large memory consumption, (2) time consuming sampling process and (3) unnatural noise injection in forward/reverse process. LDSeg achieved state-of-the-art segmentation accuracy on three medical image datasets with different imaging modalities. Furthermore, we show that our proposed model is significantly more robust to noises, compared to the traditional deterministic segmentation models, which can be potential in solving the domain shift problems in the medical imaging domain. Codes are available at: https://github.com/LDSeg/LDSeg.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3082703870</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3082703870</sourcerecordid><originalsourceid>FETCH-proquest_journals_30827038703</originalsourceid><addsrcrecordid>eNqNikELgjAYQEcQJOV_-KCzsDZN71oU1MnuMuzbmOhmfvP_t0M_oMPjHd7bsERIecqqXIgdS4kGzrk4l6IoZMLqBp23ZJ2Bxmq9kvWOwDp4qIAuQDurHkH7BZ74tr0a4T4pg9CimWJXIf4HttVqJEx_3rPj9fKqb9m8-M-KFLrBr4uLqZO8EiWXVeS_6wvIAjnA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3082703870</pqid></control><display><type>article</type><title>Denoising Diffusions in Latent Space for Medical Image Segmentation</title><source>Free E- Journals</source><creator>Fahim Ahmed Zaman ; Mathews, Jacob ; Chang, Amanda ; Liu, Kan ; Sonka, Milan ; Wu, Xiaodong</creator><creatorcontrib>Fahim Ahmed Zaman ; Mathews, Jacob ; Chang, Amanda ; Liu, Kan ; Sonka, Milan ; Wu, Xiaodong</creatorcontrib><description>Diffusion models (DPMs) have demonstrated remarkable performance in image generation, often times outperforming other generative models. Since their introduction, the powerful noise-to-image denoising pipeline has been extended to various discriminative tasks, including image segmentation. In case of medical imaging, often times the images are large 3D scans, where segmenting one image using DPMs become extremely inefficient due to large memory consumption and time consuming iterative sampling process. In this work, we propose a novel conditional generative modeling framework (LDSeg) that performs diffusion in latent space for medical image segmentation. Our proposed framework leverages the learned inherent low-dimensional latent distribution of the target object shapes and source image embeddings. The conditional diffusion in latent space not only ensures accurate n-D image segmentation for multi-label objects, but also mitigates the major underlying problems of the traditional DPM based segmentation: (1) large memory consumption, (2) time consuming sampling process and (3) unnatural noise injection in forward/reverse process. LDSeg achieved state-of-the-art segmentation accuracy on three medical image datasets with different imaging modalities. Furthermore, we show that our proposed model is significantly more robust to noises, compared to the traditional deterministic segmentation models, which can be potential in solving the domain shift problems in the medical imaging domain. Codes are available at: https://github.com/LDSeg/LDSeg.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Consumption ; Diffusion ; Image processing ; Image segmentation ; Medical imaging ; Noise reduction ; Sampling</subject><ispartof>arXiv.org, 2024-07</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Fahim Ahmed Zaman</creatorcontrib><creatorcontrib>Mathews, Jacob</creatorcontrib><creatorcontrib>Chang, Amanda</creatorcontrib><creatorcontrib>Liu, Kan</creatorcontrib><creatorcontrib>Sonka, Milan</creatorcontrib><creatorcontrib>Wu, Xiaodong</creatorcontrib><title>Denoising Diffusions in Latent Space for Medical Image Segmentation</title><title>arXiv.org</title><description>Diffusion models (DPMs) have demonstrated remarkable performance in image generation, often times outperforming other generative models. Since their introduction, the powerful noise-to-image denoising pipeline has been extended to various discriminative tasks, including image segmentation. In case of medical imaging, often times the images are large 3D scans, where segmenting one image using DPMs become extremely inefficient due to large memory consumption and time consuming iterative sampling process. In this work, we propose a novel conditional generative modeling framework (LDSeg) that performs diffusion in latent space for medical image segmentation. Our proposed framework leverages the learned inherent low-dimensional latent distribution of the target object shapes and source image embeddings. The conditional diffusion in latent space not only ensures accurate n-D image segmentation for multi-label objects, but also mitigates the major underlying problems of the traditional DPM based segmentation: (1) large memory consumption, (2) time consuming sampling process and (3) unnatural noise injection in forward/reverse process. LDSeg achieved state-of-the-art segmentation accuracy on three medical image datasets with different imaging modalities. Furthermore, we show that our proposed model is significantly more robust to noises, compared to the traditional deterministic segmentation models, which can be potential in solving the domain shift problems in the medical imaging domain. Codes are available at: https://github.com/LDSeg/LDSeg.</description><subject>Consumption</subject><subject>Diffusion</subject><subject>Image processing</subject><subject>Image segmentation</subject><subject>Medical imaging</subject><subject>Noise reduction</subject><subject>Sampling</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNikELgjAYQEcQJOV_-KCzsDZN71oU1MnuMuzbmOhmfvP_t0M_oMPjHd7bsERIecqqXIgdS4kGzrk4l6IoZMLqBp23ZJ2Bxmq9kvWOwDp4qIAuQDurHkH7BZ74tr0a4T4pg9CimWJXIf4HttVqJEx_3rPj9fKqb9m8-M-KFLrBr4uLqZO8EiWXVeS_6wvIAjnA</recordid><startdate>20240717</startdate><enddate>20240717</enddate><creator>Fahim Ahmed Zaman</creator><creator>Mathews, Jacob</creator><creator>Chang, Amanda</creator><creator>Liu, Kan</creator><creator>Sonka, Milan</creator><creator>Wu, Xiaodong</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240717</creationdate><title>Denoising Diffusions in Latent Space for Medical Image Segmentation</title><author>Fahim Ahmed Zaman ; Mathews, Jacob ; Chang, Amanda ; Liu, Kan ; Sonka, Milan ; Wu, Xiaodong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_30827038703</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Consumption</topic><topic>Diffusion</topic><topic>Image processing</topic><topic>Image segmentation</topic><topic>Medical imaging</topic><topic>Noise reduction</topic><topic>Sampling</topic><toplevel>online_resources</toplevel><creatorcontrib>Fahim Ahmed Zaman</creatorcontrib><creatorcontrib>Mathews, Jacob</creatorcontrib><creatorcontrib>Chang, Amanda</creatorcontrib><creatorcontrib>Liu, Kan</creatorcontrib><creatorcontrib>Sonka, Milan</creatorcontrib><creatorcontrib>Wu, Xiaodong</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fahim Ahmed Zaman</au><au>Mathews, Jacob</au><au>Chang, Amanda</au><au>Liu, Kan</au><au>Sonka, Milan</au><au>Wu, Xiaodong</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Denoising Diffusions in Latent Space for Medical Image Segmentation</atitle><jtitle>arXiv.org</jtitle><date>2024-07-17</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Diffusion models (DPMs) have demonstrated remarkable performance in image generation, often times outperforming other generative models. Since their introduction, the powerful noise-to-image denoising pipeline has been extended to various discriminative tasks, including image segmentation. In case of medical imaging, often times the images are large 3D scans, where segmenting one image using DPMs become extremely inefficient due to large memory consumption and time consuming iterative sampling process. In this work, we propose a novel conditional generative modeling framework (LDSeg) that performs diffusion in latent space for medical image segmentation. Our proposed framework leverages the learned inherent low-dimensional latent distribution of the target object shapes and source image embeddings. The conditional diffusion in latent space not only ensures accurate n-D image segmentation for multi-label objects, but also mitigates the major underlying problems of the traditional DPM based segmentation: (1) large memory consumption, (2) time consuming sampling process and (3) unnatural noise injection in forward/reverse process. LDSeg achieved state-of-the-art segmentation accuracy on three medical image datasets with different imaging modalities. Furthermore, we show that our proposed model is significantly more robust to noises, compared to the traditional deterministic segmentation models, which can be potential in solving the domain shift problems in the medical imaging domain. Codes are available at: https://github.com/LDSeg/LDSeg.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-07
issn 2331-8422
language eng
recordid cdi_proquest_journals_3082703870
source Free E- Journals
subjects Consumption
Diffusion
Image processing
Image segmentation
Medical imaging
Noise reduction
Sampling
title Denoising Diffusions in Latent Space for Medical Image Segmentation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T21%3A31%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Denoising%20Diffusions%20in%20Latent%20Space%20for%20Medical%20Image%20Segmentation&rft.jtitle=arXiv.org&rft.au=Fahim%20Ahmed%20Zaman&rft.date=2024-07-17&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3082703870%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3082703870&rft_id=info:pmid/&rfr_iscdi=true