Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation

Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the adve...

Full description

Saved in:
Bibliographic details
Published in: ACM transactions on intelligent systems and technology 2024-12
Main authors: Cheng, Ziming, Zhu, Yazhou, Wang, Shidong, Xin, Tong, Zhang, Haofeng
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue
container_start_page
container_title ACM transactions on intelligent systems and technology
container_volume
creator Cheng, Ziming
Zhu, Yazhou
Wang, Shidong
Xin, Tong
Zhang, Haofeng
description Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Encouragingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often suffer from significant bias caused by intra-class variations. To this end, we propose a method called Cascaded Alternating Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. This method focuses on capturing the commonality between foreground information of the support and query features using the Alternating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. In addition, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. Code is available at: https://github.com/zmcheng9/CART.
doi_str_mv 10.1145/3709145
format Article
fullrecord <record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3709145</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3709145</sourcerecordid><originalsourceid>FETCH-LOGICAL-a515-ee2e25937d1d1616586c22547b23e0050dce529520879494ca02dfddb8028c913</originalsourceid><addsrcrecordid>eNo9kEFLw0AQRhdRsNTi3dPePEV3JrtJ9liKtYWKYHMP291JjCQb2Q2I_96U1s7lDXxv5vAxdg_iCUCq5zQXeuIVmyGoPMk04PVlF_KWLWL8EtNIjRqKGduvTLTGkePLbqTgzdj6hn9Q3XrqyY-8DMbHegg9BT6Br-kniZ_DyN_ItdZ0fNubhviemqM-nQ_-jt3Upou0OHPOyvVLudoku_fX7Wq5S4wClRAhodJp7sBBBpkqMouoZH7AlIRQwllSqBWKItdSS2sEutq5QyGwsBrSOXs8vbVhiDFQXX2HtjfhtwJRHduozm1M5sPJNLa_SP_hH1xOWII</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</title><source>ACM Digital Library</source><creator>Cheng, Ziming ; Zhu, Yazhou ; Wang, Shidong ; Xin, Tong ; Zhang, Haofeng</creator><creatorcontrib>Cheng, Ziming ; Zhu, Yazhou ; Wang, Shidong ; Xin, Tong ; Zhang, Haofeng</creatorcontrib><description>Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often induce significant bias issues caused by intra-class variations. To this end, we propose a method called Cascaded Altering Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. 
This method focuses on capturing the commonality between foreground information of the support and query features using the Alterating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. Besides, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. Code is available at: https://github.com/zmcheng9/CART.</description><identifier>ISSN: 2157-6904</identifier><identifier>EISSN: 2157-6912</identifier><identifier>DOI: 10.1145/3709145</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computing methodologies ; Image segmentation</subject><ispartof>ACM transactions on intelligent systems and technology, 2024-12</ispartof><rights>Copyright held by the owner/author(s).</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a515-ee2e25937d1d1616586c22547b23e0050dce529520879494ca02dfddb8028c913</cites><orcidid>0000-0003-1023-1286 ; 0000-0001-5479-262X ; 0000-0002-4039-7618 ; 0000-0002-7537-5945 ; 0009-0006-4316-0903</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Cheng, Ziming</creatorcontrib><creatorcontrib>Zhu, 
Yazhou</creatorcontrib><creatorcontrib>Wang, Shidong</creatorcontrib><creatorcontrib>Xin, Tong</creatorcontrib><creatorcontrib>Zhang, Haofeng</creatorcontrib><title>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</title><title>ACM transactions on intelligent systems and technology</title><addtitle>ACM TIST</addtitle><description>Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often induce significant bias issues caused by intra-class variations. To this end, we propose a method called Cascaded Altering Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. This method focuses on capturing the commonality between foreground information of the support and query features using the Alterating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. Besides, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. 
Code is available at: https://github.com/zmcheng9/CART.</description><subject>Computing methodologies</subject><subject>Image segmentation</subject><issn>2157-6904</issn><issn>2157-6912</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNo9kEFLw0AQRhdRsNTi3dPePEV3JrtJ9liKtYWKYHMP291JjCQb2Q2I_96U1s7lDXxv5vAxdg_iCUCq5zQXeuIVmyGoPMk04PVlF_KWLWL8EtNIjRqKGduvTLTGkePLbqTgzdj6hn9Q3XrqyY-8DMbHegg9BT6Br-kniZ_DyN_ItdZ0fNubhviemqM-nQ_-jt3Upou0OHPOyvVLudoku_fX7Wq5S4wClRAhodJp7sBBBpkqMouoZH7AlIRQwllSqBWKItdSS2sEutq5QyGwsBrSOXs8vbVhiDFQXX2HtjfhtwJRHduozm1M5sPJNLa_SP_hH1xOWII</recordid><startdate>20241220</startdate><enddate>20241220</enddate><creator>Cheng, Ziming</creator><creator>Zhu, Yazhou</creator><creator>Wang, Shidong</creator><creator>Xin, Tong</creator><creator>Zhang, Haofeng</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-1023-1286</orcidid><orcidid>https://orcid.org/0000-0001-5479-262X</orcidid><orcidid>https://orcid.org/0000-0002-4039-7618</orcidid><orcidid>https://orcid.org/0000-0002-7537-5945</orcidid><orcidid>https://orcid.org/0009-0006-4316-0903</orcidid></search><sort><creationdate>20241220</creationdate><title>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</title><author>Cheng, Ziming ; Zhu, Yazhou ; Wang, Shidong ; Xin, Tong ; Zhang, Haofeng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a515-ee2e25937d1d1616586c22547b23e0050dce529520879494ca02dfddb8028c913</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computing methodologies</topic><topic>Image segmentation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cheng, Ziming</creatorcontrib><creatorcontrib>Zhu, Yazhou</creatorcontrib><creatorcontrib>Wang, 
Shidong</creatorcontrib><creatorcontrib>Xin, Tong</creatorcontrib><creatorcontrib>Zhang, Haofeng</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on intelligent systems and technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cheng, Ziming</au><au>Zhu, Yazhou</au><au>Wang, Shidong</au><au>Xin, Tong</au><au>Zhang, Haofeng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</atitle><jtitle>ACM transactions on intelligent systems and technology</jtitle><stitle>ACM TIST</stitle><date>2024-12-20</date><risdate>2024</risdate><issn>2157-6904</issn><eissn>2157-6912</eissn><abstract>Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often induce significant bias issues caused by intra-class variations. To this end, we propose a method called Cascaded Altering Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. This method focuses on capturing the commonality between foreground information of the support and query features using the Alterating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. 
Besides, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. Code is available at: https://github.com/zmcheng9/CART.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3709145</doi><orcidid>https://orcid.org/0000-0003-1023-1286</orcidid><orcidid>https://orcid.org/0000-0001-5479-262X</orcidid><orcidid>https://orcid.org/0000-0002-4039-7618</orcidid><orcidid>https://orcid.org/0000-0002-7537-5945</orcidid><orcidid>https://orcid.org/0009-0006-4316-0903</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2157-6904
ispartof ACM transactions on intelligent systems and technology, 2024-12
issn 2157-6904
2157-6912
language eng
recordid cdi_crossref_primary_10_1145_3709145
source ACM Digital Library
subjects Computing methodologies
Image segmentation
title Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T07%3A33%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Cascaded%20Alternating%20Refinement%20Transformer%20for%20Few-shot%20Medical%20Image%20Segmentation&rft.jtitle=ACM%20transactions%20on%20intelligent%20systems%20and%20technology&rft.au=Cheng,%20Ziming&rft.date=2024-12-20&rft.issn=2157-6904&rft.eissn=2157-6912&rft_id=info:doi/10.1145/3709145&rft_dat=%3Cacm_cross%3E3709145%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
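
The description field above mentions class prototypes derived from support features and an inference step that accumulates the masks predicted by every cascaded module. The following is a minimal, dependency-free sketch of those two ideas, not the authors' implementation (see the repository linked in the description for that). All function names are hypothetical. Masked average pooling and cosine similarity are the usual building blocks of prototypical networks and are assumed here. The record does not define the Rounding-Up strategy, so the reading used below (average the per-stage binary masks, then round up) is an assumption.

```python
import math


def masked_average_pooling(features, mask):
    """Average a C x H x W feature map (nested lists) over the foreground
    pixels of an H x W binary mask; returns a length-C prototype."""
    area = sum(sum(row) for row in mask)
    if area == 0:
        # No foreground pixels: fall back to an all-zero prototype.
        return [0.0] * len(features)
    return [
        sum(v * m for frow, mrow in zip(channel, mask) for v, m in zip(frow, mrow)) / area
        for channel in features
    ]


def cosine_similarity_map(features, prototype, eps=1e-8):
    """Cosine similarity between the prototype and the feature vector at
    every spatial position; returns an H x W map."""
    height, width = len(features[0]), len(features[0][0])
    proto_norm = math.sqrt(sum(p * p for p in prototype))
    sim = []
    for i in range(height):
        row = []
        for j in range(width):
            vec = [channel[i][j] for channel in features]
            dot = sum(v * p for v, p in zip(vec, prototype))
            norm = math.sqrt(sum(v * v for v in vec)) * proto_norm
            row.append(dot / (norm + eps))
        sim.append(row)
    return sim


def accumulate_masks_round_up(masks):
    """ASSUMPTION: one possible reading of 'Rounding-Up'. Average the binary
    masks from all stages and take the ceiling, so a pixel is foreground if
    any stage predicted it (equivalent to a union of the masks)."""
    n = len(masks)
    return [
        [math.ceil(sum(m[i][j] for m in masks) / n) for j in range(len(masks[0][0]))]
        for i in range(len(masks[0]))
    ]
```

In a cascaded setting, each stage would produce its own mask from a refined prototype, and `accumulate_masks_round_up` would be applied once to the list of all stage masks.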