Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation

Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the adve...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	ACM transactions on intelligent systems and technology 2024-12
Hauptverfasser:	Cheng, Ziming, Zhu, Yazhou, Wang, Shidong, Xin, Tong, Zhang, Haofeng
Format:	Artikel
Sprache:	eng
Schlagworte:	Computing methodologies Image segmentation
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	ACM transactions on intelligent systems and technology
container_volume
creator	Cheng, Ziming Zhu, Yazhou Wang, Shidong Xin, Tong Zhang, Haofeng
description	Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often induce significant bias issues caused by intra-class variations. To this end, we propose a method called Cascaded Altering Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. This method focuses on capturing the commonality between foreground information of the support and query features using the Alterating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. Besides, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. Code is available at: https://github.com/zmcheng9/CART.
doi_str_mv	10.1145/3709145
format	Article
fullrecord	<record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3709145</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3709145</sourcerecordid><originalsourceid>FETCH-LOGICAL-a515-ee2e25937d1d1616586c22547b23e0050dce529520879494ca02dfddb8028c913</originalsourceid><addsrcrecordid>eNo9kEFLw0AQRhdRsNTi3dPePEV3JrtJ9liKtYWKYHMP291JjCQb2Q2I_96U1s7lDXxv5vAxdg_iCUCq5zQXeuIVmyGoPMk04PVlF_KWLWL8EtNIjRqKGduvTLTGkePLbqTgzdj6hn9Q3XrqyY-8DMbHegg9BT6Br-kniZ_DyN_ItdZ0fNubhviemqM-nQ_-jt3Upou0OHPOyvVLudoku_fX7Wq5S4wClRAhodJp7sBBBpkqMouoZH7AlIRQwllSqBWKItdSS2sEutq5QyGwsBrSOXs8vbVhiDFQXX2HtjfhtwJRHduozm1M5sPJNLa_SP_hH1xOWII</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</title><source>ACM Digital Library</source><creator>Cheng, Ziming ; Zhu, Yazhou ; Wang, Shidong ; Xin, Tong ; Zhang, Haofeng</creator><creatorcontrib>Cheng, Ziming ; Zhu, Yazhou ; Wang, Shidong ; Xin, Tong ; Zhang, Haofeng</creatorcontrib><description>Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often induce significant bias issues caused by intra-class variations. To this end, we propose a method called Cascaded Altering Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. This method focuses on capturing the commonality between foreground information of the support and query features using the Alterating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. Besides, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. Code is available at: https://github.com/zmcheng9/CART.</description><identifier>ISSN: 2157-6904</identifier><identifier>EISSN: 2157-6912</identifier><identifier>DOI: 10.1145/3709145</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computing methodologies ; Image segmentation</subject><ispartof>ACM transactions on intelligent systems and technology, 2024-12</ispartof><rights>Copyright held by the owner/author(s).</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a515-ee2e25937d1d1616586c22547b23e0050dce529520879494ca02dfddb8028c913</cites><orcidid>0000-0003-1023-1286 ; 0000-0001-5479-262X ; 0000-0002-4039-7618 ; 0000-0002-7537-5945 ; 0009-0006-4316-0903</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Cheng, Ziming</creatorcontrib><creatorcontrib>Zhu, Yazhou</creatorcontrib><creatorcontrib>Wang, Shidong</creatorcontrib><creatorcontrib>Xin, Tong</creatorcontrib><creatorcontrib>Zhang, Haofeng</creatorcontrib><title>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</title><title>ACM transactions on intelligent systems and technology</title><addtitle>ACM TIST</addtitle><description>Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often induce significant bias issues caused by intra-class variations. To this end, we propose a method called Cascaded Altering Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. This method focuses on capturing the commonality between foreground information of the support and query features using the Alterating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. Besides, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. Code is available at: https://github.com/zmcheng9/CART.</description><subject>Computing methodologies</subject><subject>Image segmentation</subject><issn>2157-6904</issn><issn>2157-6912</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNo9kEFLw0AQRhdRsNTi3dPePEV3JrtJ9liKtYWKYHMP291JjCQb2Q2I_96U1s7lDXxv5vAxdg_iCUCq5zQXeuIVmyGoPMk04PVlF_KWLWL8EtNIjRqKGduvTLTGkePLbqTgzdj6hn9Q3XrqyY-8DMbHegg9BT6Br-kniZ_DyN_ItdZ0fNubhviemqM-nQ_-jt3Upou0OHPOyvVLudoku_fX7Wq5S4wClRAhodJp7sBBBpkqMouoZH7AlIRQwllSqBWKItdSS2sEutq5QyGwsBrSOXs8vbVhiDFQXX2HtjfhtwJRHduozm1M5sPJNLa_SP_hH1xOWII</recordid><startdate>20241220</startdate><enddate>20241220</enddate><creator>Cheng, Ziming</creator><creator>Zhu, Yazhou</creator><creator>Wang, Shidong</creator><creator>Xin, Tong</creator><creator>Zhang, Haofeng</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-1023-1286</orcidid><orcidid>https://orcid.org/0000-0001-5479-262X</orcidid><orcidid>https://orcid.org/0000-0002-4039-7618</orcidid><orcidid>https://orcid.org/0000-0002-7537-5945</orcidid><orcidid>https://orcid.org/0009-0006-4316-0903</orcidid></search><sort><creationdate>20241220</creationdate><title>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</title><author>Cheng, Ziming ; Zhu, Yazhou ; Wang, Shidong ; Xin, Tong ; Zhang, Haofeng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a515-ee2e25937d1d1616586c22547b23e0050dce529520879494ca02dfddb8028c913</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computing methodologies</topic><topic>Image segmentation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cheng, Ziming</creatorcontrib><creatorcontrib>Zhu, Yazhou</creatorcontrib><creatorcontrib>Wang, Shidong</creatorcontrib><creatorcontrib>Xin, Tong</creatorcontrib><creatorcontrib>Zhang, Haofeng</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on intelligent systems and technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cheng, Ziming</au><au>Zhu, Yazhou</au><au>Wang, Shidong</au><au>Xin, Tong</au><au>Zhang, Haofeng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation</atitle><jtitle>ACM transactions on intelligent systems and technology</jtitle><stitle>ACM TIST</stitle><date>2024-12-20</date><risdate>2024</risdate><issn>2157-6904</issn><eissn>2157-6912</eissn><abstract>Conventional biomedical image segmentation heavily relies on substantial annotations, which demand significant human and financial resources for collection. Consequently, learning a model with excellent performance using limited medical image data becomes a challenging problem. Upliftingly, the advent of few-shot medical image segmentation (FSMIS) offers a potential solution. Although prototypical networks are commonly employed in existing FSMIS tasks, the prototypes derived from support features often induce significant bias issues caused by intra-class variations. To this end, we propose a method called Cascaded Altering Refinement Transformer (CART) to iteratively calibrate the prototypes with both support and query features. This method focuses on capturing the commonality between foreground information of the support and query features using the Alterating Refinement Transformer (ART) module, which includes two Multi-head Cross Attention (MCA) modules. Furthermore, we cascade ART modules to refine the class prototypes, resulting in representative prototypes. This process ultimately contributes to a more accurate predicted mask. Besides, to preserve more valid information in each cascaded ART module and achieve better performance, we propose a novel inference method that accumulates the predicted segmentation masks in all ART modules by applying the Rounding-Up strategy. Extensive experiments on three public medical image datasets demonstrate that our model outperforms the state-of-the-art methods, and detailed analysis also validates the reasonableness of this design. Code is available at: https://github.com/zmcheng9/CART.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3709145</doi><orcidid>https://orcid.org/0000-0003-1023-1286</orcidid><orcidid>https://orcid.org/0000-0001-5479-262X</orcidid><orcidid>https://orcid.org/0000-0002-4039-7618</orcidid><orcidid>https://orcid.org/0000-0002-7537-5945</orcidid><orcidid>https://orcid.org/0009-0006-4316-0903</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2157-6904
ispartof	ACM transactions on intelligent systems and technology, 2024-12
issn	2157-6904 2157-6912
language	eng
recordid	cdi_crossref_primary_10_1145_3709145
source	ACM Digital Library
subjects	Computing methodologies Image segmentation
title	Cascaded Alternating Refinement Transformer for Few-shot Medical Image Segmentation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T07%3A33%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Cascaded%20Alternating%20Refinement%20Transformer%20for%20Few-shot%20Medical%20Image%20Segmentation&rft.jtitle=ACM%20transactions%20on%20intelligent%20systems%20and%20technology&rft.au=Cheng,%20Ziming&rft.date=2024-12-20&rft.issn=2157-6904&rft.eissn=2157-6912&rft_id=info:doi/10.1145/3709145&rft_dat=%3Cacm_cross%3E3709145%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true