A multi-scale large kernel attention with U-Net for medical image registration

Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of supercomputing 2025, Vol.81 (1), Article 70
Hauptverfasser: Chen, Yilin, Hu, Xin, Lu, Tao, Zou, Lu, Liao, Xiangyun
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page
container_title The Journal of supercomputing
container_volume 81
creator Chen, Yilin
Hu, Xin
Lu, Tao
Zou, Lu
Liao, Xiangyun
description Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method.
doi_str_mv 10.1007/s11227-024-06489-9
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3120031260</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3120031260</sourcerecordid><originalsourceid>FETCH-LOGICAL-c200t-65cdc2a24b203e1f5e3c58ac19741e3d1550ccd2ea2a803cba03d21f61ec55bd3</originalsourceid><addsrcrecordid>eNp9kLtOAzEQRS0EEiHwA1SWqA3j1z7KKOIlRaEhteV4Z5cN-wi2I8Tf47BIdDQzzbl3RoeQaw63HCC_C5wLkTMQikGmipKVJ2TGdS4ZqEKdkhmUAlihlTgnFyHsAEDJXM7IekH7QxdbFpztkHbWN0jf0Q_YURsjDrEdB_rZxje6YWuMtB497bFqE07b3ibaY9OG6O2RvCRnte0CXv3uOdk83L8un9jq5fF5uVgxJwAiy7SrnLBCbQVI5LVG6XRhHS9zxVFWXGtwrhJohS1Auq0FWQleZxyd1ttKzsnN1Lv348cBQzS78eCHdNJInk6kkUGixEQ5P4bgsTZ7n372X4aDOXozkzeTvJkfb6ZMITmFQoKHBv1f9T-pb9WmcFA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3120031260</pqid></control><display><type>article</type><title>A multi-scale large kernel attention with U-Net for medical image registration</title><source>SpringerNature Journals</source><creator>Chen, Yilin ; Hu, Xin ; Lu, Tao ; Zou, Lu ; Liao, Xiangyun</creator><creatorcontrib>Chen, Yilin ; Hu, Xin ; Lu, Tao ; Zou, Lu ; Liao, Xiangyun</creatorcontrib><description>Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method.</description><identifier>ISSN: 0920-8542</identifier><identifier>EISSN: 1573-0484</identifier><identifier>DOI: 10.1007/s11227-024-06489-9</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Artificial neural networks ; Compilers ; Computer Science ; Computer vision ; Datasets ; Deformation ; Formability ; Image acquisition ; Image registration ; Interpreters ; Medical imaging ; Medical research ; Modelling ; Modules ; Multilayers ; Parameter uncertainty ; Processor Architectures ; Programming Languages ; Registration ; Uncertainty analysis</subject><ispartof>The Journal of supercomputing, 2025, Vol.81 (1), Article 70</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c200t-65cdc2a24b203e1f5e3c58ac19741e3d1550ccd2ea2a803cba03d21f61ec55bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11227-024-06489-9$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11227-024-06489-9$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Chen, Yilin</creatorcontrib><creatorcontrib>Hu, Xin</creatorcontrib><creatorcontrib>Lu, Tao</creatorcontrib><creatorcontrib>Zou, Lu</creatorcontrib><creatorcontrib>Liao, Xiangyun</creatorcontrib><title>A multi-scale large kernel attention with U-Net for medical image registration</title><title>The Journal of supercomputing</title><addtitle>J Supercomput</addtitle><description>Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method.</description><subject>Artificial neural networks</subject><subject>Compilers</subject><subject>Computer Science</subject><subject>Computer vision</subject><subject>Datasets</subject><subject>Deformation</subject><subject>Formability</subject><subject>Image acquisition</subject><subject>Image registration</subject><subject>Interpreters</subject><subject>Medical imaging</subject><subject>Medical research</subject><subject>Modelling</subject><subject>Modules</subject><subject>Multilayers</subject><subject>Parameter uncertainty</subject><subject>Processor Architectures</subject><subject>Programming Languages</subject><subject>Registration</subject><subject>Uncertainty analysis</subject><issn>0920-8542</issn><issn>1573-0484</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kLtOAzEQRS0EEiHwA1SWqA3j1z7KKOIlRaEhteV4Z5cN-wi2I8Tf47BIdDQzzbl3RoeQaw63HCC_C5wLkTMQikGmipKVJ2TGdS4ZqEKdkhmUAlihlTgnFyHsAEDJXM7IekH7QxdbFpztkHbWN0jf0Q_YURsjDrEdB_rZxje6YWuMtB497bFqE07b3ibaY9OG6O2RvCRnte0CXv3uOdk83L8un9jq5fF5uVgxJwAiy7SrnLBCbQVI5LVG6XRhHS9zxVFWXGtwrhJohS1Auq0FWQleZxyd1ttKzsnN1Lv348cBQzS78eCHdNJInk6kkUGixEQ5P4bgsTZ7n372X4aDOXozkzeTvJkfb6ZMITmFQoKHBv1f9T-pb9WmcFA</recordid><startdate>2025</startdate><enddate>2025</enddate><creator>Chen, Yilin</creator><creator>Hu, Xin</creator><creator>Lu, Tao</creator><creator>Zou, Lu</creator><creator>Liao, Xiangyun</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>2025</creationdate><title>A multi-scale large kernel attention with U-Net for medical image registration</title><author>Chen, Yilin ; Hu, Xin ; Lu, Tao ; Zou, Lu ; Liao, Xiangyun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c200t-65cdc2a24b203e1f5e3c58ac19741e3d1550ccd2ea2a803cba03d21f61ec55bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Artificial neural networks</topic><topic>Compilers</topic><topic>Computer Science</topic><topic>Computer vision</topic><topic>Datasets</topic><topic>Deformation</topic><topic>Formability</topic><topic>Image acquisition</topic><topic>Image registration</topic><topic>Interpreters</topic><topic>Medical imaging</topic><topic>Medical research</topic><topic>Modelling</topic><topic>Modules</topic><topic>Multilayers</topic><topic>Parameter uncertainty</topic><topic>Processor Architectures</topic><topic>Programming Languages</topic><topic>Registration</topic><topic>Uncertainty analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chen, Yilin</creatorcontrib><creatorcontrib>Hu, Xin</creatorcontrib><creatorcontrib>Lu, Tao</creatorcontrib><creatorcontrib>Zou, Lu</creatorcontrib><creatorcontrib>Liao, Xiangyun</creatorcontrib><collection>CrossRef</collection><jtitle>The Journal of supercomputing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Yilin</au><au>Hu, Xin</au><au>Lu, Tao</au><au>Zou, Lu</au><au>Liao, Xiangyun</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A multi-scale large kernel attention with U-Net for medical image registration</atitle><jtitle>The Journal of supercomputing</jtitle><stitle>J Supercomput</stitle><date>2025</date><risdate>2025</risdate><volume>81</volume><issue>1</issue><artnum>70</artnum><issn>0920-8542</issn><eissn>1573-0484</eissn><abstract>Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11227-024-06489-9</doi></addata></record>
fulltext fulltext
identifier ISSN: 0920-8542
ispartof The Journal of supercomputing, 2025, Vol.81 (1), Article 70
issn 0920-8542
1573-0484
language eng
recordid cdi_proquest_journals_3120031260
source SpringerNature Journals
subjects Artificial neural networks
Compilers
Computer Science
Computer vision
Datasets
Deformation
Formability
Image acquisition
Image registration
Interpreters
Medical imaging
Medical research
Modelling
Modules
Multilayers
Parameter uncertainty
Processor Architectures
Programming Languages
Registration
Uncertainty analysis
title A multi-scale large kernel attention with U-Net for medical image registration
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T12%3A04%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20multi-scale%20large%20kernel%20attention%20with%20U-Net%20for%20medical%20image%20registration&rft.jtitle=The%20Journal%20of%20supercomputing&rft.au=Chen,%20Yilin&rft.date=2025&rft.volume=81&rft.issue=1&rft.artnum=70&rft.issn=0920-8542&rft.eissn=1573-0484&rft_id=info:doi/10.1007/s11227-024-06489-9&rft_dat=%3Cproquest_cross%3E3120031260%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3120031260&rft_id=info:pmid/&rfr_iscdi=true