A multi-scale large kernel attention with U-Net for medical image registration
Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and t...
Gespeichert in:
Veröffentlicht in: | The Journal of supercomputing 2025, Vol.81 (1), Article 70 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | |
container_title | The Journal of supercomputing |
container_volume | 81 |
creator | Chen, Yilin Hu, Xin Lu, Tao Zou, Lu Liao, Xiangyun |
description | Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method. |
doi_str_mv | 10.1007/s11227-024-06489-9 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3120031260</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3120031260</sourcerecordid><originalsourceid>FETCH-LOGICAL-c200t-65cdc2a24b203e1f5e3c58ac19741e3d1550ccd2ea2a803cba03d21f61ec55bd3</originalsourceid><addsrcrecordid>eNp9kLtOAzEQRS0EEiHwA1SWqA3j1z7KKOIlRaEhteV4Z5cN-wi2I8Tf47BIdDQzzbl3RoeQaw63HCC_C5wLkTMQikGmipKVJ2TGdS4ZqEKdkhmUAlihlTgnFyHsAEDJXM7IekH7QxdbFpztkHbWN0jf0Q_YURsjDrEdB_rZxje6YWuMtB497bFqE07b3ibaY9OG6O2RvCRnte0CXv3uOdk83L8un9jq5fF5uVgxJwAiy7SrnLBCbQVI5LVG6XRhHS9zxVFWXGtwrhJohS1Auq0FWQleZxyd1ttKzsnN1Lv348cBQzS78eCHdNJInk6kkUGixEQ5P4bgsTZ7n372X4aDOXozkzeTvJkfb6ZMITmFQoKHBv1f9T-pb9WmcFA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3120031260</pqid></control><display><type>article</type><title>A multi-scale large kernel attention with U-Net for medical image registration</title><source>SpringerNature Journals</source><creator>Chen, Yilin ; Hu, Xin ; Lu, Tao ; Zou, Lu ; Liao, Xiangyun</creator><creatorcontrib>Chen, Yilin ; Hu, Xin ; Lu, Tao ; Zou, Lu ; Liao, Xiangyun</creatorcontrib><description>Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method.</description><identifier>ISSN: 0920-8542</identifier><identifier>EISSN: 1573-0484</identifier><identifier>DOI: 10.1007/s11227-024-06489-9</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Artificial neural networks ; Compilers ; Computer Science ; Computer vision ; Datasets ; Deformation ; Formability ; Image acquisition ; Image registration ; Interpreters ; Medical imaging ; Medical research ; Modelling ; Modules ; Multilayers ; Parameter uncertainty ; Processor Architectures ; Programming Languages ; Registration ; Uncertainty analysis</subject><ispartof>The Journal of supercomputing, 2025, Vol.81 (1), Article 70</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c200t-65cdc2a24b203e1f5e3c58ac19741e3d1550ccd2ea2a803cba03d21f61ec55bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11227-024-06489-9$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11227-024-06489-9$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Chen, Yilin</creatorcontrib><creatorcontrib>Hu, Xin</creatorcontrib><creatorcontrib>Lu, Tao</creatorcontrib><creatorcontrib>Zou, Lu</creatorcontrib><creatorcontrib>Liao, Xiangyun</creatorcontrib><title>A multi-scale large kernel attention with U-Net for medical image registration</title><title>The Journal of supercomputing</title><addtitle>J Supercomput</addtitle><description>Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method.</description><subject>Artificial neural networks</subject><subject>Compilers</subject><subject>Computer Science</subject><subject>Computer vision</subject><subject>Datasets</subject><subject>Deformation</subject><subject>Formability</subject><subject>Image acquisition</subject><subject>Image registration</subject><subject>Interpreters</subject><subject>Medical imaging</subject><subject>Medical research</subject><subject>Modelling</subject><subject>Modules</subject><subject>Multilayers</subject><subject>Parameter uncertainty</subject><subject>Processor Architectures</subject><subject>Programming Languages</subject><subject>Registration</subject><subject>Uncertainty analysis</subject><issn>0920-8542</issn><issn>1573-0484</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kLtOAzEQRS0EEiHwA1SWqA3j1z7KKOIlRaEhteV4Z5cN-wi2I8Tf47BIdDQzzbl3RoeQaw63HCC_C5wLkTMQikGmipKVJ2TGdS4ZqEKdkhmUAlihlTgnFyHsAEDJXM7IekH7QxdbFpztkHbWN0jf0Q_YURsjDrEdB_rZxje6YWuMtB497bFqE07b3ibaY9OG6O2RvCRnte0CXv3uOdk83L8un9jq5fF5uVgxJwAiy7SrnLBCbQVI5LVG6XRhHS9zxVFWXGtwrhJohS1Auq0FWQleZxyd1ttKzsnN1Lv348cBQzS78eCHdNJInk6kkUGixEQ5P4bgsTZ7n372X4aDOXozkzeTvJkfb6ZMITmFQoKHBv1f9T-pb9WmcFA</recordid><startdate>2025</startdate><enddate>2025</enddate><creator>Chen, Yilin</creator><creator>Hu, Xin</creator><creator>Lu, Tao</creator><creator>Zou, Lu</creator><creator>Liao, Xiangyun</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>2025</creationdate><title>A multi-scale large kernel attention with U-Net for medical image registration</title><author>Chen, Yilin ; Hu, Xin ; Lu, Tao ; Zou, Lu ; Liao, Xiangyun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c200t-65cdc2a24b203e1f5e3c58ac19741e3d1550ccd2ea2a803cba03d21f61ec55bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Artificial neural networks</topic><topic>Compilers</topic><topic>Computer Science</topic><topic>Computer vision</topic><topic>Datasets</topic><topic>Deformation</topic><topic>Formability</topic><topic>Image acquisition</topic><topic>Image registration</topic><topic>Interpreters</topic><topic>Medical imaging</topic><topic>Medical research</topic><topic>Modelling</topic><topic>Modules</topic><topic>Multilayers</topic><topic>Parameter uncertainty</topic><topic>Processor Architectures</topic><topic>Programming Languages</topic><topic>Registration</topic><topic>Uncertainty analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chen, Yilin</creatorcontrib><creatorcontrib>Hu, Xin</creatorcontrib><creatorcontrib>Lu, Tao</creatorcontrib><creatorcontrib>Zou, Lu</creatorcontrib><creatorcontrib>Liao, Xiangyun</creatorcontrib><collection>CrossRef</collection><jtitle>The Journal of supercomputing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Yilin</au><au>Hu, Xin</au><au>Lu, Tao</au><au>Zou, Lu</au><au>Liao, Xiangyun</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A multi-scale large kernel attention with U-Net for medical image registration</atitle><jtitle>The Journal of supercomputing</jtitle><stitle>J Supercomput</stitle><date>2025</date><risdate>2025</risdate><volume>81</volume><issue>1</issue><artnum>70</artnum><issn>0920-8542</issn><eissn>1573-0484</eissn><abstract>Deformable image registration minimizes the discrepancy between moving and fixed images by establishing linear and nonlinear spatial correspondences. It plays a crucial role in surgical navigation, image fusion and disease analysis. Its challenge lies in the large number of deformed parameters and the uncertainty of acquisition conditions. Benefiting from the powerful ability to capture hierarchical features and spatial relationships of convolutional neural networks, the medical image registration task has made great progress. Nowadays, the long-range relationship modeling and adaptive selection of self-attention show great potential and have also attracted much attention from researchers. Inspired by this, we propose a new method called Multi-scale Large Kernel Attention UNet (MLKA-Net), which combines a large kernel convolution with the attention mechanism using a multi-scale strategy, and uses a correction module to fine-tune the deformation field to achieve high-accuracy registration. Specifically, we first propose a multi-scale large kernel attention mechanism (MLKA), which generates attention maps by aggregating information from convolution kernels at different scales to improve local feature modeling capabilities of attention. Furthermore, we employ large kernel dilation convolution in proposed attention to construct sufficiently long-range relationships, while keeping lower number of parameters. Finally, to further improve local accuracy of the registration, we design an additional correction module and unsupervised framework to fine-tune the deformation field to solve the issue of original information loss in multilayer networks. Our method is compared qualitatively and quantitatively with 24 representative and advanced methods on the 3 public available 3D datasets from IXI database, LPBA40 dataset and OASIS database, respectively. The experiments demonstrate the excellent performance of the proposed method.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11227-024-06489-9</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0920-8542 |
ispartof | The Journal of supercomputing, 2025, Vol.81 (1), Article 70 |
issn | 0920-8542 1573-0484 |
language | eng |
recordid | cdi_proquest_journals_3120031260 |
source | SpringerNature Journals |
subjects | Artificial neural networks Compilers Computer Science Computer vision Datasets Deformation Formability Image acquisition Image registration Interpreters Medical imaging Medical research Modelling Modules Multilayers Parameter uncertainty Processor Architectures Programming Languages Registration Uncertainty analysis |
title | A multi-scale large kernel attention with U-Net for medical image registration |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T12%3A04%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20multi-scale%20large%20kernel%20attention%20with%20U-Net%20for%20medical%20image%20registration&rft.jtitle=The%20Journal%20of%20supercomputing&rft.au=Chen,%20Yilin&rft.date=2025&rft.volume=81&rft.issue=1&rft.artnum=70&rft.issn=0920-8542&rft.eissn=1573-0484&rft_id=info:doi/10.1007/s11227-024-06489-9&rft_dat=%3Cproquest_cross%3E3120031260%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3120031260&rft_id=info:pmid/&rfr_iscdi=true |