Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review
The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2023-11 |
---|---|
Hauptverfasser: | , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Azad, Reza Kazerouni, Amirhossein Heidari, Moein Aghdam, Ehsan Khodapanah Molaei, Amirali Jia, Yiwei Abin Jose Rijo Roy Merhof, Dorit |
description | The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2763956677</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2763956677</sourcerecordid><originalsourceid>FETCH-proquest_journals_27639566773</originalsourceid><addsrcrecordid>eNqNzM0KgkAUQOEhCJLyHS60FmwmtdqJFLVoU9JWBr3miM7YXH_o7WvRA7Q6m48zYw4XYuPttpwvmEtU-77Pw4gHgXDYPS5GqXMkUBquWKhcNnBp5RMh1rJ5kyKYVF_BQ5EyGlIrNZXGtmjpADEkpu0sVqhJjQg3HBVOKzYvZUPo_rpk69MxTc5eZ81rQOqz2gz2e6eMR6HYB2EYReI_9QH5Sz_d</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2763956677</pqid></control><display><type>article</type><title>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</title><source>Free E- Journals</source><creator>Azad, Reza ; Kazerouni, Amirhossein ; Heidari, Moein ; Aghdam, Ehsan Khodapanah ; Molaei, Amirali ; Jia, Yiwei ; Abin Jose ; Rijo Roy ; Merhof, Dorit</creator><creatorcontrib>Azad, Reza ; Kazerouni, Amirhossein ; Heidari, Moein ; Aghdam, Ehsan Khodapanah ; Molaei, Amirali ; Jia, Yiwei ; Abin Jose ; Rijo Roy ; Merhof, Dorit</creatorcontrib><description>The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Computer vision ; Image analysis ; Image segmentation ; Medical imaging ; Natural language processing ; Taxonomy</subject><ispartof>arXiv.org, 2023-11</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>778,782</link.rule.ids></links><search><creatorcontrib>Azad, Reza</creatorcontrib><creatorcontrib>Kazerouni, Amirhossein</creatorcontrib><creatorcontrib>Heidari, Moein</creatorcontrib><creatorcontrib>Aghdam, Ehsan Khodapanah</creatorcontrib><creatorcontrib>Molaei, Amirali</creatorcontrib><creatorcontrib>Jia, Yiwei</creatorcontrib><creatorcontrib>Abin Jose</creatorcontrib><creatorcontrib>Rijo Roy</creatorcontrib><creatorcontrib>Merhof, Dorit</creatorcontrib><title>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</title><title>arXiv.org</title><description>The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.</description><subject>Artificial neural networks</subject><subject>Computer vision</subject><subject>Image analysis</subject><subject>Image segmentation</subject><subject>Medical imaging</subject><subject>Natural language processing</subject><subject>Taxonomy</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNzM0KgkAUQOEhCJLyHS60FmwmtdqJFLVoU9JWBr3miM7YXH_o7WvRA7Q6m48zYw4XYuPttpwvmEtU-77Pw4gHgXDYPS5GqXMkUBquWKhcNnBp5RMh1rJ5kyKYVF_BQ5EyGlIrNZXGtmjpADEkpu0sVqhJjQg3HBVOKzYvZUPo_rpk69MxTc5eZ81rQOqz2gz2e6eMR6HYB2EYReI_9QH5Sz_d</recordid><startdate>20231105</startdate><enddate>20231105</enddate><creator>Azad, Reza</creator><creator>Kazerouni, Amirhossein</creator><creator>Heidari, Moein</creator><creator>Aghdam, Ehsan Khodapanah</creator><creator>Molaei, Amirali</creator><creator>Jia, Yiwei</creator><creator>Abin Jose</creator><creator>Rijo Roy</creator><creator>Merhof, Dorit</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20231105</creationdate><title>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</title><author>Azad, Reza ; Kazerouni, Amirhossein ; Heidari, Moein ; Aghdam, Ehsan Khodapanah ; Molaei, Amirali ; Jia, Yiwei ; Abin Jose ; Rijo Roy ; Merhof, Dorit</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_27639566773</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Computer vision</topic><topic>Image analysis</topic><topic>Image segmentation</topic><topic>Medical imaging</topic><topic>Natural language processing</topic><topic>Taxonomy</topic><toplevel>online_resources</toplevel><creatorcontrib>Azad, Reza</creatorcontrib><creatorcontrib>Kazerouni, Amirhossein</creatorcontrib><creatorcontrib>Heidari, Moein</creatorcontrib><creatorcontrib>Aghdam, Ehsan Khodapanah</creatorcontrib><creatorcontrib>Molaei, Amirali</creatorcontrib><creatorcontrib>Jia, Yiwei</creatorcontrib><creatorcontrib>Abin Jose</creatorcontrib><creatorcontrib>Rijo Roy</creatorcontrib><creatorcontrib>Merhof, Dorit</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Azad, Reza</au><au>Kazerouni, Amirhossein</au><au>Heidari, Moein</au><au>Aghdam, Ehsan Khodapanah</au><au>Molaei, Amirali</au><au>Jia, Yiwei</au><au>Abin Jose</au><au>Rijo Roy</au><au>Merhof, Dorit</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</atitle><jtitle>arXiv.org</jtitle><date>2023-11-05</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2023-11 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2763956677 |
source | Free E- Journals |
subjects | Artificial neural networks Computer vision Image analysis Image segmentation Medical imaging Natural language processing Taxonomy |
title | Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T10%3A01%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Advances%20in%20Medical%20Image%20Analysis%20with%20Vision%20Transformers:%20A%20Comprehensive%20Review&rft.jtitle=arXiv.org&rft.au=Azad,%20Reza&rft.date=2023-11-05&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2763956677%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2763956677&rft_id=info:pmid/&rfr_iscdi=true |