Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review

The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-11
Hauptverfasser: Azad, Reza, Kazerouni, Amirhossein, Heidari, Moein, Aghdam, Ehsan Khodapanah, Molaei, Amirali, Jia, Yiwei, Abin Jose, Rijo Roy, Merhof, Dorit
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Azad, Reza
Kazerouni, Amirhossein
Heidari, Moein
Aghdam, Ehsan Khodapanah
Molaei, Amirali
Jia, Yiwei
Abin Jose
Rijo Roy
Merhof, Dorit
description The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2763956677</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2763956677</sourcerecordid><originalsourceid>FETCH-proquest_journals_27639566773</originalsourceid><addsrcrecordid>eNqNzM0KgkAUQOEhCJLyHS60FmwmtdqJFLVoU9JWBr3miM7YXH_o7WvRA7Q6m48zYw4XYuPttpwvmEtU-77Pw4gHgXDYPS5GqXMkUBquWKhcNnBp5RMh1rJ5kyKYVF_BQ5EyGlIrNZXGtmjpADEkpu0sVqhJjQg3HBVOKzYvZUPo_rpk69MxTc5eZ81rQOqz2gz2e6eMR6HYB2EYReI_9QH5Sz_d</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2763956677</pqid></control><display><type>article</type><title>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</title><source>Free E- Journals</source><creator>Azad, Reza ; Kazerouni, Amirhossein ; Heidari, Moein ; Aghdam, Ehsan Khodapanah ; Molaei, Amirali ; Jia, Yiwei ; Abin Jose ; Rijo Roy ; Merhof, Dorit</creator><creatorcontrib>Azad, Reza ; Kazerouni, Amirhossein ; Heidari, Moein ; Aghdam, Ehsan Khodapanah ; Molaei, Amirali ; Jia, Yiwei ; Abin Jose ; Rijo Roy ; Merhof, Dorit</creatorcontrib><description>The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Computer vision ; Image analysis ; Image segmentation ; Medical imaging ; Natural language processing ; Taxonomy</subject><ispartof>arXiv.org, 2023-11</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>778,782</link.rule.ids></links><search><creatorcontrib>Azad, Reza</creatorcontrib><creatorcontrib>Kazerouni, Amirhossein</creatorcontrib><creatorcontrib>Heidari, Moein</creatorcontrib><creatorcontrib>Aghdam, Ehsan Khodapanah</creatorcontrib><creatorcontrib>Molaei, Amirali</creatorcontrib><creatorcontrib>Jia, Yiwei</creatorcontrib><creatorcontrib>Abin Jose</creatorcontrib><creatorcontrib>Rijo Roy</creatorcontrib><creatorcontrib>Merhof, Dorit</creatorcontrib><title>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</title><title>arXiv.org</title><description>The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.</description><subject>Artificial neural networks</subject><subject>Computer vision</subject><subject>Image analysis</subject><subject>Image segmentation</subject><subject>Medical imaging</subject><subject>Natural language processing</subject><subject>Taxonomy</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNzM0KgkAUQOEhCJLyHS60FmwmtdqJFLVoU9JWBr3miM7YXH_o7WvRA7Q6m48zYw4XYuPttpwvmEtU-77Pw4gHgXDYPS5GqXMkUBquWKhcNnBp5RMh1rJ5kyKYVF_BQ5EyGlIrNZXGtmjpADEkpu0sVqhJjQg3HBVOKzYvZUPo_rpk69MxTc5eZ81rQOqz2gz2e6eMR6HYB2EYReI_9QH5Sz_d</recordid><startdate>20231105</startdate><enddate>20231105</enddate><creator>Azad, Reza</creator><creator>Kazerouni, Amirhossein</creator><creator>Heidari, Moein</creator><creator>Aghdam, Ehsan Khodapanah</creator><creator>Molaei, Amirali</creator><creator>Jia, Yiwei</creator><creator>Abin Jose</creator><creator>Rijo Roy</creator><creator>Merhof, Dorit</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20231105</creationdate><title>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</title><author>Azad, Reza ; Kazerouni, Amirhossein ; Heidari, Moein ; Aghdam, Ehsan Khodapanah ; Molaei, Amirali ; Jia, Yiwei ; Abin Jose ; Rijo Roy ; Merhof, Dorit</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_27639566773</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Computer vision</topic><topic>Image analysis</topic><topic>Image segmentation</topic><topic>Medical imaging</topic><topic>Natural language processing</topic><topic>Taxonomy</topic><toplevel>online_resources</toplevel><creatorcontrib>Azad, Reza</creatorcontrib><creatorcontrib>Kazerouni, Amirhossein</creatorcontrib><creatorcontrib>Heidari, Moein</creatorcontrib><creatorcontrib>Aghdam, Ehsan Khodapanah</creatorcontrib><creatorcontrib>Molaei, Amirali</creatorcontrib><creatorcontrib>Jia, Yiwei</creatorcontrib><creatorcontrib>Abin Jose</creatorcontrib><creatorcontrib>Rijo Roy</creatorcontrib><creatorcontrib>Merhof, Dorit</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Azad, Reza</au><au>Kazerouni, Amirhossein</au><au>Heidari, Moein</au><au>Aghdam, Ehsan Khodapanah</au><au>Molaei, Amirali</au><au>Jia, Yiwei</au><au>Abin Jose</au><au>Rijo Roy</au><au>Merhof, Dorit</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review</atitle><jtitle>arXiv.org</jtitle><date>2023-11-05</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision problems so far. Thus, Transformers have become an integral part of modern medical image analysis. In this review, we provide an encyclopedic review of the applications of Transformers in medical imaging. Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks, including classification, segmentation, detection, registration, synthesis, and clinical report generation. For each of these applications, we investigate the novelty, strengths and weaknesses of the different proposed strategies and develop taxonomies highlighting key properties and contributions. Further, if applicable, we outline current benchmarks on different datasets. Finally, we summarize key challenges and discuss different future research directions. In addition, we have provided cited papers with their corresponding implementations in https://github.com/mindflow-institue/Awesome-Transformer.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-11
issn 2331-8422
language eng
recordid cdi_proquest_journals_2763956677
source Free E- Journals
subjects Artificial neural networks
Computer vision
Image analysis
Image segmentation
Medical imaging
Natural language processing
Taxonomy
title Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T10%3A01%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Advances%20in%20Medical%20Image%20Analysis%20with%20Vision%20Transformers:%20A%20Comprehensive%20Review&rft.jtitle=arXiv.org&rft.au=Azad,%20Reza&rft.date=2023-11-05&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2763956677%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2763956677&rft_id=info:pmid/&rfr_iscdi=true