VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence
We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial int...
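Main authors: | Qiu, Jianing; Wu, Jian; Wei, Hao; Shi, Peilun; Zhang, Minqing; Sun, Yunyun; Li, Lin; Liu, Hanruo; Liu, Hongyi; Hou, Simeng; Zhao, Yuyang; Shi, Xuehui; Xian, Junfang; Qu, Xiaoxia; Zhu, Sirui; Pan, Lijie; Chen, Xiaoniao; Zhang, Xiaojia; Jiang, Shuai; Wang, Kebing; Yang, Chenlong; Chen, Mingqiang; Fan, Sujie; Hu, Jianhua; Lv, Aiguo; Miao, Hui; Guo, Li; Zhang, Shujun; Pei, Cheng; Fan, Xiaojuan; Lei, Jianqin; Wei, Ting; Duan, Junguo; Liu, Chun; Xia, Xiaobo; Xiong, Siqi; Li, Junhong; Lo, Benny; Tham, Yih Chung; Wong, Tien Yin; Wang, Ningli; Yuan, Wu |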
---|---|
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Qiu, Jianing; Wu, Jian; Wei, Hao; Shi, Peilun; Zhang, Minqing; Sun, Yunyun; Li, Lin; Liu, Hanruo; Liu, Hongyi; Hou, Simeng; Zhao, Yuyang; Shi, Xuehui; Xian, Junfang; Qu, Xiaoxia; Zhu, Sirui; Pan, Lijie; Chen, Xiaoniao; Zhang, Xiaojia; Jiang, Shuai; Wang, Kebing; Yang, Chenlong; Chen, Mingqiang; Fan, Sujie; Hu, Jianhua; Lv, Aiguo; Miao, Hui; Guo, Li; Zhang, Shujun; Pei, Cheng; Fan, Xiaojuan; Lei, Jianqin; Wei, Ting; Duan, Junguo; Liu, Chun; Xia, Xiaobo; Xiong, Siqi; Li, Junhong; Lo, Benny; Tham, Yih Chung; Wong, Tien Yin; Wang, Ningli; Yuan, Wu |
description | We present VisionFM, a foundation model pre-trained with 3.4 million
ophthalmic images from 560,457 individuals, covering a broad range of
ophthalmic diseases, modalities, imaging devices, and demography. After
pre-training, VisionFM provides a foundation to foster multiple ophthalmic
artificial intelligence (AI) applications, such as disease screening and
diagnosis, disease prognosis, subclassification of disease phenotype, and
systemic biomarker and disease prediction, with each application enhanced with
expert-level intelligence and accuracy. The generalist intelligence of VisionFM
outperformed ophthalmologists with basic and intermediate levels in jointly
diagnosing 12 common ophthalmic diseases. Evaluated on a new large-scale
ophthalmic disease diagnosis benchmark database, as well as a new large-scale
segmentation and detection benchmark database, VisionFM outperformed strong
baseline deep neural networks. The ophthalmic image representations learned by
VisionFM exhibited noteworthy explainability, and demonstrated strong
generalizability to new ophthalmic modalities, disease spectrum, and imaging
devices. As a foundation model, VisionFM has a large capacity to learn from
diverse ophthalmic imaging data and disparate datasets. To be commensurate with
this capacity, in addition to the real data used for pre-training, we also
generated and leveraged synthetic ophthalmic imaging data. Experimental results
revealed that synthetic data that passed visual Turing tests can also enhance
the representation learning capability of VisionFM, leading to substantial
performance gains on downstream ophthalmic AI tasks. Beyond the ophthalmic AI
applications developed, validated, and demonstrated in this work, substantial
further applications can be achieved in an efficient and cost-effective manner
using VisionFM as the foundation. |
doi_str_mv | 10.48550/arxiv.2310.04992 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2310_04992</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2310_04992</sourcerecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence</title><source>arXiv.org</source><identifier>DOI: 10.48550/arxiv.2310.04992</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-10</creationdate><rights>http://creativecommons.org/licenses/by-nc-nd/4.0</rights><oa>free_for_read</oa></display><links><linktorsrc>$$Uhttps://arxiv.org/abs/2310.04992$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2310.04992$$DView paper in arXiv$$Hfree_for_read</backlink></links><facets><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><addata><format>journal</format><genre>article</genre><ristype>JOUR</ristype><date>2023-10-07</date><risdate>2023</risdate><doi>10.48550/arxiv.2310.04992</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2310.04992 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2310_04992 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T10%3A47%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=VisionFM:%20a%20Multi-Modal%20Multi-Task%20Vision%20Foundation%20Model%20for%20Generalist%20Ophthalmic%20Artificial%20Intelligence&rft.au=Qiu,%20Jianing&rft.date=2023-10-07&rft_id=info:doi/10.48550/arxiv.2310.04992&rft_dat=%3Carxiv_GOX%3E2310_04992%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |