A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability

Health data that are publicly available are valuable resources for digital health research. Several public datasets containing ophthalmological imaging have been frequently used in machine learning research; however, the total number of datasets containing ophthalmological health information and the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Lancet. Digital health 2021-01, Vol.3 (1), p.e51-e66
Hauptverfasser: Khan, Saad M, Liu, Xiaoxuan, Nath, Siddharth, Korot, Edward, Faes, Livia, Wagner, Siegfried K, Keane, Pearse A, Sebire, Neil J, Burton, Matthew J, Denniston, Alastair K
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e66
container_issue 1
container_start_page e51
container_title The Lancet. Digital health
container_volume 3
creator Khan, Saad M
Liu, Xiaoxuan
Nath, Siddharth
Korot, Edward
Faes, Livia
Wagner, Siegfried K
Keane, Pearse A
Sebire, Neil J
Burton, Matthew J
Denniston, Alastair K
description Health data that are publicly available are valuable resources for digital health research. Several public datasets containing ophthalmological imaging have been frequently used in machine learning research; however, the total number of datasets containing ophthalmological health information and their respective content is unclear. This Review aimed to identify all publicly available ophthalmological imaging datasets, detail their accessibility, describe which diseases and populations are represented, and report on the completeness of the associated metadata. With the use of MEDLINE, Google's search engine, and Google Dataset Search, we identified 94 open access datasets containing 507 724 images and 125 videos from 122 364 patients. Most datasets originated from Asia, North America, and Europe. Disease populations were unevenly represented, with glaucoma, diabetic retinopathy, and age-related macular degeneration disproportionately overrepresented in comparison with other eye diseases. The reporting of basic demographic characteristics such as age, sex, and ethnicity was poor, even at the aggregate level. This Review provides greater visibility for ophthalmological datasets that are publicly available as powerful resources for research. Our paper also exposes an increasing divide in the representation of different population and disease groups in health data repositories. The improved reporting of metadata would enable researchers to access the most appropriate datasets for their needs and maximise the potential of such resources.
doi_str_mv 10.1016/S2589-7500(20)30240-5
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2503446079</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2589750020302405</els_id><sourcerecordid>2503446079</sourcerecordid><originalsourceid>FETCH-LOGICAL-c478t-d9befef0b9cdfddcf3ee4f0fe1cfaf74c0d542c69519a8070489230c23fb6c573</originalsourceid><addsrcrecordid>eNqFkE9v1DAQxS0EolXpRwD5WKQGJnYcb7igquKfVIkDcLbG9jg18saLnRTtjY9O2m0rbpxm9PTejN6PsZctvGmh7d9-E2ozNFoBnAl4LUF00Kgn7PhRfvrPfsROa_0JAEK0Umv9nB1JqaWCfjhmfy74mLLFxAvdRPrNc-C7xabo0p7jDcaENhH3OGOlufKQC8-76_ka0zanPEa3RuMWxziN77jFUiKVyufM0Tmq9ZwvFW1Mcd6fc5w8H2migik-qC_Ys4Cp0un9PGE_Pn74fvm5ufr66cvlxVXjOr2ZGz9YChTADs4H712QRF2AQK0LGHTnwKtOuH5Q7YAb0NBtBiHBCRls75SWJ-zscHdX8q-F6my2sTpKCSfKSzVCgey6HvSwWtXB6kqutVAwu7JWLHvTgrnlb-74m1u4RoC542_Umnt1_2KxW_KPqQfaq-H9wUBr0ZV2MdVFmhz5WMjNxuf4nxd_Ac5AmAk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2503446079</pqid></control><display><type>article</type><title>A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Khan, Saad M ; Liu, Xiaoxuan ; Nath, Siddharth ; Korot, Edward ; Faes, Livia ; Wagner, Siegfried K ; Keane, Pearse A ; Sebire, Neil J ; Burton, Matthew J ; Denniston, Alastair K</creator><creatorcontrib>Khan, Saad M ; Liu, Xiaoxuan ; Nath, Siddharth ; Korot, Edward ; Faes, Livia ; Wagner, Siegfried K ; Keane, Pearse A ; Sebire, Neil J ; Burton, Matthew J ; Denniston, Alastair K</creatorcontrib><description>Health data that are publicly available are valuable resources for digital health research. Several public datasets containing ophthalmological imaging have been frequently used in machine learning research; however, the total number of datasets containing ophthalmological health information and their respective content is unclear. This Review aimed to identify all publicly available ophthalmological imaging datasets, detail their accessibility, describe which diseases and populations are represented, and report on the completeness of the associated metadata. With the use of MEDLINE, Google's search engine, and Google Dataset Search, we identified 94 open access datasets containing 507 724 images and 125 videos from 122 364 patients. Most datasets originated from Asia, North America, and Europe. Disease populations were unevenly represented, with glaucoma, diabetic retinopathy, and age-related macular degeneration disproportionately overrepresented in comparison with other eye diseases. The reporting of basic demographic characteristics such as age, sex, and ethnicity was poor, even at the aggregate level. This Review provides greater visibility for ophthalmological datasets that are publicly available as powerful resources for research. Our paper also exposes an increasing divide in the representation of different population and disease groups in health data repositories. The improved reporting of metadata would enable researchers to access the most appropriate datasets for their needs and maximise the potential of such resources.</description><identifier>ISSN: 2589-7500</identifier><identifier>EISSN: 2589-7500</identifier><identifier>DOI: 10.1016/S2589-7500(20)30240-5</identifier><identifier>PMID: 33735069</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Databases, Factual ; Datasets as Topic ; Diagnostic Imaging - methods ; Eye Diseases - diagnostic imaging ; Humans ; Metadata - standards ; Ophthalmology</subject><ispartof>The Lancet. Digital health, 2021-01, Vol.3 (1), p.e51-e66</ispartof><rights>2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license</rights><rights>Copyright © 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license. Published by Elsevier Ltd.. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c478t-d9befef0b9cdfddcf3ee4f0fe1cfaf74c0d542c69519a8070489230c23fb6c573</citedby><cites>FETCH-LOGICAL-c478t-d9befef0b9cdfddcf3ee4f0fe1cfaf74c0d542c69519a8070489230c23fb6c573</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,860,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33735069$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Khan, Saad M</creatorcontrib><creatorcontrib>Liu, Xiaoxuan</creatorcontrib><creatorcontrib>Nath, Siddharth</creatorcontrib><creatorcontrib>Korot, Edward</creatorcontrib><creatorcontrib>Faes, Livia</creatorcontrib><creatorcontrib>Wagner, Siegfried K</creatorcontrib><creatorcontrib>Keane, Pearse A</creatorcontrib><creatorcontrib>Sebire, Neil J</creatorcontrib><creatorcontrib>Burton, Matthew J</creatorcontrib><creatorcontrib>Denniston, Alastair K</creatorcontrib><title>A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability</title><title>The Lancet. Digital health</title><addtitle>Lancet Digit Health</addtitle><description>Health data that are publicly available are valuable resources for digital health research. Several public datasets containing ophthalmological imaging have been frequently used in machine learning research; however, the total number of datasets containing ophthalmological health information and their respective content is unclear. This Review aimed to identify all publicly available ophthalmological imaging datasets, detail their accessibility, describe which diseases and populations are represented, and report on the completeness of the associated metadata. With the use of MEDLINE, Google's search engine, and Google Dataset Search, we identified 94 open access datasets containing 507 724 images and 125 videos from 122 364 patients. Most datasets originated from Asia, North America, and Europe. Disease populations were unevenly represented, with glaucoma, diabetic retinopathy, and age-related macular degeneration disproportionately overrepresented in comparison with other eye diseases. The reporting of basic demographic characteristics such as age, sex, and ethnicity was poor, even at the aggregate level. This Review provides greater visibility for ophthalmological datasets that are publicly available as powerful resources for research. Our paper also exposes an increasing divide in the representation of different population and disease groups in health data repositories. The improved reporting of metadata would enable researchers to access the most appropriate datasets for their needs and maximise the potential of such resources.</description><subject>Databases, Factual</subject><subject>Datasets as Topic</subject><subject>Diagnostic Imaging - methods</subject><subject>Eye Diseases - diagnostic imaging</subject><subject>Humans</subject><subject>Metadata - standards</subject><subject>Ophthalmology</subject><issn>2589-7500</issn><issn>2589-7500</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkE9v1DAQxS0EolXpRwD5WKQGJnYcb7igquKfVIkDcLbG9jg18saLnRTtjY9O2m0rbpxm9PTejN6PsZctvGmh7d9-E2ozNFoBnAl4LUF00Kgn7PhRfvrPfsROa_0JAEK0Umv9nB1JqaWCfjhmfy74mLLFxAvdRPrNc-C7xabo0p7jDcaENhH3OGOlufKQC8-76_ka0zanPEa3RuMWxziN77jFUiKVyufM0Tmq9ZwvFW1Mcd6fc5w8H2migik-qC_Ys4Cp0un9PGE_Pn74fvm5ufr66cvlxVXjOr2ZGz9YChTADs4H712QRF2AQK0LGHTnwKtOuH5Q7YAb0NBtBiHBCRls75SWJ-zscHdX8q-F6my2sTpKCSfKSzVCgey6HvSwWtXB6kqutVAwu7JWLHvTgrnlb-74m1u4RoC542_Umnt1_2KxW_KPqQfaq-H9wUBr0ZV2MdVFmhz5WMjNxuf4nxd_Ac5AmAk</recordid><startdate>202101</startdate><enddate>202101</enddate><creator>Khan, Saad M</creator><creator>Liu, Xiaoxuan</creator><creator>Nath, Siddharth</creator><creator>Korot, Edward</creator><creator>Faes, Livia</creator><creator>Wagner, Siegfried K</creator><creator>Keane, Pearse A</creator><creator>Sebire, Neil J</creator><creator>Burton, Matthew J</creator><creator>Denniston, Alastair K</creator><general>Elsevier Ltd</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>202101</creationdate><title>A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability</title><author>Khan, Saad M ; Liu, Xiaoxuan ; Nath, Siddharth ; Korot, Edward ; Faes, Livia ; Wagner, Siegfried K ; Keane, Pearse A ; Sebire, Neil J ; Burton, Matthew J ; Denniston, Alastair K</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c478t-d9befef0b9cdfddcf3ee4f0fe1cfaf74c0d542c69519a8070489230c23fb6c573</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Databases, Factual</topic><topic>Datasets as Topic</topic><topic>Diagnostic Imaging - methods</topic><topic>Eye Diseases - diagnostic imaging</topic><topic>Humans</topic><topic>Metadata - standards</topic><topic>Ophthalmology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khan, Saad M</creatorcontrib><creatorcontrib>Liu, Xiaoxuan</creatorcontrib><creatorcontrib>Nath, Siddharth</creatorcontrib><creatorcontrib>Korot, Edward</creatorcontrib><creatorcontrib>Faes, Livia</creatorcontrib><creatorcontrib>Wagner, Siegfried K</creatorcontrib><creatorcontrib>Keane, Pearse A</creatorcontrib><creatorcontrib>Sebire, Neil J</creatorcontrib><creatorcontrib>Burton, Matthew J</creatorcontrib><creatorcontrib>Denniston, Alastair K</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>The Lancet. Digital health</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khan, Saad M</au><au>Liu, Xiaoxuan</au><au>Nath, Siddharth</au><au>Korot, Edward</au><au>Faes, Livia</au><au>Wagner, Siegfried K</au><au>Keane, Pearse A</au><au>Sebire, Neil J</au><au>Burton, Matthew J</au><au>Denniston, Alastair K</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability</atitle><jtitle>The Lancet. Digital health</jtitle><addtitle>Lancet Digit Health</addtitle><date>2021-01</date><risdate>2021</risdate><volume>3</volume><issue>1</issue><spage>e51</spage><epage>e66</epage><pages>e51-e66</pages><issn>2589-7500</issn><eissn>2589-7500</eissn><abstract>Health data that are publicly available are valuable resources for digital health research. Several public datasets containing ophthalmological imaging have been frequently used in machine learning research; however, the total number of datasets containing ophthalmological health information and their respective content is unclear. This Review aimed to identify all publicly available ophthalmological imaging datasets, detail their accessibility, describe which diseases and populations are represented, and report on the completeness of the associated metadata. With the use of MEDLINE, Google's search engine, and Google Dataset Search, we identified 94 open access datasets containing 507 724 images and 125 videos from 122 364 patients. Most datasets originated from Asia, North America, and Europe. Disease populations were unevenly represented, with glaucoma, diabetic retinopathy, and age-related macular degeneration disproportionately overrepresented in comparison with other eye diseases. The reporting of basic demographic characteristics such as age, sex, and ethnicity was poor, even at the aggregate level. This Review provides greater visibility for ophthalmological datasets that are publicly available as powerful resources for research. Our paper also exposes an increasing divide in the representation of different population and disease groups in health data repositories. The improved reporting of metadata would enable researchers to access the most appropriate datasets for their needs and maximise the potential of such resources.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>33735069</pmid><doi>10.1016/S2589-7500(20)30240-5</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2589-7500
ispartof The Lancet. Digital health, 2021-01, Vol.3 (1), p.e51-e66
issn 2589-7500
2589-7500
language eng
recordid cdi_proquest_miscellaneous_2503446079
source MEDLINE; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection
subjects Databases, Factual
Datasets as Topic
Diagnostic Imaging - methods
Eye Diseases - diagnostic imaging
Humans
Metadata - standards
Ophthalmology
title A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T04%3A06%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20global%20review%20of%20publicly%20available%20datasets%20for%20ophthalmological%20imaging:%20barriers%20to%20access,%20usability,%20and%20generalisability&rft.jtitle=The%20Lancet.%20Digital%20health&rft.au=Khan,%20Saad%20M&rft.date=2021-01&rft.volume=3&rft.issue=1&rft.spage=e51&rft.epage=e66&rft.pages=e51-e66&rft.issn=2589-7500&rft.eissn=2589-7500&rft_id=info:doi/10.1016/S2589-7500(20)30240-5&rft_dat=%3Cproquest_cross%3E2503446079%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2503446079&rft_id=info:pmid/33735069&rft_els_id=S2589750020302405&rfr_iscdi=true