Deep Learning vs. Human Graders for Classifying Severity Levels of Diabetic Retinopathy in a Real-World Nationwide Screening Program

Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. 25,326 gradable retinal images of patient...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Raumviboonsuk, Paisan, Krause, Jonathan, Chotcomwongse, Peranut, Sayres, Rory, Raman, Rajiv, Widner, Kasumi, Campana, Bilson J L, Phene, Sonia, Hemarat, Kornwipa, Tadarati, Mongkol, Silpa-Acha, Sukhum, Limwattanayingyong, Jirawut, Rao, Chetan, Kuruvilla, Oscar, Jung, Jesse, Tan, Jeffrey, Orprayoon, Surapong, Kangwanwongpaisan, Chawawat, Sukulmalpaiboon, Ramase, Luengchaichawang, Chainarong, Fuangkaew, Jitumporn, Kongsap, Pipat, Chualinpha, Lamyong, Saree, Sarawuth, Kawinpanitan, Srirat, Mitvongsa, Korntip, Lawanasakol, Siriporn, Thepchatri, Chaiyasit, Wongpichedchai, Lalita, Corrado, Greg S, Peng, Lily, Webster, Dale R
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. 25,326 gradable retinal images of patients with diabetes from the community-based, nation-wide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening.
DOI:10.48550/arxiv.1810.08290