MVDRNet: Multi-view diabetic retinopathy detection by combining DCNNs and attention mechanisms

•We propose a novel multi-view DCNN-based approach which can take advantage of not only multi-view images but also the relationships between them.•The proposed networks learn the integrated features of multi-view fundus images for DR detection.•We introduce the attention mechanisms for mining the re...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition 2021-12, Vol.120, p.108104, Article 108104
Hauptverfasser: Luo, Xiaoling, Pu, Zuhui, Xu, Yong, Wong, Wai Keung, Su, Jingyong, Dou, Xiaoyan, Ye, Baikang, Hu, Jiying, Mou, Lisha
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•We propose a novel multi-view DCNN-based approach which can take advantage of not only multi-view images but also the relationships between them.•The proposed networks learn the integrated features of multi-view fundus images for DR detection.•We introduce the attention mechanisms for mining the relationship between different views.•In order to boost the ability to capture the tiny lesion features in the retinal images, we combine the idea of attention mechanisms with the channel dimension. Diabetic retinopathy (DR) detection has attracted much attention recently, and the deep learning algorithms have gained traction in this area. At present, DR screening by deep learning algorithms is often based on single-view fundus images, which usually leads to an unsatisfactory accuracy of DR grading due to the incomplete lesion features. In this paper, we proposed a novel diabetic retinopathy detection convolutional network for automatic DR detection by integrating multi-view fundus images. Compared to existing single-view DCNN-based DR detection methods, the proposed method has the following advantages. First, our method fully utilizes the lesion features from the retina with a field-of-view around 120∘−150∘. Second, by introducing the attention mechanisms, more attention will be paid on the influential view and the performance can be improved. Besides, we also assign large weights to important channels in the network for effective feature extraction. Experiments are conducted on our collected multi-view DR dataset contained 15,468 images, in which each eye sample provides four-view images. The experimental results indicate that using multi-view images is suitable for automatic DR detection and our proposed method is superior to other benchmarking methods.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2021.108104