Training high-performance deep learning classifier for diagnosis in oral cytology using diverse annotations

The uncertainty of true labels in medical images hinders diagnosis owing to the variability across professionals when applying deep learning models. We used deep learning to obtain an optimal convolutional neural network (CNN) by adequately annotating data for oral exfoliative cytology considering l...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Scientific reports 2024-07, Vol.14 (1), p.17591-8, Article 17591
Hauptverfasser:	Sukegawa, Shintaro, Tanaka, Futa, Nakano, Keisuke, Hara, Takeshi, Ochiai, Takanaga, Shimada, Katsumitsu, Inoue, Yuta, Taki, Yoshihiro, Nakai, Fumi, Nakai, Yasuhiro, Ishihama, Takanori, Miyazaki, Ryo, Murakami, Satoshi, Nagatsuka, Hitoshi, Miyake, Minoru
Format:	Artikel
Sprache:	eng
Schlagworte:	692/4028/67/1536/1665 692/699/3020/1665/3016 Annotations Artificial intelligence Cellular biology Classification Convolutional neural network Cytodiagnosis - methods Cytology Deep Learning Diagnosis Humanities and Social Sciences Humans Image Processing, Computer-Assisted - methods Mouth Neoplasms - diagnosis Mouth Neoplasms - pathology multidisciplinary Neural networks Neural Networks, Computer Oral cytology Pathologists Probabilistic labeling Science Science (multidisciplinary)
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The uncertainty of true labels in medical images hinders diagnosis owing to the variability across professionals when applying deep learning models. We used deep learning to obtain an optimal convolutional neural network (CNN) by adequately annotating data for oral exfoliative cytology considering labels from multiple oral pathologists. Six whole-slide images were processed using QuPath for segmenting them into tiles. The images were labeled by three oral pathologists, resulting in 14,535 images with the corresponding pathologists’ annotations. Data from three pathologists who provided the same diagnosis were labeled as ground truth (GT) and used for testing. We investigated six models trained using the annotations of (1) pathologist A, (2) pathologist B, (3) pathologist C, (4) GT, (5) majority voting, and (6) a probabilistic model. We divided the test by cross-validation per slide dataset and examined the classification performance of the CNN with a ResNet50 baseline. Statistical evaluation was performed repeatedly and independently using every slide 10 times as test data. For the area under the curve, three cases showed the highest values (0.861, 0.955, and 0.991) for the probabilistic model. Regarding accuracy, two cases showed the highest values (0.988 and 0.967). For the models using the pathologists and GT annotations, many slides showed very low accuracy and large variations across tests. Hence, the classifier trained with probabilistic labels provided the optimal CNN for oral exfoliative cytology considering diagnoses from multiple pathologists. These results may lead to trusted medical artificial intelligence solutions that reflect diverse diagnoses of various professionals.
ISSN:	2045-2322 2045-2322
DOI:	10.1038/s41598-024-67879-w