Deep Learning-assisted Diagnosis of Breast Lesions on US Images: A Multivendor, Multicenter Study

To evaluate the diagnostic performance of a deep learning (DL) model for breast US across four hospitals and assess its value to readers with different levels of experience. In this retrospective study, a dual attention-based convolutional neural network was built and validated to discriminate malig...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Radiology. Artificial intelligence 2023-09, Vol.5 (5), p.e220185
Hauptverfasser: Xiang, Huiling, Wang, Xi, Xu, Min, Zhang, Yuhua, Zeng, Shue, Li, Chunyan, Liu, Lixian, Deng, Tingting, Tang, Guoxue, Yan, Cuiju, Ou, Jinjing, Lin, Qingguang, He, Jiehua, Sun, Peng, Li, Anhua, Chen, Hao, Heng, Pheng-Ann, Lin, Xi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:To evaluate the diagnostic performance of a deep learning (DL) model for breast US across four hospitals and assess its value to readers with different levels of experience. In this retrospective study, a dual attention-based convolutional neural network was built and validated to discriminate malignant tumors from benign tumors by using B-mode and color Doppler US images ( = 45 909, March 2011-August 2018), acquired with 42 types of US machines, of 9895 pathologic analysis-confirmed breast lesions in 8797 patients (27 men and 8770 women; mean age, 47 years ± 12 [SD]). With and without assistance from the DL model, three novice readers with less than 5 years of US experience and two experienced readers with 8 and 18 years of US experience, respectively, interpreted 1024 randomly selected lesions. Differences in the areas under the receiver operating characteristic curves (AUCs) were tested using the DeLong test. The DL model using both B-mode and color Doppler US images demonstrated expert-level performance at the lesion level, with an AUC of 0.94 (95% CI: 0.92, 0.95) for the internal set. In external datasets, the AUCs were 0.92 (95% CI: 0.90, 0.94) for hospital 1, 0.91 (95% CI: 0.89, 0.94) for hospital 2, and 0.96 (95% CI: 0.94, 0.98) for hospital 3. DL assistance led to improved AUCs ( < .001) for one experienced and three novice radiologists and improved interobserver agreement. The average false-positive rate was reduced by 7.6% ( = .08). The DL model may help radiologists, especially novice readers, improve accuracy and interobserver agreement of breast tumor diagnosis using US. Ultrasound, Breast, Diagnosis, Breast Cancer, Deep Learning, Ultrasonography © RSNA, 2023.
ISSN:2638-6100
2638-6100
DOI:10.1148/ryai.220185