Deep Learning Applied to Diffusion-weighted Imaging for Differentiating Malignant from Benign Breast Tumors without Lesion Segmentation

Purpose To evaluate and compare the performance of different artificial intelligence (AI) models in differentiating between benign and malignant breast tumors at diffusion-weighted imaging (DWI), including comparison with radiologist assessments. Materials and Methods In this retrospective study, pa...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Radiology. Artificial intelligence 2025-01, Vol.7 (1), p.e240206
Hauptverfasser: Iima, Mami, Mizuno, Ryosuke, Kataoka, Masako, Tsuji, Kazuki, Yamazaki, Toshiki, Minami, Akihiko, Honda, Maya, Imanishi, Keiho, Takada, Masahiro, Nakamoto, Yuji
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Purpose To evaluate and compare the performance of different artificial intelligence (AI) models in differentiating between benign and malignant breast tumors at diffusion-weighted imaging (DWI), including comparison with radiologist assessments. Materials and Methods In this retrospective study, patients with breast lesions underwent 3-T breast MRI from May 2019 to March 2022. In addition to T1-weighted imaging, T2-weighted imaging, and contrast-enhanced imaging, DWI was performed with five values (0, 200, 800, 1000, and 1500 sec/mm ). DWI data split into training and tuning and test sets were used for the development and assessment of AI models, including a small two-dimensional (2D) convolutional neural network (CNN), ResNet-18, EfficientNet-B0, and a three-dimensional (3D) CNN. Performance of the DWI-based models in differentiating between benign and malignant breast tumors was compared with that of radiologists assessing standard breast MR images, with diagnostic performance assessed using receiver operating characteristic analysis. The study also examined data augmentation effects (augmentation A: random elastic deformation, augmentation B: random affine transformation and random noise, and augmentation C: mixup) on model performance. Results A total of 334 breast lesions in 293 patients (mean age, 54.9 years ± 14.3 [SD]; all female) were analyzed. The 2D CNN models outperformed the 3D CNN on the test dataset (area under the receiver operating characteristic curve [AUC] with different data augmentation methods: range, 0.83-0.88 vs 0.75-0.76). There was no evidence of a difference in performance between the small 2D CNN with augmentations A and B (AUC: 0.88) and the radiologists (AUC: 0.86) on the test dataset ( = .64). When comparing the small 2D CNN to radiologists, there was no evidence of a difference in specificity (81.4% vs 72.1%, = .64) or sensitivity (85.9% vs 98.8%, = .64). Conclusion AI models, particularly a small 2D CNN, showed good performance in differentiating between malignant and benign breast tumors using DWI, without needing manual segmentation. MR Imaging, Breast, Comparative Studies, Feature Detection, Diagnosis ©RSNA, 2024.
ISSN:2638-6100
2638-6100
DOI:10.1148/ryai.240206