An Artificial Intelligence Model Based on ACR TI-RADS Characteristics for US Diagnosis of Thyroid Nodules
Background US-based diagnosis of thyroid nodules is subjective and influenced by radiologists' experience levels. Purpose To develop an artificial intelligence model based on American College of Radiology Thyroid Imaging Reporting and Data System characteristics for diagnosing thyroid nodules a...
Gespeichert in:
Veröffentlicht in: | Radiology 2022-06, Vol.303 (3), p.613-619 |
---|---|
Hauptverfasser: | , , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Background US-based diagnosis of thyroid nodules is subjective and influenced by radiologists' experience levels. Purpose To develop an artificial intelligence model based on American College of Radiology Thyroid Imaging Reporting and Data System characteristics for diagnosing thyroid nodules and identifying nodule characteristics (hereafter, M
) and to compare the performance of M
, radiologists, and a model trained on benign and malignant status based on surgical histopathologic analysis (hereafter, M
). Materials and Methods In this retrospective study, 1588 surgically proven nodules from 636 consecutive patients (mean age, 49 years ± 14 [SD]; 485 women) were included. M
and M
were trained on US images of 1345 nodules (January 2018 to December 2019). The performance of M
was compared with that of M
and radiologists with different experience levels on the test data set (243 nodules, January 2019 to December 2019) with the DeLong method and McNemar test. Results The area under the receiver operating characteristic curve (AUC) and sensitivity of M
were 0.91 and 83% (55 of 66 nodules), respectively, which were not significantly different from those of experienced radiologists (0.93 [
= .45] and 92% [61 of 66 nodules;
= .07]) and exceeded those of junior radiologists (0.78 [
< .001] and 70% [46 of 66 nodules;
= .04]). The specificity of M
(87% [154 of 177 nodules]) was higher than that of both experienced and junior radiologists (80% [141 of 177 nodules;
= .02] and 75% [133 of 177 nodules;
= .001], respectively). The AUC of M
was higher than that of M
(0.91 vs 0.84, respectively;
= .001). In the test set of 243 nodules, the consistency rates between M
and the experienced group were higher than those between M
and the junior group for composition (79% [
= 193] vs 73% [
= 178], respectively;
= .02), echogenicity (75% [
= 183] vs 68% [
= 166];
= .04), shape (93% [
= 227] vs 88% [
= 215];
= .04), and smooth or ill-defined margin (72% [
= 174] vs 63% [
= 152];
= .002). Conclusion The area under the receiver operating characteristic curve (AUC) of an artificial intelligence model based on the American College of Radiology Thyroid Imaging Reporting and Data System (TI-RADS) was higher than that of a model trained on benign and malignant status based on surgical histopathologic analysis. The AUC and sensitivity of the model based on TI-RADS exceeded those of junior radiologists; the specificity of the model was higher than that of both experienced and junior radiolo |
---|---|
ISSN: | 0033-8419 1527-1315 |
DOI: | 10.1148/radiol.211455 |