Deep Learning Measurement of Leg Length Discrepancy in Children Based on Radiographs
Background: Radiographic measurement of leg length discrepancy (LLD) is time consuming yet cognitively simple for pediatric radiologists. Purpose: To compare deep learning (DL) measurements of LLD in pediatric patients to measurements performed by radiologists. Materials and Methods: For this HIPAA-...
Gespeichert in:
Veröffentlicht in: | Radiology 2020-07, Vol.296 (1), p.152-158 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Background: Radiographic measurement of leg length discrepancy (LLD) is time consuming yet cognitively simple for pediatric radiologists.
Purpose: To compare deep learning (DL) measurements of LLD in pediatric patients to measurements performed by radiologists.
Materials and Methods: For this HIPAA-compliant retrospective study, radiographs obtained to evaluate LLD in children between January and August 2018 were identified. LLD was automatically measured by means of image segmentation followed by leg length calculation. On training data, a DL model was trained to segment femurs and tibias on radiographs. The validation set was used to select the optimized model. On testing data, leg lengths were calculated from segmentation masks and compared with measurements from the radiology report. Statistical analysis was performed by using a paired Wilcoxon signed-rank test to compare DL calculations and radiology reports. In addition, the measurement time was manually assessed by a pediatric radiologist and automatically assessed by the DL model on a randomly chosen group of 26 cases; the values were compared with the paired Wilcoxon signed-rank test.
Results: Radiographs obtained to evaluate LLD in 179 children (mean age 6 standard deviation, 12 years 6 3; age range, 5-19 years; 89 boys and 90 girls) were evaluated. Radiographs were randomly divided into training, validation, and testing sets and consisted of studies from 70, 32, and 77 patients, respectively. In the training and validation sets, the DL model showed a high spatial overlap between manual and automatic segmentation masks of pediatric legs (Dice similarity coefficient, 0.94). For the testing set, the correlation between radiology reports and DL-calculated lengths of separated femurs and tibias (r = 0.99; mean absolute error[MAE], 0.45 cm), full pediatric leg lengths (r = 0.99; MAE, 0.45 cm), and full LLD (r = 0.92; MAE, 0.51 cm) was high (P |
---|---|
ISSN: | 0033-8419 1527-1315 |
DOI: | 10.1148/radiol.2020192003 |