Deep Learning Algorithm of the SPARCC Scoring System in SI Joint MRI
Background The Spondyloarthritis Research Consortium of Canada (SPARCC) scoring system is a sacroiliitis grading system. Purpose To develop a deep learning‐based pipeline for grading sacroiliitis using the SPARCC scoring system. Study Type Prospective. Population The study included 389 participants...
Gespeichert in:
Veröffentlicht in: | Journal of magnetic resonance imaging 2024-10, Vol.60 (4), p.1390-1399 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Background
The Spondyloarthritis Research Consortium of Canada (SPARCC) scoring system is a sacroiliitis grading system.
Purpose
To develop a deep learning‐based pipeline for grading sacroiliitis using the SPARCC scoring system.
Study Type
Prospective.
Population
The study included 389 participants (42.2‐year‐old, 44.6% female, 317/35/37 for training/validation/testing). A pretrained algorithm was used to differentiate image with/without sacroiliitis.
Field Strength/Sequence
3‐T, short tau inversion recovery (STIR) sequence, fast spine echo.
Assessment
The regions of interest as ground truth for models' training were identified by a rheumatologist (HYC, 10‐year‐experience) and a radiologist (KHL, 6‐year‐experience) using the Assessment of Spondyloarthritis International Society definition of MRI sacroiliitis independently. Another radiologist (YYL, 4.5‐year‐experience) solved the discrepancies. The bone marrow edema (BME) and sacroiliac region models were for segmentation. Frangi‐filter detected vessels used as intense reference. Deep learning pipeline scored using SPARCC scoring system evaluating presence and features of BMEs. A rheumatologist (SCWC, 6‐year‐experience) and a radiologist (VWHL, 14‐year‐experience) scored using the SPARCC scoring system once. The radiologist (YYL) scored twice with 5‐day interval.
Statistical Tests
Independent samples t‐tests and Chi‐squared tests were used. Interobserver and intraobserver reliability by intraclass correlation coefficient (ICC) and Pearson coefficient evaluated consistency between readers and the deep learning pipeline. We evaluated the performance using sensitivity, accuracy, positive predictive value, and Dice coefficient. A P‐value |
---|---|
ISSN: | 1053-1807 1522-2586 1522-2586 |
DOI: | 10.1002/jmri.29211 |