Affiliation:
1. Department of Diagnostic Radiology The University of Hong Kong Hong Kong China
2. Division of Rheumatology and Clinical Immunology The University of Hong Kong Hong Kong China
3. Department of Medicine The University of Hong Kong Hong Kong China
4. Department of diagnostic radiology Queen Mary Hospital Hong Kong China
5. Glenagles Hospital Hong Kong China
Abstract
BackgroundThe Spondyloarthritis Research Consortium of Canada (SPARCC) scoring system is a sacroiliitis grading system.PurposeTo develop a deep learning‐based pipeline for grading sacroiliitis using the SPARCC scoring system.Study TypeProspective.PopulationThe study included 389 participants (42.2‐year‐old, 44.6% female, 317/35/37 for training/validation/testing). A pretrained algorithm was used to differentiate image with/without sacroiliitis.Field Strength/Sequence3‐T, short tau inversion recovery (STIR) sequence, fast spine echo.AssessmentThe regions of interest as ground truth for models' training were identified by a rheumatologist (HYC, 10‐year‐experience) and a radiologist (KHL, 6‐year‐experience) using the Assessment of Spondyloarthritis International Society definition of MRI sacroiliitis independently. Another radiologist (YYL, 4.5‐year‐experience) solved the discrepancies. The bone marrow edema (BME) and sacroiliac region models were for segmentation. Frangi‐filter detected vessels used as intense reference. Deep learning pipeline scored using SPARCC scoring system evaluating presence and features of BMEs. A rheumatologist (SCWC, 6‐year‐experience) and a radiologist (VWHL, 14‐year‐experience) scored using the SPARCC scoring system once. The radiologist (YYL) scored twice with 5‐day interval.Statistical TestsIndependent samples t‐tests and Chi‐squared tests were used. Interobserver and intraobserver reliability by intraclass correlation coefficient (ICC) and Pearson coefficient evaluated consistency between readers and the deep learning pipeline. We evaluated the performance using sensitivity, accuracy, positive predictive value, and Dice coefficient. A P‐value <0.05 was considered statistically significant.ResultsThe ICC and the Pearson coefficient between the SPARCC scores from three readers and the deep learning pipeline were 0.83 and 0.86, respectively. The sensitivity in identifying BME and accuracy of identifying SI joints and blood vessels was 0.83, 0.90, and 0.88, respectively. The dice coefficients were 0.82 (sacrum) and 0.80 (ilium).Data ConclusionThe high consistency with human readers indicated that deep learning pipeline may provide a SPARCC‐informed deep learning approach for scoring of STIR images in spondyloarthritis.Evidence Level1Technical EfficacyStage 2
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献