Interpretable deep learning for the prognosis of long-term functional outcome post-stroke using acute diffusion weighted imaging

Author:

Moulton Eric1ORCID,Valabregue Romain12,Piotin Michel3,Marnat Gaultier4,Saleme Suzana5,Lapergue Bertrand6,Lehericy Stephane127,Clarencon Frederic189,Rosso Charlotte1910ORCID

Affiliation:

1. Institut du Cerveau et de la Moelle épinière, ICM, Inserm U 1127, CNRS UMR 7225, Sorbonne Université, Paris, France

2. Centre de Neuro-Imagerie de Recherche, CENIR, ICM, Paris, France

3. Department of Diagnostic and Interventional Neuroradiology, Rothschild Foundation, Paris, France

4. Department of Diagnostic and Interventional Neuroradiology, University Hospital of Bordeaux, Bordeaux, France

5. Diagnostic and Interventional Neuroradiology, University Hospital of Limoges, Limoges, France

6. Department of Stroke Center and Diagnostic and Interventional Neuroradiology, University of Versailles and Saint Quentin en Yvelines, Foch Hospital, Suresnes, France

7. AP-HP Service de Neuroradiologie diagnostique, Hôpital Pitié-Salpêtrière, Paris, France

8. AP-HP Service de Neuroradiologie interventionelle Hôpital Pitié-Salpêtrière, Paris, France

9. ICM iCRIN team: STAR (Stroke Therapy And Registries)

10. AP-HP, Urgences Cérébro-Vasculaires, Hôpital Pitié-Salpêtrière, Paris, France

Abstract

Advances in deep learning can be applied to acute stroke imaging to build powerful and explainable prediction models that could supersede traditionally used biomarkers. We aimed to evaluate the performance and interpretability of a deep learning model based on convolutional neural networks (CNN) in predicting long-term functional outcome with diffusion-weighted imaging (DWI) acquired at day 1 post-stroke. Ischemic stroke patients (n = 322) were included from the ASTER and INSULINFARCT trials as well as the Pitié-Salpêtrière registry. We trained a CNN to predict long-term functional outcome assessed at 3 months with the modified Rankin Scale (dichotomized as good [mRS ≤ 2] vs. poor [mRS ≥ 3]) and compared its performance to two logistic regression models using lesion volume and ASPECTS. The CNN contained an attention mechanism, which allowed to visualize the areas of the brain that drove prediction. The deep learning model yielded a significantly higher area under the curve (0.83 95%CI [0.78–0.87]) than lesion volume (0.78 [0.73–0.83]) and ASPECTS (0.77 [0.71–0.83]) (p < 0.05). Setting all classifiers to the specificity as the deep learning model (i.e., 0.87 [0.82–0.92]), the CNN yielded a significantly higher sensitivity (0.67 [0.59–0.73]) than lesion volume (0.48 [0.40–0.56]) and ASPECTS (0.50 [0.41–0.58]) (p = 0.002). The attention mechanism revealed that the network learned to naturally attend to the lesion to predict outcome.

Publisher

SAGE Publications

Subject

Cardiology and Cardiovascular Medicine,Neurology (clinical),Neurology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3