Unsupervised machine-learning identifies clinically distinct subtypes of ALS that reflect different genetic architectures and biological mechanisms

Author:

Spargo Thomas PORCID,Marriott HeatherORCID,Hunt Guy P,Pain OliverORCID,Kabiljo Renata,Bowles Harry,Sproviero William,Gillett Alexandra CORCID,Fogh Isabella,Andersen Peter M.,Başak Nazli A.,Shaw Pamela J.,Corcia Philippe,Couratier Philippe,de Carvalho MamedeORCID,Drory Vivian,Glass Jonathan D.,Gotkine Marc,Hardiman Orla,Landers John E.,McLaughlin Russell,Mora Pardina Jesús S.,Morrison Karen E.,Pinto Susana,Povedano Monica,Shaw Christopher E.,Silani VincenzoORCID,Ticozzi Nicola,Damme Philip VanORCID,van den Berg Leonard H.,Vourc’h Patrick,Weber Markus,Veldink Jan H.ORCID,Dobson Richard J.B.,Khleifat Ahmad Al,Cummins Nicholas,Stahl Daniel,Al-Chalabi AmmarORCID,Iacoangeli Alfredo,

Abstract

AbstractBackgroundAmyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease characterised by a highly variable clinical presentation and multifaceted genetic and biological bases that translate into great patient heterogeneity. The identification of homogeneous subgroups of patients in terms of both clinical presentation and biological causes, could favour the development of effective treatments, healthcare, and clinical trials. We aimed to identify and characterise homogenous clinical subgroups of ALS, examining whether they represent underlying biological trends.MethodsLatent class clustering analysis, an unsupervised machine-learning method, was used to identify homogenous subpopulations in 6,523 people with ALS from Project MinE, using widely collected ALS-related clinical variables. The clusters were validated using 7,829 independent patients from STRENGTH. We tested whether the identified subgroups were associated with biological trends in genetic variation across genes previously linked to ALS, polygenic risk scores of ALS and related neuropsychiatric traits, and in gene expression data from post-mortem motor cortex samples.ResultsWe identified five ALS subgroups based on patterns in clinical data which were general across international datasets. Distinct genetic trends were observed for rare variants in theSOD1andC9orf72genes, and across genes implicated in biological processes relevant to ALS. Polygenic risk scores of ALS, schizophrenia and Parkinson’s disease were also higher in distinct clusters with respect to controls. Gene expression analysis identified different altered biological processes across clusters reflecting the genetic differences. We developed a machine learning classifier based on our model to assign subgroup membership using clinical data available at first visit, and made it available on a public webserver athttp://latentclusterals.er.kcl.ac.uk.ConclusionALS subgroups characterised by highly distinct clinical presentations were discovered and validated in two large independent international datasets. Such groups were also characterised by different underlying genetic architectures and biology. Our results showed that data-driven patient stratification into more clinically and biologically homogeneous subtypes of ALS is possible and could help develop more effective and targeted approaches to the biomedical and clinical study of ALS.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3