Using Machine Learning to identify microRNA biomarkers for predisposition to Huntington’s Disease

Author:

Patel KORCID,Sheridan C,Shanley DP

Abstract

AbstractBackgroundHuntington’s disease (HD) is an autosomal dominant disease which is triggered by a large expansion of CAG nucleotides in theHTTgene. While the CAG expansion linearly correlates with the age of disease onset in HD, twin-studies and cohorts of Juvenile Onset HD (JOHD) patients have shown other factors influence the progression of HD. Thus, it would be of interest to identify molecular biomarkers which indicate predisposition to the development of HD, and as microRNAs (miRNAs) circulate in bio-fluids they would be particularly useful biomarkers. We explored a large HD miRNA-mRNA expression dataset (GSE65776) to establish appropriate questions that could be addressed using Machine Learning (ML). We sought sets of features (mRNAs or miRNAs) to predict HD or WT samples from aged or young mouse cortex samples, and we asked if a set of features could predict predisposition to HD or WT genotypes by training models on aged samples and testing the models on young samples. Several models were created using ADAboost, ExtraTrees, GaussianNB and Random Forest, and the best performing models were further analysed using AUC curves and PCA plots. Finally, genes used to train our miRNA-based predisposition model were mined from HD patient bio-fluid samples.ResultsOur testing accuracies were between 66-100% and AUC scores were between 31-100%. We generated several excellent models with testing accuracies >80% and AUC scores >90%. We also identified homologues ofmmu-miR-154-5p,mmu-miR-181a-5p,mmu-miR-212-3p, mmu-miR-378b, mmu-miR-382-5pandmmu-miR-770-5pfrom our miRNA-based predisposition model to be circulating in HD patient blood samples at p.values of <0.05.ConclusionsWe generated several age-based models which could differentiate between HD and WT samples, including an aged mRNA-based model with a 100% AUC score, an aged miRNA-based model with a 92% AUC score and an aged miRNA-based model with a 96% AUC score. We also identified several miRNAs used to train our miRNA-based predisposition model which were detectable in HD patient blood samples, which suggests they could be potential candidates for use as non-invasive biomarkers for HD research.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3