Machine learning using genetic and clinical data identifies a signature that robustly predicts methotrexate response in rheumatoid arthritis

Author:

Lim Lee Jin1,Lim Ashley J W1,Ooi Brandon N S1,Tan Justina Wei Lynn2,Koh Ee Tzun2,Ang Andrea Ee Ling,Chan Grace Yin Lai,Chan Madelynn Tsu-Li,Chia Faith Li-Ann,Chng Hiok Hee,Chua Choon Guan,Howe Hwee Siew,Koh Ee Tzun,Koh Li Wearn,Kong Kok Ooi,Law Weng Giap,Lee Samuel Shang Ming,Leong Khai PangORCID,Lian Tsui Yee,Lim Xin Rong,Loh Jess Mung Ee,Manghani Mona,Tan Justina Wei Lynn,Tan Sze-Chin,Teo Claire Min-Li,Thong Bernard Yu-Hor,Tjokrosaputro Paula Permatasari,Xu Chuanhui,Chong Samuel S3,Khor Chiea Chuen4,Tucker-Kellogg Lisa5ORCID,Lee Caroline G1678ORCID,Leong Khai Pang29ORCID,

Affiliation:

1. Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore

2. Department of Rheumatology, Allergy and Immunology, Tan Tock Seng Hospital

3. Department of Pediatrics, Yong Loo Lin School of Medicine, National University of Singapore

4. Division of Human Genetics, Genome Institute of Singapore

5. Centre for Computational Biology, and Cancer and Stem Cell Biology, Duke-NUS Medical School

6. Division of Cellular & Molecular Research, Humphrey Oei Institute of Cancer Research, National Cancer Centre Singapore

7. Duke-NUS Medical School

8. NUS Graduate School, National University of Singapore

9. Clinical Research & Innovation Office, Tan Tock Seng Hospital , Singapore

Abstract

Abstract Objective To develop a hypothesis-free model that best predicts response to MTX drug in RA patients utilizing biologically meaningful genetic feature selection of potentially functional single nucleotide polymorphisms (pfSNPs) through robust machine learning (ML) feature selection methods. Methods MTX-treated RA patients with known response were divided in a 4:1 ratio into training and test sets. From the patients’ exomes, potential features for classifier prediction were identified from pfSNPs and non-genetic factors through ML using recursive feature elimination with cross-validation incorporating the random forest classifier. Feature selection was repeated on random subsets of the training cohort, and consensus features were assembled into the final feature set. This feature set was evaluated for predictive potential using six ML classifiers, first by cross-validation within the training set, and finally by analysing its performance with the unseen test set. Results The final feature set contains 56 pfSNPs and five non-genetic factors. The majority of these pfSNPs are located in pathways related to RA pathogenesis or MTX action and are predicted to modulate gene expression. When used for training in six ML classifiers, performance was good in both the training set (area under the curve: 0.855–0.916; sensitivity: 0.715–0.892; and specificity: 0.733–0.862) and the unseen test set (area under the curve: 0.751–0.826; sensitivity: 0.581–0.839; and specificity: 0.641–0.923). Conclusion Sensitive and specific predictors of MTX response in RA patients were identified in this study through a novel strategy combining biologically meaningful and machine learning feature selection and training. These predictors may facilitate better treatment decision-making in RA management.

Funder

Singapore Ministry of Health’s National Medical Research Council

National Cancer Center Research Fund

Duke-NUS Medical School to Associate Professor

Singapore Ministry of Education Academic Research Fund Tier 2

Publisher

Oxford University Press (OUP)

Subject

Pharmacology (medical),Rheumatology

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3