Early Prediction of Lupus Disease: A Study on the Variations of Decision Tree Models

Author:

Singh Jagjiven Kaur Jasber1,Ponnusamy Raja Rajeswari1,Ling Elaine Chan Wan2,Chin Lim Sern3

Affiliation:

1. Asia Pacific University of Technology and Innovation

2. International Medical University

3. Universiti Teknologi Mara

Abstract

Abstract Systematic Lupus Erythematosus (SLE) is an irreversible autoimmune disease that has seen to bring a lot of negative effect on the human body. It has become a very challenging task in predicting the prevalence of Lupus in patients. It has slowly gained popularity among many researchers to study the prevalence of this disease and developing prediction models that not only study the prevalence of the disease but is also able to predict suitable dosage requirements, treatment effectiveness and the severity of the disease in patients. All of these is usually done with medical records or clinical data that has different attributes related and significant to the analysis done. With the advancement in machine learning models and ensemble techniques, accurate prediction models have been developed. However, these models are not able to explain the significant contributing factors as well as correctly classify the severity of the disease. Decision Tree Classifier, Random Forest Classifier and Extreme Gradient Boosting (XGBoost) are the models that will be used in this paper to predict the early prevalence to Lupus Disease in patients using clinical records. The most significant factors affecting Systematic Lupus Erythematosus (SLE) will then be identified to aid medical practitioners to take suitable preventive measures that can manage the complications that arise from the disease. Hence, this paper aims to assess the performance of tree models by performing several experiments on the hyper parameters to develop a more accurate model that is able to classify Lupus Disease in patients in the early stages. Findings revealed that the best model was the Random Forest Classifier with parameter tuning. The most significant factor that affected the presence of Lupus Disease in patients was identified as the Ethnicity and the Renal Outcome or the kidney function of the patients.

Funder

Asia Pacific University of Technology and Innovation

Publisher

Research Square Platform LLC

Reference45 articles.

1. Lupus or not? SLE Risk Probability Index (SLERPI): a simple, clinician-friendly machine learning-based model to assist the diagnosis of systemic lupus erythematosus;Adamichou C;Annals of the Rheumatic Diseases, [online],2021

2. Burden of lupus on work: Issues in the employment of individuals with lupus;Agarwal N;Work, [online],2016

3. Systemic lupus erythematosus in Iran: a study of 2280 patients over 33 years;Akbarian M;International Journal of Rheumatic Diseases, [online],2010

4. Analysis and prediction of diabetes mellitus using machine learning algorithm;Alehegn M;International Journal of Pure and Applied Mathematics, [online],2018

5. Apte, A. (2018). 3 Ways to Load CSV files into Colab. Medium. Retrieved 19 October 2021, from https://towardsdatascience.com/3-ways-to-load-csv-files-into-colab-7c14fcbdcb92.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3