Affiliation:
1. Dokuz Eylül Üniversitesi Yönetim Bilişim Sistemleri Bölümü
2. BOLU ABANT İZZET BAYSAL ÜNİVERSİTESİ
3. Karabağlar 17 Nolu Aile Sağlığı Merkezi
4. İZMİR KATİP ÇELEBİ ÜNİVERSİTESİ, TIP FAKÜLTESİ
Abstract
Purpose: Vitamin D level is emphasized as an important biomarker in determining risk factors for different diseases. Vitamin D is an important vitamin for human health and its deficiency is associated with serious health problems. Therefore, it is of great importance to detect vitamin D deficiency, which can be easily prevented and treated. The possible relationship between vitamin D deficiency and musculoskeletal pain, osteoporosis, diabetes mellitus, hypertension is frequently discussed in researches. In this research, it is aimed to analyze the factors in determining the vitamin D level and the decision rules related to it.
Methods: A descriptive framework based on one of the machine learning techniques, that is decision tree is followed. The data used to create the decision rules were obtained from volunteers between the ages of 18-85 who applied to Izmir Katip Çelebi University Atatürk Training and Research Hospital Infectious Diseases and Family Medicine Polyclinics and agreed to participate in the study between 01.03.2017 and 01.09.2017.
Results: It was observed that age, gender and laboratory test values are strong predictors for vitamin D level. As a result of two CART (Classification and Regression Trees) models, %90.47 and %95 predictive accuracies were observed respectively. In the first model, uric acid, age and creatine; in the second model TSH, ALP and smoking(yes) were the most important three biomarkers affecting vitamin D level.
Discussion: The collected features give a comprehensive list of variables that have an effect on vitamin D in the dataset under consideration. Important findings of the study include not only the identification of these variables, but also the effective categorization determination procedures. In contrast to previous research, the Age variable is the most influential factor within the scope of this dataset, which includes demographic information on patients and their existing disorders.
Publisher
Dokuz Eyul Universitesi Saglik Bilimleri Enstitusu
Reference37 articles.
1. Bates DW, Saria S, Ohno-Machado L, Shah A, Escobar G. Big Data In Health Care: Using Analytics To Identify And Manage High-Risk And High-Cost Patients. Health Aff (Millwood). 2014 Jul 1;33(7):1123–31.
2. Cui A, Zhang T, Xiao P, Fan Z, Wang H, Zhuang Y. Global and regional prevalence of vitamin D deficiency in population-based studies from 2000 to 2022: A pooled analysis of 7.9 million participants. Front Nutr. 2023;10:1070808.
3. The vitamin D deficiency pandemic: Approaches for diagnosis, treatment and prevention - PubMed [Internet]. [cited 2024 Feb 25]. Available from: https://pubmed.ncbi.nlm.nih.gov/28516265/
4. Bhan I, Burnett-Bowie SAM, Ye J, Tonelli M, Thadhani R. Clinical measures identify vitamin D deficiency in dialysis. Clin J Am Soc Nephrol CJASN. 2010 Mar;5(3):460–7.
5. Gonoodi K, Tayefi M, Saberi-Karimian M, Amirabadi zadeh A, Darroudi S, Farahmand SK, et al. An assessment of the risk factors for vitamin D deficiency using a decision tree model. Diabetes Metab Syndr Clin Res Rev. 2019 May 1;13(3):1773–7.