Abstract
Heart diseases and stroke are the number one cause of death and disability among people with type 2 diabetes (T2D). Clinicians and health authorities for many years have expressed interest in identifying individuals at increased risk of coronary heart disease (CHD). Our main objective is to develop a prognostic workflow of CHD in T2D patients using a Holter dataset. This workflow development will be based on machine learning techniques by testing a variety of classifiers and subsequent selection of the best performing system. It will also assess the impact of feature selection and bootstrapping techniques over these systems. Among a variety of classifiers such as Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), Alternating Decision Tree (ADT), Random Tree (RT) and K-Nearest Neighbour (KNN), the best performing classifier is NB. We achieved an area under receiver operating characteristics curve (AUC) of 68,06% and 74,33% for a prognosis of 3 and 4 years, respectively.
Reference35 articles.
1. Instance-based learning algorithms
2. Assign Score - Prioritising Prevention of Cardiovascular Disease. (n.d.). Retrieved March 2015 from http://assign-score.com/
3. SMOTE: Synthetic minority over-sampling technique.;N. V.Chawla;Journal of Artificial Intelligence Research,2011