Abstract
Heart diseases rank among the leading causes of mortality worldwide. They take various forms, including vascular, ischemic, and hypertensive heart disease. Electronic Health Records (EHR) report a large number of medical features for each patient, which allow physicians to diagnose and monitor heart disease. We collected a dataset from Medica Norte Hospital in Mexico that includes 800 records and 141 indicators such as age, weight, glucose, blood pressure rate, and clinical symptoms. The collected records are highly unbalanced across the types of heart disease: 17% of records have hypertensive heart disease, 16% have ischemic heart disease, 7% have mixed heart disease, and 8% have valvular heart disease. We propose an ensemble-learning framework that combines different neural network models with a method of aggregating random under-sampling. To improve the performance of the classification algorithms, we implement a data preprocessing step with feature selection. Experiments were conducted with unidirectional and bidirectional neural network models, and the results showed that an ensemble classifier combining a BiLSTM or BiGRU model with a CNN model had the best classification performance, with accuracy and F1-score between 91% and 96% for the different types of heart disease. These results are competitive and promising for a heart disease dataset. We showed that an ensemble-learning framework based on deep models can overcome the problem of classifying an unbalanced heart disease dataset. Our proposed framework can lead to highly accurate models that are adapted to real clinical data and diagnostic use.
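The idea of aggregating random under-sampling can be illustrated with a minimal sketch. It is not the authors' implementation: the base learner here is a toy nearest-centroid classifier standing in for the neural network models, and the names `undersample`, `NearestCentroid`, `fit_ensemble`, and `predict_majority` are hypothetical. Each ensemble member is trained on a different class-balanced random subset, and predictions are combined by majority vote, which is one plausible aggregation rule among several.

```python
import random
from collections import Counter


def undersample(labels, rng):
    """Return indices of a class-balanced subset: every class is randomly
    down-sampled, without replacement, to the size of the rarest class."""
    by_class = {}
    for i, label in enumerate(labels):
        by_class.setdefault(label, []).append(i)
    n_min = min(len(idx) for idx in by_class.values())
    picked = []
    for idx in by_class.values():
        picked.extend(rng.sample(idx, n_min))
    return picked


class NearestCentroid:
    """Toy base learner (assumption: stands in for a BiLSTM/BiGRU/CNN model)."""

    def fit(self, X, y):
        sums, counts = {}, Counter(y)
        for row, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(row))
            for j, value in enumerate(row):
                acc[j] += value
        # Mean feature vector per class.
        self.centroids = {c: [s / counts[c] for s in acc]
                          for c, acc in sums.items()}
        return self

    def predict_one(self, row):
        def dist2(c):
            return sum((a - b) ** 2 for a, b in zip(row, self.centroids[c]))
        return min(self.centroids, key=dist2)


def fit_ensemble(X, y, n_models=5, seed=0):
    """Train each member on its own balanced random under-sample."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        idx = undersample(y, rng)
        models.append(NearestCentroid().fit([X[i] for i in idx],
                                            [y[i] for i in idx]))
    return models


def predict_majority(models, row):
    """Aggregate the members' predictions by majority vote."""
    votes = Counter(m.predict_one(row) for m in models)
    return votes.most_common(1)[0][0]
```

Because every member sees a different random subset of the majority class, the ensemble as a whole uses more of the majority data than a single under-sampled model would, while each member still trains on balanced classes.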