Data pre-processing for cardiovascular disease classification: A systematic literature review-Reference-Cited by-同舟云学术

Data pre-processing for cardiovascular disease classification: A systematic literature review

Published:2023-01-05 Issue:1 Volume:44 Page:1525-1545
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Javid Irfan¹²,Ghazali Rozaida¹,Zulqarnain Muhammad³,Hassan Norlida¹

Affiliation:

1. Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn, Malaysia

2. Department of Computer Science & IT, University of Poonch Rawalakot, AJK, Pakistan

3. Riphah College of Computing, Riphah International University Faisalabad Campus, Pakistan

Abstract

The important task in the medical field is the early detection of disease. Heart disease is one of the greatest challenging diseases in all other diseases subsequently 17.3 million people died once a year due to heart disease. A minute error in heart disease diagnosis is a risk for an individual lifespan. Precise heart disease diagnosis is consequently critical. Different approaches including data mining have been used for the prediction of heart disease. However, there are some solemn concerns related to the data quality for example inconsistencies, missing values, noise, high dimensionality, and imbalanced statistics. In order to improve the accuracy of Data Mining based prediction systems, techniques for data preparation were applied to increase the quality of the data. The foremost objective of this paper is to highlight and summarize the research work about (i) data preparation techniques mostly used, (ii) the impact of pre-processing procedures on the accuracy of a heart disease prediction system, (iii) classifier enactment with data pre-processing techniques, (4) comparison in terms of accuracy of the different pre-processing model. A systematic literature review on the use of data pre-processing in heart disease diagnosis is carried out from January 2001 to July 2021 by studying the published material. Almost 30 studies were designated and examined related to the above-mentioned benchmarks. The literature review concludes that data reduction and data cleaning pre-processing techniques are mostly used in heart disease prediction systems. Overall this study concludes that data pre-processing has improved the accuracy of models used for heart disease prediction. Some hybrid models including (ANN+CHI), (ANN+PCA), (DNN+CHI) and (SVM+PCA) have shown improved accuracy level. However, due to the lack of clarification, there is a number of limitations and challenges in order to implementing these models in the real world.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference86 articles.

1. Irfan Javid , Ahmed Khalaf Zager Alsaedi , Rozaida Ghazali , Accuracy of Heart Disease Prediction using Machine Learning and Recurrent Neural Networks Ensemble Majority Voting Method, International Journal of Advanced Computer Science and Applications (IJACSA) 11(3) (2020). https://dx.doi.org/10.14569/IJACSA.2020.0110369.

2. A new data preparation method based on clustering algorithms for diagnosis systems of heart and diabetes diseases;Yilmaz;J Med Syst,2014

3. Hybrid of firefly algorithm and pattern search for solving optimization problems;Wahid;Evol Intel,2019

4. Wrapper method for feature selection to classify cardiac arrhythmia,;Mustaqeem;Annu Int Conf IEEE Eng Med Biol Soc,2017

5. Systematic mapping study of data mining–based empirical studies in cardiology

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Robust Heart Disease Prognosis: Integrating Extended Isolation Forest Outlier Detection with Advanced Prediction Models;Lecture Notes in Networks and Systems;2024

2. An ARIMA and XGBoost Model Utilized for Forecasting Municipal Solid Waste Generation;Communications in Computer and Information Science;2023