Abstract
Background
Most research studies focus extensively on predictive algorithms and their performance evaluation. The best predictive models offer optimal prediction solutions in the form of prediction accuracy scores, precision, recall and similar metrics. The prediction accuracy score from performance evaluation has been used as a determining factor when recommending an appropriate model for use. It is one of the most widely used metrics for identifying optimal prediction solutions, irrespective of the context or nature of the dataset, its size, and the output class distribution between the minority and majority variables. The key research question, however, is the impact of using prediction accuracy, as compared with balanced accuracy, in determining model performance in healthcare and other real-world application systems. Answering this question requires an appraisal of the current state of knowledge on the use of both prediction accuracy and balanced accuracy in real-world applications, including a search for related works that highlight appropriate machine learning methodologies and techniques.
Materials and methods
A systematic review of related research works was conducted through an adopted search strategy protocol for relevant literature, focusing on the following characteristics: the current state of knowledge with respect to ML techniques, applications and evaluations; research works using the prediction accuracy score as an evaluation metric; and research works in real-world contexts with appropriate methodologies. No specific search timelines were defined for this review; the motivation for not specifying a search period was to include as many important works as possible, irrespective of their date of publication. Of particular interest were related works on healthcare systems and other real-world applications (spam detection, fraud prediction, risk prediction, etc.).
Results
Observations from the related literature indicate extensive use of machine learning techniques in real-world applications. The predominantly used machine learning techniques were random forest, support vector machine, logistic regression, K-nearest neighbor, decision trees, gradient boosting classifier and a few ensemble techniques. The use of evaluation performance metrics such as precision, recall, F1-score, prediction accuracy and, in a few instances, predicted positive and predicted negative values as justification for best-model recommendation was also noted. Of interest is the use of prediction accuracy as the predominant metric for assessing model performance across all the related literature identified.
Conclusions
In light of the challenges identified with the use of prediction accuracy as a performance measure for best model predictions, we propose a novel evaluation approach for predictive modeling within the healthcare systems context, called PMEA (Proposed Model Evaluation Approach), which can be generalized to similar contexts. PMEA addresses the challenges of using prediction accuracy by relying on the balanced accuracy score, derived from the two most important evaluation metrics (the true positive rate and the true negative rate: TPR, TNR), to estimate best model performance in context more accurately. Identifying an appropriate evaluation metric for performance assessment will ensure a true determination of the best-performing prediction model for recommendation.
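The balanced accuracy referred to in the conclusions is the arithmetic mean of the true positive rate and the true negative rate. The minimal sketch below is a hypothetical illustration with toy labels, not the authors' PMEA procedure; it assumes scikit-learn is available and shows how, on an imbalanced dataset, a majority-class classifier can score a high prediction accuracy while its balanced accuracy exposes the poor performance.

# Minimal sketch (toy data, not the study's PMEA implementation): prediction
# accuracy vs. balanced accuracy = (TPR + TNR) / 2 on an imbalanced dataset.
from sklearn.metrics import accuracy_score, balanced_accuracy_score, confusion_matrix

# Imbalanced ground truth: 90 negatives, 10 positives.
y_true = [0] * 90 + [1] * 10
# A degenerate classifier that always predicts the majority (negative) class.
y_pred = [0] * 100

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
tpr = tp / (tp + fn)  # sensitivity / recall for the positive class -> 0.0
tnr = tn / (tn + fp)  # specificity for the negative class -> 1.0

print(accuracy_score(y_true, y_pred))           # 0.90 -- looks strong
print((tpr + tnr) / 2)                          # 0.50 -- balanced accuracy by hand
print(balanced_accuracy_score(y_true, y_pred))  # 0.50 -- same value via scikit-learn

The 0.90 accuracy reflects only the class imbalance, while the 0.50 balanced accuracy shows the model has no discriminative ability, which is the kind of discrepancy the abstract's proposed evaluation approach is intended to surface.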
Publisher
Cold Spring Harbor Laboratory