Building the Model

Published: 2022-10-10
ISSN: 1543-2165
Container-title: Archives of Pathology & Laboratory Medicine
Language: en

Author:

Yang He S.¹, Rhoads Daniel D.²³, Sepulveda Jorge⁴, Zang Chengxi⁵, Chadburn Amy¹, Wang Fei⁵

Affiliation:

1. From the Department of Pathology and Laboratory Medicine (Yang, Chadburn), Weill Cornell Medicine, New York, New York.

2. From the Department of Laboratory Medicine, Cleveland Clinic, Cleveland, Ohio (Rhoads).

3. From the Department of Pathology, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, Ohio (Rhoads).

4. From the Department of Pathology, School of Medicine and Health Sciences, George Washington University, Washington, District of Columbia (Sepulveda).

5. From the Department of Population Health Sciences (Zang, Wang), Weill Cornell Medicine, New York, New York.

Abstract

Context: Machine learning (ML) allows for the analysis of massive quantities of high-dimensional clinical laboratory data, thereby revealing complex patterns and trends. Thus, ML can potentially improve the efficiency of clinical data interpretation and the practice of laboratory medicine. However, the risks of generating biased or unrepresentative models, which can lead to misleading clinical conclusions or overestimation of the model performance, should be recognized.

Objectives: To discuss the major components for creating ML models, including data collection, data preprocessing, model development, and model evaluation. We also highlight many of the challenges and pitfalls in developing ML models, which could result in misleading clinical impressions or inaccurate model performance, and provide suggestions and guidance on how to circumvent these challenges.

Data Sources: The references for this review were identified through searches of the PubMed database, the US Food and Drug Administration white papers and guidelines, conference abstracts, and online preprints.

Conclusions: With the growing interest in developing and implementing ML models in clinical practice, laboratorians and clinicians need to be educated in order to collect sufficiently large and high-quality data, properly report the data set characteristics, and combine data from multiple institutions with proper normalization. They will also need to assess the reasons for missing values, determine the inclusion or exclusion of outliers, and evaluate the completeness of a data set. In addition, they require the necessary knowledge to select a suitable ML model for a specific clinical question and accurately evaluate the performance of the ML model, based on objective criteria. Domain-specific knowledge is critical in the entire workflow of developing ML models.

Publisher

Archives of Pathology and Laboratory Medicine

Subject

Medical Laboratory Technology, General Medicine, Pathology and Forensic Medicine

Link

https://meridian.allenpress.com/aplm/article-pdf/doi/10.5858/arpa.2021-0635-RA/3129707/10.5858_arpa.2021-0635-ra.pdf

Cited by 5 articles.

1. Machine learning – Driven surface grafting of thin-film composite reverse osmosis (TFC-RO) membrane; Desalination; 2024-06

2. Decoding the neurotoxic effects of propofol: insights into the RARα-Snhg1-Bdnf regulatory cascade; American Journal of Physiology-Cell Physiology; 2024-06-01

3. A predictive model for disease severity among COVID-19 elderly patients based on IgG subtypes and machine learning; Frontiers in Immunology; 2023-11-30

4. Artificial intelligence-driven systems engineering for next-generation plant-derived biopharmaceuticals; Frontiers in Plant Science; 2023-11-15

5. Generalizability of a Machine Learning Model for Improving Utilization of Parathyroid Hormone-Related Peptide Testing across Multiple Clinical Centers; Clinical Chemistry; 2023-09-21