Machine Learning-Based Sample Misidentification Error Detection in Clinical Laboratory Tests: A Retrospective Multicenter Study

Author:

Seok Hyeon Seok12ORCID,Yu Shinae3ORCID,Shin Kyung-Hwa4ORCID,Lee Woochang5ORCID,Chun Sail5ORCID,Kim Sollip5ORCID,Shin Hangsik2ORCID

Affiliation:

1. Interdisciplinary Program of Biomedical Engineering, Graduate School, Chonnam National University , Yeosu , Republic of Korea

2. Department of Digital Medicine, Brain Korea 21 Project, Asan Medical Center, University of Ulsan College of Medicine , Seoul , Republic of Korea

3. Department of Laboratory Medicine, Haeundae Paik Hospital, Inje University College of Medicine , Busan , Republic of Korea

4. Department of Laboratory Medicine and Biomedical Research Institute, Pusan National University Hospital , Pusan National University School of Medicine, Busan , Republic of Korea

5. Department of Laboratory Medicine, Asan Medical Center, University of Ulsan College of Medicine , Seoul , Republic of Korea

Abstract

Abstract Background In clinical laboratories, the precision and sensitivity of autoverification technologies are crucial for ensuring reliable diagnostics. Conventional methods have limited sensitivity and applicability, making error detection challenging and reducing laboratory efficiency. This study introduces a machine learning (ML)-based autoverification technology to enhance tumor marker test error detection. Methods The effectiveness of various ML models was evaluated by analyzing a large data set of 397 751 for model training and internal validation and 215 339 for external validation. Sample misidentification was simulated by random shuffling error-free test results with a 1% error rate to achieve a real-world approximation. The ML models were developed with Bayesian optimization for tuning. Model validation was performed internally at the primary institution and externally at other institutions, comparing the ML models’ performance with conventional delta check methods. Results Deep neural networks and extreme gradient boosting achieved an area under the receiver operating characteristic curve of 0.834 to 0.903, outperforming that of conventional methods (0.705 to 0.816). External validation by 3 independent laboratories showed that the balanced accuracy of the ML model ranged from 0.760 to 0.836, outperforming the balanced accuracy of 0.670 to 0.773 of the conventional models. Conclusions This study addresses limitations regarding the sensitivity of current delta check methods for detection of sample misidentification errors and provides versatile models that mitigate the operational challenges faced by smaller laboratories. Our findings offer a pathway toward more efficient and reliable clinical laboratory testing.

Publisher

Oxford University Press (OUP)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3