Achieving high inter-rater reliability in establishing data labels: a retrospective chart review study-Reference-Cited by-同舟云学术

Achieving high inter-rater reliability in establishing data labels: a retrospective chart review study

Published:2024-04 Issue:2 Volume:13 Page:e002722
ISSN:2399-6641
Container-title:BMJ Open Quality
language:en
Short-container-title:BMJ Open Qual

Author:

Wu Guosong^ORCID,Eastwood Cathy^ORCID,Sapiro Natalie,Cheligeer Cheligeer,Southern Danielle A,Quan Hude,Xu Yuan

Abstract

BackgroundIn medical research, the effectiveness of machine learning algorithms depends heavily on the accuracy of labeled data. This study aimed to assess inter-rater reliability (IRR) in a retrospective electronic medical chart review to create high quality labeled data on comorbidities and adverse events (AEs).MethodsSix registered nurses with diverse clinical backgrounds reviewed patient charts, extracted data on 20 predefined comorbidities and 18 AEs. All reviewers underwent four iterative rounds of training aimed to enhance accuracy and foster consensus. Periodic monitoring was conducted at the beginning, middle, and end of the testing phase to ensure data quality. Weighted Kappa coefficients were calculated with their associated 95% confidence intervals (CIs).ResultsSeventy patient charts were reviewed. The overall agreement, measured by Conger's Kappa, was 0.80 (95% CI: 0.78-0.82). IRR scores remained consistently high (ranging from 0.70 to 0.87) throughout each phase.ConclusionOur study suggests the detailed manual for chart review and structured training regimen resulted in a consistently high level of agreement among our reviewers during the chart review process. This establishes a robust foundation for generating high-quality labeled data, thereby enhancing the potential for developing accurate machine learning algorithms.

Funder

CIHR

Publisher

BMJ

Reference6 articles.

1. Machine Learning in Medicine

2. Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial

3. Developing EMR-based Algorithms to identify hospital adverse events for health system performance evaluation and improvement: study protocol;Wu;PLoS One,2022

4. Harris PA , Taylor R , Minor BL , et al . The Redcap consortium: building an international community of software platform partners. J Biomed Inform 2019;95. doi:10.1016/j.jbi.2019.103208

5. Training and experience of coding with the world health organization’s International classification of diseases;Eastwood;Health Inf Manag,2023