Evaluating inter-rater reliability in the context of “Sysmex UN2000 detection of protein/creatinine ratio and of renal tubular epithelial cells can be used for screening lupus nephritis”: a statistical examination-Reference-Cited by-同舟云学术

Evaluating inter-rater reliability in the context of “Sysmex UN2000 detection of protein/creatinine ratio and of renal tubular epithelial cells can be used for screening lupus nephritis”: a statistical examination

Published:2024-03-13 Issue:1 Volume:25 Page:
ISSN:1471-2369
Container-title:BMC Nephrology
language:en
Short-container-title:BMC Nephrol

Author:

Li Ming,Gao Qian,Yang Jing,Yu Tianfei

Abstract

Abstract Background The evaluation of inter-rater reliability (IRR) is integral to research designs involving the assessment of observational ratings by two raters. However, existing literature is often heterogeneous in reporting statistical procedures and the evaluation of IRR, although such information can impact subsequent hypothesis testing analyses. Methods This paper evaluates a recent publication by Chen et al., featured in BMC Nephrology, aiming to introduce an alternative statistical approach to assessing IRR and discuss its statistical properties. The study underscores the crucial need for selecting appropriate Kappa statistics, emphasizing the accurate computation, interpretation, and reporting of commonly used IRR statistics between two raters. Results The Cohen’s Kappa statistic is typically used for two raters dealing with two categories or for unordered categorical variables having three or more categories. On the other hand, when assessing the concordance between two raters for ordered categorical variables with three or more categories, the commonly employed measure is the weighted Kappa. Conclusion Chen and colleagues might have underestimated the agreement between AU5800 and UN2000. Although the statistical approach adopted in Chen et al.’s research did not alter their findings, it is important to underscore the importance of researchers being discerning in their choice of statistical techniques to address their specific research inquiries.

Funder

Fundamental Research Funds in Heilongjiang Provincial Universities

Heilongjiang Province Leading Talent Echelon Reserve Leader Funding Project

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s12882-024-03540-y.pdf

Reference15 articles.

1. Hallgren KA. Computing inter-rater reliability for observational data: an overview and tutorial. Tutor Quant Methods Psychol. 2012;8(1):23.

2. Hughes J. Sklar’s omega: a gaussian copula-based framework for assessing agreement. Stat Comput. 2022;32(3):46.

3. Chen Y, Zhao Y, Zhang Z, Cheng X, Lin J, Li J, et al. Sysmex UN2000 detection of protein/creatinine ratio and of renal tubular epithelial cells can be used for screening lupus nephritis. BMC Nephrol. 2022;23(1):328.

4. Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas. 1960;20(1):37–46.

5. Gao P, He W, Jin Y, Zhou C, Zhang P, Wang W, Hu J, Liu J. Acute kidney injury after infant cardiac surgery: a comparison of pRIFLE, KDIGO, and pROCK definitions. BMC Nephrol. 2023;24(1):251.