Detection of Outliers in Reference Distributions: Performance of Horn’s Algorithm

Author:

Solberg Helge Erik (1), Lahti Ari (2)

Affiliation:

1. Department of Medical Biochemistry, Rikshospitalet-Radiumhospitalet HF, Oslo, Norway

2. Department of General Psychiatry, University Hospital of Northern Norway, Tromsø, Norway

Abstract

Background: Medical laboratory reference data may be contaminated with outliers that should be eliminated before estimation of the reference interval. A statistical test for outliers has been proposed by Paul S. Horn and coworkers (Clin Chem 2001;47:2137–45). The algorithm operates in 2 steps: (a) mathematically transform the original data to approximate a gaussian distribution; and (b) establish detection limits (Tukey fences) based on the central part of the transformed distribution.

Methods: We studied the specificity of Horn’s test algorithm (probability of false detection of outliers), using Monte Carlo computer simulations performed on 13 types of probability distributions covering a wide range of positive and negative skewness. Distributions with 3% of the original observations replaced by random outliers were used to also examine the sensitivity of the test (probability of detection of true outliers). Three data transformations were used: the Box and Cox function (used in the original Horn’s test), the Manly exponential function, and the John and Draper modulus function.

Results: For many of the probability distributions, the specificity of Horn’s algorithm was rather poor compared with the theoretical expectation. The cause for such poor performance was at least partially related to remaining nongaussian kurtosis (peakedness). The sensitivity showed great variation, dependent on both the type of underlying distribution and the location of the outliers (upper and/or lower tail).

Conclusion: Although Horn’s algorithm undoubtedly is an improvement compared with older methods for outlier detection, reliable statistical identification of outliers in reference data remains a challenge.
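The two-step procedure and the Monte Carlo specificity check described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not state several details, so the following are assumptions: the fence multiplier k = 1.5 (the conventional Tukey value), the quartile definition (the default of Python's `statistics.quantiles`), and the grid search over the Box and Cox parameter. All function names are hypothetical.

```python
import math
import random
import statistics


def box_cox(x, lam):
    """Box and Cox transform of one strictly positive value."""
    return math.log(x) if abs(lam) < 1e-12 else (x ** lam - 1.0) / lam


def profile_loglik(data, lam):
    """Box and Cox profile log-likelihood for lam (up to an additive constant)."""
    y = [box_cox(v, lam) for v in data]
    var = statistics.pvariance(y)
    return -0.5 * len(data) * math.log(var) + (lam - 1.0) * sum(math.log(v) for v in data)


def fit_lambda(data):
    """Assumed estimator: grid search over lam in [-3, 3] with step 0.05."""
    grid = [i / 20.0 for i in range(-60, 61)]
    return max(grid, key=lambda lam: profile_loglik(data, lam))


def tukey_fences(values, k=1.5):
    """Return (lower, upper) fences: Q1 - k*IQR and Q3 + k*IQR."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def flag_outliers(data, lam=None, k=1.5):
    """Step (a): transform toward gaussian shape; step (b): apply Tukey fences.

    Returns one boolean per observation (True = flagged as outlier).
    If lam is None it is estimated from the data.
    """
    if lam is None:
        lam = fit_lambda(data)
    y = [box_cox(v, lam) for v in data]
    lower, upper = tukey_fences(y, k)
    return [not (lower <= v <= upper) for v in y]


def false_positive_rate(sampler, n=120, runs=50, seed=0):
    """Monte Carlo estimate of 1 - specificity on uncontaminated samples.

    sampler(rng) must return one strictly positive random value.
    The rate is the fraction of clean observations that get flagged.
    """
    rng = random.Random(seed)
    flagged = 0
    for _ in range(runs):
        sample = [sampler(rng) for _ in range(n)]
        flagged += sum(flag_outliers(sample))
    return flagged / (n * runs)
```

For example, `false_positive_rate(lambda rng: rng.lognormvariate(0.0, 1.0))` estimates how often clean log-normal observations are flagged. Sensitivity could be probed the same way by replacing a small fraction of each sample (3% in the study) with values drawn from an outlier distribution of one's choosing and counting how many of those are flagged. The Manly and the John and Draper transformations mentioned in the abstract are not included in this sketch.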

Publisher

Oxford University Press (OUP)

Subject

Biochemistry, medical; Clinical Biochemistry


Cited by 85 articles.
