Imputation and characterization of uncoded self-harm in major mental illness using machine learning

Author:

Kumar Praveen12ORCID,Nestsiarovich Anastasiya1ORCID,Nelson Stuart J3ORCID,Kerner Berit4ORCID,Perkins Douglas J1ORCID,Lambert Christophe G125ORCID

Affiliation:

1. Center for Global Health, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, New Mexico, USA

2. Department of Computer Science, University of New Mexico, Albuquerque, New Mexico, USA

3. Biomedical Informatics Center, Department of Clinical Research & Leadership, George Washington University, Washington, DC, USA

4. Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California, USA

5. Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, New Mexico, USA

Abstract

Abstract Objective We aimed to impute uncoded self-harm in administrative claims data of individuals with major mental illness (MMI), characterize self-harm incidence, and identify factors associated with coding bias. Materials and Methods The IBM MarketScan database (2003-2016) was used to analyze visit-level self-harm in 10 120 030 patients with ≥2 MMI codes. Five machine learning (ML) classifiers were tested on a balanced data subset, with XGBoost selected for the full dataset. Classification performance was validated via random data mislabeling and comparison with a clinician-derived “gold standard.” The incidence of coded and imputed self-harm was characterized by year, patient age, sex, U.S. state, and MMI diagnosis. Results Imputation identified 1 592 703 self-harm events vs 83 113 coded events, with areas under the curve >0.99 for the balanced and full datasets, and 83.5% agreement with the gold standard. The overall coded and imputed self-harm incidence were 0.28% and 5.34%, respectively, varied considerably by age and sex, and was highest in individuals with multiple MMI diagnoses. Self-harm undercoding was higher in male than in female individuals and increased with age. Substance abuse, injuries, poisoning, asphyxiation, brain disorders, harmful thoughts, and psychotherapy were the main features used by ML to classify visits. Discussion Only 1 of 19 self-harm events was coded for individuals with MMI. ML demonstrated excellent performance in recovering self-harm visits. Male individuals and seniors with MMI are particularly vulnerable to self-harm undercoding and may be at risk of not getting appropriate psychiatric care. Conclusions ML can effectively recover unrecorded self-harm in claims data and inform psychiatric epidemiological and observational studies.

Funder

Patient-Centered Outcomes Research Institute

Longitudinal Comparative Effectiveness of Bipolar Disorder Therapies

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Reference45 articles.

1. Suicidal thoughts and behaviors among adults aged ≥18 years–United States, 2008-2009;Crosby;MMWR Surveill Summ,2011

2. Subsequent suicide mortality among emergency department patients seen for suicidal behavior;Crandall;Acad Emerg Med,2006

3. Non-suicidal self-injury, attempted suicide, and suicidal intent among psychiatric inpatients;Andover;Psychiatry Res,2010

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3