Author:
Dhaubhadel Sayera,Ganguly Kumkum,Ribeiro Ruy M.,Cohn Judith D.,Hyman James M.,Hengartner Nicolas W.,Kolade Beauty,Singley Anna,Bhattacharya Tanmoy,Finley Patrick,Levin Drew,Thelen Haedi,Cho Kelly,Costa Lauren,Ho Yuk-Lam,Justice Amy C.,Pestian John,Santel Daniel,Zamora-Resendiz Rafael,Crivelli Silvia,Tamang Suzanne,Martins Susana,Trafton Jodie,Oslin David W.,Beckham Jean C.,Kimbrel Nathan A., ,Agarwal Khushbu,Ashley-Koch Allison E.,Aslan Mihaela,Begoli Edmond,Brown Ben,Calhoun Patrick S.,Cheung Kei-Hoi,Choudhury Sutanay,Cliff Ashley M.,Cuellar-Hengartner Leticia,Deangelis Haedi E.,Dennis Michelle F.,Finley Patrick D.,Garvin Michael R.,Gelernter Joel E.,Hair Lauren P.,Ham Colby,Harvey Phillip D.,Hauser Elizabeth R.,Hauser Michael A.,Hengartner Nick W.,Jacobson Daniel A.,Jones Jessica,Jones Piet C.,Kainer David,Kaplan Alan D.,Katz Ira R.,Kember Rachel L.,Kirby Angela C.,Ko John C.,Lagergren John,Lane Matthew,Levey Daniel F.,Lindquist Jennifer H.,Liu Xianlian,Madduri Ravi K.,Manore Carrie,Martinez Carianne,McCarthy John F.,Cashman Mikaela McDevitt,Miller J. Izaak,Morrow Destinee,Pavicic-Venegas Mirko,Pyarajan Saiju,Qin Xue J.,Rajeevan Nallakkandi,Ramsey Christine M.,Ribeiro Ruy,Rodriguez Alex,Romero Jonathon,Shi Yunling,Stein Murray B.,Sullivan Kyle A.,Sun Ning,Tamang Suzanne R.,Townsend Alice,Trafton Jodie A.,Walker Angelica,Wang Xiange,Wangia-Anderson Victoria,Yang Renji,Yoo Shinjae,Zhao Hongyu,McMahon Benjamin H.
Abstract
AbstractWe present an ensemble transfer learning method to predict suicide from Veterans Affairs (VA) electronic medical records (EMR). A diverse set of base models was trained to predict a binary outcome constructed from reported suicide, suicide attempt, and overdose diagnoses with varying choices of study design and prediction methodology. Each model used twenty cross-sectional and 190 longitudinal variables observed in eight time intervals covering 7.5 years prior to the time of prediction. Ensembles of seven base models were created and fine-tuned with ten variables expected to change with study design and outcome definition in order to predict suicide and combined outcome in a prospective cohort. The ensemble models achieved c-statistics of 0.73 on 2-year suicide risk and 0.83 on the combined outcome when predicting on a prospective cohort of $$\sim$$
∼
4.2 M veterans. The ensembles rely on nonlinear base models trained using a matched retrospective nested case-control (Rcc) study cohort and show good calibration across a diversity of subgroups, including risk strata, age, sex, race, and level of healthcare utilization. In addition, a linear Rcc base model provided a rich set of biological predictors, including indicators of suicide, substance use disorder, mental health diagnoses and treatments, hypoxia and vascular damage, and demographics.
Publisher
Springer Science and Business Media LLC