Abstract
AbstractImportanceValid biomarkers that can predict longitudinal clinical outcomes at low cost are a holy grail in psychiatric research, promising to ultimately be used to optimize and tailor intervention and prevention efforts.ObjectiveTo determine if baseline linguistic markers in natural speech, as compared to non-speech clinical and demographic measures, can predict drug use severity measures at future sessions in initially abstinent individuals with cocaine use disorder (iCUD).DesignA longitudinal cohort study (August 2017 – March 2020), where baseline measures were used to predict outcomes collected at three-month intervals for up to one year of follow-up.ParticipantsEighty-eight initially abstinent iCUD were studied at baseline; 57 (46 male, age 50.7+/-7.9 years) came back for at least another session.Main Outcomes and MeasuresOutcomes were self-reported symptoms of withdrawal, craving, abstinence duration and frequency of cocaine use in the past 90 days at each study session. The predictors were derived from 5-min recordings of vocal descriptions of the positive consequences of abstinence and the negative consequences of using cocaine; the baseline cocaine and other common drug use measures, demographic and neuropsychological variables were used for comparison.ResultsModels using the non-speech variables showed the best predictive performance at three(r>0.45,P<2×10-3) and six months follow-up (r>0.37,P<3×10-2). At 12 months, the natural language processing-based model showed significant correlations with withdrawal (r=0.43,P=3×10-2), craving (r=0.72,P=5×10-5), days of abstinence (r=0.76,P=1×10-5), and cocaine use in the past 90 days (r=0.61,P=2×10-3), significantly outperforming the other models for abstinence prediction.Conclusions and RelevanceAt short time intervals, maximal predictive power was obtained with models that used baseline drug use (in addition to demographic and neuropsychological) measures, potentially reflecting a slow rate of change in these measures, which could be estimated by linear functions. In contrast, short speech samples predicted longer-term changes in drug use, implying deeper penetrance by potentially capturing non-linear dynamics over longer intervals. Results suggest that, compared to the common outcome measures used in clinical trials, speech-based measures could be leveraged as better predictors of longitudinal drug use outcomes in initially abstinent iCUD, as potentially generalizable to other substance use disorders and related comorbidity.Key PointsQuestionCan natural language processing be leveraged to predict longitudinal drug use outcomes in individuals with substance use disorder?FindingsIn this prospective study that included initially abstinent individuals with cocaine use disorder (iCUD), models using demographics, neuropsychological measures and drug use patterns at baseline were compared to those using minimally structured short natural speech samples relating the positive consequences of abstinence and the negative consequences of using drugs, showing a differential prediction of outcomes measured up to one year later. At three and six months, the former outperformed speech models, including approximately 50% of the variability in craving and 40% in abstinence duration. At 12 months from baseline, speech models were superior, predicting 50% of the variability in abstinence duration.MeaningSpeech variables derived through natural language processing can predict clinically meaningful drug use outcome measures in addiction, with greater value at longer intervals. The applicability of language modeling to aid in assessing treatment response and risk in drug addiction warrants further investigation in clinical settings.
Publisher
Cold Spring Harbor Laboratory