Affiliation:
1. Brandeis University
2. McLean Hospital
3. Bay Area Clinical Associates
4. Harvard Medical School
Abstract
Abstract
Psychiatric electronic health records (EHRs) present a distinctive challenge in the domain of ML owing to their unstructured nature, with a high degree of complexity and variability. This study aimed to identify a cohort of patients with diagnoses of a psychotic disorder and posttraumatic stress disorder (PTSD), develop clinically-informed guidelines for annotating these health records for instances of traumatic events to create a gold standard publicly available dataset, and demonstrate that the data gathered using this annotation scheme is suitable for training a machine learning (ML) model to identify these indicators of trauma in unseen health records. We created a representative corpus of 101 EHRs (222,033 tokens) from a centralized database and a detailed annotation scheme for annotating information relevant to traumatic events in the clinical narratives. A team of clinical experts annotated the dataset and updated the annotation guidelines in collaboration with computational linguistic specialists. Inter-annotator agreement was high (0.688 for span tags, 0.589 for relations, and 0.874 for tag attributes). We characterize the major points relating to the annotation process of psychiatric EHRs. Additionally, high-performing baseline span labeling and relation extraction ML models were developed to demonstrate practical viability of the gold standard corpus for ML applications.
Publisher
Research Square Platform LLC
Reference34 articles.
1. Modelling the incidence and mortality of psychotic disorders: Data from the Second Australian National Survey of Psychosis;Saha S;Australian & New Zealand Journal of Psychiatry,2013
2. ‘earning and learning’ in those with psychotic disorders: The Second Australian National Survey of Psychosis;Waghorn G;Australian & New Zealand Journal of Psychiatry,2012
3. Riecher-Rössler, A. Size of burden of schizophrenia and psychotic disorders;Rössler W;European Neuropsychopharmacology,2005
4. Whiteford, H. A., Ferrari, A. J., Degenhardt, L., Feigin, V. & Vos, T. Global burden of mental, neurological, and Substance Use Disorders: An analysis from the global burden of disease study 2010. Disease Control Priorities, Third Edition (Volume 4): Mental, Neurological, and Substance Use Disorders 29–40 (2016). doi:10.1596/978-1-4648-0426-7_ch2
5. Hewlett, E. & Moran, V. Making mental health count: The social and economic costs of neglecting mental health care. (OECD, 2014).