The structural extraction of Chinese medical narratives-Reference-Cited by-同舟云学术

The structural extraction of Chinese medical narratives

Published:2018-07-30 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Zhang Rongzhi^ORCID,Zhang Haifei,Yao Zhiyu,Huang Zhengxing

Abstract

AbstractMedical narratives document a vast amount of clinical data. This data has a valuable secondary purpose, as it may be used to optimize health service delivery and improve the quality of medical care. However, medical narratives are typically recorded in an unstructured manner, which complicates the process of extracting the structured information required for optimization. In this paper, we address this problem by applying and comparing two models, a rule-based model and a model based on conditional random fields (CRFs), to a data set of Chinese medical narratives. Among 4626 manually annotated Chinese medical narratives, collected from Shanxi Dayi Hospital in China, the rule-based model achieved 95.87% precision, 69.82% recall, and an F-score of 80.80%, and the CRF-based model realized 95.99% precision, 65.11% recall, and a 77.59% F-score. These experimental results demonstrate the efficacy of both proposed models for structural extraction from Chinese medical narratives.

Publisher

Cold Spring Harbor Laboratory

Reference35 articles.

1. Chapman WW , Nadkarni PM , Hirschman L , D’avolio LW , Savova GK , Uzuner O. Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions. BMJ Group BMA House, Tavistock Square, London, WC1H 9JR; 2011.

2. Pestian JP , Brew C , Matykiewicz P , Hovermale DJ , Johnson N , Cohen KB , et al., editors. A shared task involving multi-label classification of clinical free text. Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing; 2007: Association for Computational Linguistics.

3. Recognizing Obesity and Comorbidities in Sparse Data

4. Evaluating the State-of-the-Art in Automatic De-identification

5. Extracting medication information from clinical text