<i>EvidenceMap</i>: a three-level knowledge representation for medical evidence computation and comprehension-Reference-Cited by-同舟云学术

EvidenceMap: a three-level knowledge representation for medical evidence computation and comprehension

Published:2023-03-15 Issue:6 Volume:30 Page:1022-1031
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Kang Tian¹,Sun Yingcheng¹^ORCID,Kim Jae Hyun¹,Ta Casey¹^ORCID,Perotte Adler¹^ORCID,Schiffer Kayla¹,Wu Mutong²,Zhao Yang²,Moustafa-Fahmy Nour²,Peng Yifan³^ORCID,Weng Chunhua¹

Affiliation:

1. Department of Biomedical Informatics, Columbia University , New York, New York, USA

2. Department of Statistics, Columbia University , New York, New York, USA

3. Department of Population Health Sciences, Weill Cornell Medicine , New York, New York, USA

Abstract

Abstract Objective To develop a computable representation for medical evidence and to contribute a gold standard dataset of annotated randomized controlled trial (RCT) abstracts, along with a natural language processing (NLP) pipeline for transforming free-text RCT evidence in PubMed into the structured representation. Materials and methods Our representation, EvidenceMap, consists of 3 levels of abstraction: Medical Evidence Entity, Proposition and Map, to represent the hierarchical structure of medical evidence composition. Randomly selected RCT abstracts were annotated following EvidenceMap based on the consensus of 2 independent annotators to train an NLP pipeline. Via a user study, we measured how the EvidenceMap improved evidence comprehension and analyzed its representative capacity by comparing the evidence annotation with EvidenceMap representation and without following any specific guidelines. Results Two corpora including 229 disease-agnostic and 80 COVID-19 RCT abstracts were annotated, yielding 12 725 entities and 1602 propositions. EvidenceMap saves users 51.9% of the time compared to reading raw-text abstracts. Most evidence elements identified during the freeform annotation were successfully represented by EvidenceMap, and users gave the enrollment, study design, and study Results sections mean 5-scale Likert ratings of 4.85, 4.70, and 4.20, respectively. The end-to-end evaluations of the pipeline show that the evidence proposition formulation achieves F1 scores of 0.84 and 0.86 in the adjusted random index score. Conclusions EvidenceMap extends the participant, intervention, comparator, and outcome framework into 3 levels of abstraction for transforming free-text evidence from the clinical literature into a computable structure. It can be used as an interoperable format for better evidence retrieval and synthesis and an interpretable representation to efficiently comprehend RCT findings.

Funder

Bridging the semantic gap between research eligibility criteria and clinical data

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

https://academic.oup.com/jamia/article-pdf/30/6/1022/50374770/ocad036.pdf

Reference38 articles.

1. The levels of evidence and their role in evidence-based medicine;Burns;Plast Reconstr Surg,2011

2. Utilization of the PICO framework to improve searching PubMed for clinical questions;Schardt;BMC Med Inform Decis Mak,2007

3. Beyond genes, proteins, and abstracts: identifying scientific claims from full-text biomedical articles;Blake;J Biomed Inform,2010