Affiliation:
1. Computer Science Department, Virginia Commonwealth University, Richmond, Virginia, USA
2. Information Sciences and Technology, George Mason University, Fairfax, Virginia, USA
Abstract
Objective
Clinical notes contain an abundance of important but not readily accessible information about patients. Systems that automatically extract this information require large amounts of training data, and the resources available to create such data are limited. Furthermore, these systems are developed disjointly, so no information can be shared among task-specific models. This bottleneck unnecessarily complicates practical application, limits the performance of each individual solution, and incurs the engineering debt of managing multiple information extraction systems.
Materials and Methods
We address these challenges by developing Multitask-Clinical BERT (MT-Clinical BERT): a single deep learning model that simultaneously performs 8 clinical tasks, spanning entity extraction, personal health information (PHI) identification, language entailment, and semantic similarity, by sharing representations among tasks.
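For context, a minimal sketch of this hard-parameter-sharing design, assuming a PyTorch and Hugging Face transformers stack: one shared BERT encoder feeds lightweight per-task heads. The checkpoint name, head names, and label counts below are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch (not the authors' implementation): a single shared BERT
# encoder with one lightweight head per task, so every task trains and
# serves against the same text representation.
import torch
import torch.nn as nn
from transformers import AutoModel


class MultitaskClinicalBert(nn.Module):
    def __init__(self, encoder_name: str = "bert-base-uncased"):
        super().__init__()
        # One shared encoder: gradients from all tasks update these weights.
        self.encoder = AutoModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size
        # Per-task heads: token-level tagging for entity and PHI extraction,
        # sequence-level outputs for entailment and similarity.
        # Label counts here are placeholders, not the paper's.
        self.heads = nn.ModuleDict({
            "entities": nn.Linear(hidden, 9),     # token-level tag logits
            "phi": nn.Linear(hidden, 5),          # token-level tag logits
            "entailment": nn.Linear(hidden, 3),   # sentence-pair classes
            "similarity": nn.Linear(hidden, 1),   # sentence-pair score
        })

    def forward(self, task: str, input_ids, attention_mask):
        states = self.encoder(input_ids=input_ids,
                              attention_mask=attention_mask).last_hidden_state
        if task in ("entities", "phi"):
            features = states            # per-token features for tagging
        else:
            features = states[:, 0]      # [CLS] features for sequence tasks
        return self.heads[task](features)
```

During training, batches from the different datasets would be interleaved (e.g., round-robin), with only the sampled task's head contributing to each update; this is a standard multitask recipe rather than a claim about the authors' exact schedule.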
Results
We compare the performance of our multitasking information extraction system to state-of-the-art BERT sequential fine-tuning baselines. We observe a slight but consistent performance degradation in MT-Clinical BERT relative to sequential fine-tuning.
Discussion
These results intuitively suggest that learning a general clinical text representation capable of supporting multiple tasks has the downside of losing the ability to exploit dataset- or clinical note-specific properties, compared with a single task-specific model.
Conclusions
We find that our single system performs competitively with all state-of-the-art task-specific systems while also offering massive computational savings at inference.
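To illustrate the inference saving, again under the assumptions of the sketch above: the expensive encoder pass runs once per note, and every task head reuses the same hidden states, rather than running 8 separately fine-tuned models.

```python
# Usage sketch: encode once, then apply all task heads to the shared states.
import torch

model = MultitaskClinicalBert()
model.eval()
input_ids = torch.randint(0, 30522, (1, 128))   # toy token ids
attention_mask = torch.ones_like(input_ids)

with torch.no_grad():
    states = model.encoder(input_ids=input_ids,
                           attention_mask=attention_mask).last_hidden_state
    token_preds = {t: model.heads[t](states) for t in ("entities", "phi")}
    sequence_preds = {t: model.heads[t](states[:, 0])
                      for t in ("entailment", "similarity")}
```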
Funder
National Library of Medicine
Publisher
Oxford University Press (OUP)