Clinically relevant pretraining is all you need-Reference-Cited by-同舟云学术

Clinically relevant pretraining is all you need

Published:2021-06-21 Issue:9 Volume:28 Page:1970-1976
ISSN:1527-974X
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Bear Don’t Walk IV Oliver J¹,Sun Tony¹,Perotte Adler¹^ORCID,Elhadad Noémie¹

Affiliation:

1. Department of Biomedical Informatics, Columbia University, New York, New York, USA

Abstract

Abstract Clinical notes present a wealth of information for applications in the clinical domain, but heterogeneity across clinical institutions and settings presents challenges for their processing. The clinical natural language processing field has made strides in overcoming domain heterogeneity, while pretrained deep learning models present opportunities to transfer knowledge from one task to another. Pretrained models have performed well when transferred to new tasks; however, it is not well understood if these models generalize across differences in institutions and settings within the clinical domain. We explore if institution or setting specific pretraining is necessary for pretrained models to perform well when transferred to new tasks. We find no significant performance difference between models pretrained across institutions and settings, indicating that clinically pretrained models transfer well across such boundaries. Given a clinically pretrained model, clinical natural language processing researchers may forgo the time-consuming pretraining step without a significant performance drop.

Funder

National Library of Medicine

National Institute of General Medical Sciences

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/28/9/1970/39731673/ocab086.pdf

Reference34 articles.

1. Automated data capture from free-text radiology reports to enhance accuracy of hospital inpatient stroke codes;Flynn;Pharmacoepidemiol Drug Saf,2010

2. A text mining approach to the prediction of disease status from clinical discharge summaries;Yang;J Am Med Inform Assoc,2009