Abstract
Objective
To develop a large pretrained clinical language model from scratch using a transformer architecture, and to systematically examine how transformer models of different sizes can help five clinical natural language processing (NLP) tasks at different linguistic levels.
Methods
We created a large corpus with more than 90 billion words from clinical narratives (>82 billion words), scientific literature (6 billion words), and general English text (2.5 billion words). We developed GatorTron models from scratch using the BERT architecture at three sizes: 345 million, 3.9 billion, and 8.9 billion parameters. We compared GatorTron with three existing transformer models from the clinical and biomedical domains on five clinical NLP tasks, including clinical concept extraction, relation extraction, semantic textual similarity, natural language inference, and medical question answering, to examine how large transformer models can help clinical NLP at different linguistic levels.
Results and Conclusion
GatorTron scaled up transformer-based clinical language models to 8.9 billion parameters and achieved state-of-the-art performance on five clinical NLP tasks at different linguistic levels targeting various healthcare information documented in unstructured electronic health records (EHRs). The proposed GatorTron models performed remarkably better on the more complex clinical NLP tasks, such as natural language inference (9.6% and 7.5% improvements) and question answering (9.5% and 7.77% improvements), compared with existing smaller clinical transformer models (i.e., BioBERT and ClinicalBERT), demonstrating the potential of large transformer-based clinical models for advanced medical artificial intelligence (AI) applications such as question answering.
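To make the evaluation setup concrete, the sketch below shows how a BERT-style clinical encoder such as GatorTron could be fine-tuned for one of the five tasks, clinical concept extraction framed as token classification. This is an illustrative assumption, not code from the paper: the checkpoint path and the BIO label set are placeholders, and only a single forward pass is shown.

```python
# Illustrative sketch (not the authors' code): clinical concept extraction as
# token classification with a BERT-style encoder via the Hugging Face API.
# The checkpoint path and label scheme below are assumptions for illustration.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

MODEL_NAME = "path/to/clinical-bert-checkpoint"  # placeholder; substitute a released GatorTron checkpoint

# Toy BIO tag set for clinical concepts (problems, treatments, tests).
labels = ["O", "B-PROBLEM", "I-PROBLEM", "B-TREATMENT", "I-TREATMENT", "B-TEST", "I-TEST"]

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForTokenClassification.from_pretrained(MODEL_NAME, num_labels=len(labels))

# Encode one de-identified sentence and run a forward pass; during fine-tuning,
# gold per-token labels would be passed so the cross-entropy loss is computed.
text = "Patient denies chest pain but reports shortness of breath on exertion."
inputs = tokenizer(text, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits              # shape: (1, num_subword_tokens, num_labels)
predictions = logits.argmax(dim=-1).squeeze(0)   # predicted tag index per subword token

for token, tag_id in zip(tokenizer.convert_ids_to_tokens(inputs["input_ids"][0]), predictions):
    print(f"{token:15s} {labels[tag_id]}")
```

The other, sentence-level tasks (semantic textual similarity, natural language inference, question answering) would swap the token-classification head for a sequence-classification or span-prediction head on the same pretrained encoder.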
Publisher
Cold Spring Harbor Laboratory