Pretrained transformer framework on pediatric claims data for population specific tasks

Published:2022-03-07 Issue:1 Volume:12
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Zeng Xianlong, Linwood Simon L., Liu Chang

Abstract

The adoption of electronic health records (EHR) has become universal during the past decade, enabling in-depth data-driven research. By learning from large amounts of healthcare data, various data-driven models have been built to predict future events for different medical tasks, such as automatic diagnosis and heart-attack prediction. Although EHR data are abundant, patients who satisfy the specific criteria for a population-specific task are scarce, making it challenging to train data-hungry deep learning models. This study presents the Claim Pre-Training (Claim-PT) framework, a generic pre-training model that first trains on the entire pediatric claims dataset and is then discriminatively fine-tuned on each population-specific task. The semantic meaning of medical events is captured in the pre-training stage, and effective knowledge transfer is completed through the task-aware fine-tuning stage. The fine-tuning process requires minimal parameter modification and no change to the model architecture, which mitigates the data scarcity issue and helps train the deep learning model adequately on small patient cohorts. We conducted experiments on a real-world pediatric dataset with more than one million patient records. Experimental results on two downstream tasks demonstrated the effectiveness of our method: our general task-agnostic pre-training framework outperformed tailored task-specific models, achieving more than 10% higher model performance than the baselines. In addition, our framework showed potential to transfer learned knowledge from one institution to another, which may pave the way for future healthcare model pre-training across institutions.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-022-07545-1.pdf

References: 38 articles.

1. Choi, E., Schuetz, A., Stewart, W. F. & Sun, J. Using recurrent neural network models for early detection of heart failure onset. J. Am. Med. Inform. Assoc. 24, 361–370 (2017).

2. Landi, I. et al. Deep representation learning of electronic health records to unlock patient stratification at scale. NPJ Digit. Med. 3, 1–11 (2020).

3. Zeng, X. et al. Multilevel self-attention model and its use on medical risk prediction. In Pacific Symposium on Biocomputing 2020, 115–126 (World Scientific, 2019).

4. Sun, C., Shrivastava, A., Singh, S. & Gupta, A. Revisiting unreasonable effectiveness of data in deep learning era. In Proceedings of the IEEE International Conference on Computer Vision, 843–852 (2017).

5. Hedderich, M. A. & Klakow, D. Training a neural network in a low-resource setting on automatically annotated noisy data. arXiv preprint arXiv:1807.00745 (2018).

Cited by 6 articles.

1. Early detection of pediatric health risks using maternal and child health data;Scientific Reports;2024-07-04

2. The Integration of Dual Evaluation and Minimum Spanning Tree Clustering to Support Decision-Making in Territorial Spatial Planning;Sustainability;2024-05-08

3. Transformers in health: a systematic review on architectures for longitudinal data analysis;Artificial Intelligence Review;2024-02-03

4. Machine and deep learning for longitudinal biomedical data: a review of methods and applications;Artificial Intelligence Review;2023-08-05

5. The shaky foundations of large language models and foundation models for electronic health records;npj Digital Medicine;2023-07-29