1. Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. Advances in Neural Information Processing Systems. 2017:5998–6008.
2. Devlin J, Chang M-W, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT 2019. 2019:4171–4186.
3. Radford A, Narasimhan K, Salimans T, Sutskever I. Improving Language Understanding by Generative Pre-Training. OpenAI. 2018. https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf
4. Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I. Language Models are Unsupervised Multitask Learners. OpenAI. 2019. https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
5. Brown TB, Mann B, Ryder N, et al. Language Models are Few-Shot Learners. arXiv preprint arXiv:2005.14165. 2020. https://arxiv.org/abs/2005.14165