Abstract
Supervised machine learning applications in health care are often limited by a scarcity of labeled training data. To mitigate the effect of small sample sizes, we introduce a pre-training approach, Patient Contrastive Learning of Representations (PCLR), which uses contrastive learning to create latent representations of electrocardiograms (ECGs) from a large number of unlabeled examples. The resulting representations are expressive, performant, and practical across a wide spectrum of clinical tasks. We develop PCLR using over 3.2 million 12-lead ECGs from a large health care system and demonstrate that training linear models on PCLR representations achieves a 51% performance increase, on average, over six training set sizes and four tasks (sex classification, age regression, and the detection of left ventricular hypertrophy and atrial fibrillation), relative to training neural network models from scratch. We also compare PCLR to three other ECG pre-training approaches (supervised pre-training, unsupervised pre-training with an autoencoder, and pre-training using a contrastive multi-ECG-segment approach) and show significant performance benefits in three out of four tasks. PCLR yields an average performance benefit of 47% over the other models and an average benefit of 9% over the best of the other models for each task. We release PCLR to enable others to extract ECG representations at https://github.com/broadinstitute/ml4h/tree/master/model_zoo/PCLR.
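As a rough illustration of the linear-evaluation step described in the abstract (training linear models on frozen pre-trained representations), the Python sketch below fits a logistic-regression probe on ECG embeddings for a binary task such as atrial fibrillation detection. The encode function, embedding size, and input shape here are hypothetical placeholders, not the released PCLR model; the real encoder and preprocessing are in the repository linked above.

# Minimal linear-probe sketch on frozen ECG representations.
# Assumptions: `encode` is a stand-in for the frozen PCLR encoder, and the
# ECG shape (2500 samples x 12 leads) and embedding size (320) are placeholders.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

def encode(ecgs: np.ndarray) -> np.ndarray:
    """Placeholder encoder: maps (n, 2500, 12) ECGs to (n, 320) embeddings."""
    rng = np.random.default_rng(0)
    projection = rng.normal(size=(ecgs.shape[1] * ecgs.shape[2], 320))
    return ecgs.reshape(len(ecgs), -1) @ projection  # not the real network

# Toy labeled data standing in for a small downstream task (e.g., atrial fibrillation).
rng = np.random.default_rng(1)
ecgs = rng.normal(size=(200, 2500, 12)).astype(np.float32)
labels = rng.integers(0, 2, size=200)

embeddings = encode(ecgs)  # frozen representations; no fine-tuning of the encoder
x_train, x_test, y_train, y_test = train_test_split(
    embeddings, labels, test_size=0.25, random_state=0
)
clf = LogisticRegression(max_iter=1000).fit(x_train, y_train)  # linear model on top
print("AUC:", roc_auc_score(y_test, clf.predict_proba(x_test)[:, 1]))

With real PCLR embeddings and labels, the same pattern applies: only the linear model is trained, which is what makes the approach practical for small labeled datasets.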
Funder
IBM
Bayer AG
Quanta Computing
Controlled Risk Insurance Company/Risk Management Foundation
Publisher
Public Library of Science (PLoS)
Subject
Computational Theory and Mathematics, Cellular and Molecular Neuroscience, Genetics, Molecular Biology, Ecology, Modeling and Simulation, Ecology, Evolution, Behavior and Systematics
Cited by
21 articles.