Affiliation:
1. University of Washington, Seattle, United States
2. Fred Hutchinson Cancer Research Center, Seattle, United States
Abstract
Probabilistic models of adaptive immune repertoire sequence distributions can be used to infer the expansion of immune cells in response to stimulus, differentiate genetic from environmental factors that determine repertoire sharing, and evaluate the suitability of various target immune sequences for stimulation via vaccination. Classically, these models are defined in terms of a probabilistic V(D)J recombination model which is sometimes combined with a selection model. In this paper we take a different approach, fitting variational autoencoder (VAE) models parameterized by deep neural networks to T cell receptor (TCR) repertoires. We show that simple VAE models can perform accurate cohort frequency estimation, learn the rules of VDJ recombination, and generalize well to unseen sequences. Further, we demonstrate that VAE-like models can distinguish between real sequences and sequences generated according to a recombination-selection model, and that many characteristics of VAE-generated sequences are similar to those of real sequences.
Funder
National Institutes of Health
Howard Hughes Medical Institute
Publisher
eLife Sciences Publications, Ltd
Subject
General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience
Reference44 articles.
1. TensorFlow: Large-scale machine learning on heterogeneous systems;Abadi,2015
2. Generalization and equilibrium in generative adversarial nets (GANs);Arora,2017
3. Do GANs actually learn the distribution? An empirical study;Arora,2017
4. The mechanism and regulation of chromosomal V(D)J recombination;Bassing;Cell,2002
5. Toward machine-guided design of proteins;Biswas,2018
Cited by
63 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献