Abstract
SummaryAdeno-associated virus (AAV) capsids have shown clinical promise as delivery vectors for gene therapy. However, the high prevalence of pre-existing immunity against natural capsids poses a challenge for widespread treatment. The generation of diverse capsids that are potentially more capable of immune evasion is challenging because introducing multiple mutations often breaks capsid assembly. Here we target a representative, immunologically relevant 28-amino-acid segment of the AAV2 capsid and show that a low-complexity Variational Auto-encoder (VAE) can interpolate in sequence space to produce diverse and novel capsids capable of packaging their own genomes. We first train the VAE on a 564-sample Multiple-Sequence Alignment (MSA) of dependo-parvoviruses, and then further augment this dataset by adding 22,704 samples from a deep mutational exploration (DME) on the target region. In both cases the VAE generated viable variants with many mutations, which we validated experimentally. We propose that this simple approach can be used to optimize and diversify other proteins, as well as other capsid traits of interest for gene delivery.
Publisher
Cold Spring Harbor Laboratory
Reference39 articles.
1. Drawing a high-resolution functional map of adeno-associated virus capsid by massively parallel sequencing
2. Unified rational protein engineering with sequence-based deep representation learning;Nat. Methods,2019
3. End-to-End Differentiable Learning of Protein Structure;Cell Syst,2019
4. Brookes, D.H. , and Listgarten, J. (2018). Design by adaptive sampling.
5. Brookes, D. , Park, H. , and Listgarten, J. (2019). Conditioning by adaptive sampling for robust design. In Proceedings of the 36th International Conference on Machine Learning, K. Chaudhuri , and R. Salakhutdinov , eds. (PMLR), pp. 773–782.
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献