Abstract
AbstractThere have recently been rapid advances in deep learning models for protein design. To demonstrate proof-of-concept, these advancements have focused on small proteins with lots of data for training. This means that they are often not suitable for generating proteins with the most potential for high clinical impact –due to the additional challenges of sparse data and large size many therapeutically relevant proteins have. One major application that fits this category is gene therapy delivery. Viral vectors such as Adenoviruses and AAVs are a common delivery vehicle for gene therapy. However, environmental exposure means that most people exhibit potent pre-existing immune responses to many serotypes. This response, primarily driven by neutralizing antibodies, also precludes repeated administration with the same serotype. Rare serotypes, serotypes targeting other species, and capsid engineering, have all been deployed in the service of reducing neutralization by pre-existing antibodies. However, progress has been very limited using conventional methods and a new approach is urgently needed. To address this, we developed a variational autoencoder that can generate synthetic viral vector serotypes without epitopes for pre-existing neutralizing antibodies. A compact generative computational model was constructed, with only 12.4 million parameters that could be efficiently trained on the limited natural sequences (e.g., 711 natural Adenovirus hexon sequences with average length of 938 amino acids). In contrast to the current state-of-the-art, the model was able to generate high-quality Adenovirus hexon sequences that were folded with high confidence by Alphafold2 to produce structures essentially identical to natural hexon structures. Molecular dynamics simulations confirmed that the structures are stable and protein–protein interfaces are intact. Local secondary structure and local mobility is also comparable with natural serotype behavior. Our model could be used to generate a broad range of synthetic adenovirus serotype sequences without epitopes for pre-existing neutralizing antibodies in the human population. It could be used more broadly to generate different types of viral vector, and any large, therapeutically valuable proteins, where available data is sparse.
Publisher
Cold Spring Harbor Laboratory
Reference66 articles.
1. Principal component analysis
2. Viral adaptation to host: a proteome‐based analysis of codon usage and amino acid preferences
3. Adenovirus-mediated wild-typep53 tumor suppressor gene therapy induces apoptosis and suppresses growth of human pancreatic cancer
4. Bowman, S.R. , Vilnis, L. , Vinyals, O. , Dai, A.M. , Jozefowicz, R. , Bengio, S. , 2015. Generating sentences from a continuous space. ArXiv Prepr. ArXiv151106349.
5. Viral vector platforms within the gene therapy landscape;Signal Transduct. Target. Ther,2021
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献