Abstract
AbstractThe design of binder proteins for specific target proteins using deep learning is a challenging task that has a wide range of applications in both designing therapeutic antibodies and creating new drugs. Machine learning-based solutions, as opposed to laboratory design, streamline the design process and enable the design of new proteins that may be required to address new and orphan diseases. Most techniques proposed in the literature necessitate either domain knowledge or some appraisal of the target protein’s 3-D structure. This paper proposes an approach for designing binder proteins based solely on the amino acid sequence of the target protein and without recourse to domain knowledge or structural information. The sequences of the binders are generated with two new transformers, namely the AppendFormer and MergeFormer architectures. Because, in general, there is more than one binder for a given target protein, these transformers employ a binding score and a prior on the sequence of the binder to obtain a unique targeted solution. Our experimental evaluation confirms the strengths of this novel approach. The performance of the models was determined with 5-fold cross-validation and clearly indicates that our architectures lead to highly accurate results. In addition, scores of up to 0.98 were achieved in terms of Needleman-Wunsch and Smith-Waterman similarity metrics, which indicates that our solutions significantly outperform a seq2seq baseline model.
Funder
Artificial Intelligence for Design Challenge program
Publisher
Springer Science and Business Media LLC
Subject
Computational Mathematics,Engineering (miscellaneous),Information Systems,Artificial Intelligence
Reference94 articles.
1. Cooper GM, Hausman RE, Hausman RE (2007) The cell: a molecular approach, vol 4. ASM Press, Washington
2. Raman D, Sobolik-Delmaire T, Richmond A (2011) Chemokines in health and disease. Exp Cell Res 317(5):575–589. https://doi.org/10.1016/j.yexcr.2011.01.005
3. Zhu H et al (2001) Global analysis of protein activities using proteome chips. Science 293(5537):2101–2105. https://doi.org/10.1126/science.1062191
4. Pinto D et al (2020) Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody. Nature 583(7815):290–295. https://doi.org/10.1038/s41586-020-2349-y
5. Murray RK, Granner DK, Mayes PA, Rodwell VW (2006) Harper’s illustrated biochemistry (harper’s biochemistry), 27th edn. McGraw-Hill Medical