Context-Aware Amino Acid Embedding Advances Analysis of TCR-Epitope Interactions-Reference-Cited by-同舟云学术

Context-Aware Amino Acid Embedding Advances Analysis of TCR-Epitope Interactions

Published:2023-04-14 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Zhang Pengfei^ORCID,Bang Seojin,Cai Michael,Lee Heewook^ORCID

Abstract

AbstractAccurate prediction of binding interaction between T cell receptors (TCRs) and host cells is fundamental to understanding the regulation of the adaptive immune system as well as to developing data-driven approaches for personalized immunotherapy. While several machine learning models have been developed for this prediction task, the question of how to specifically embed TCR sequences into numeric representations remains largely unexplored compared to protein sequences in general. Here, we investigate whether the embedding models designed for protein sequences, and the most widely used BLOSUM-based embedding techniques are suitable for TCR analysis. Additionally, we present our context-aware amino acid embedding models (catELMo) designed explicitly for TCR analysis and trained on 4M unlabeled TCR sequences with no supervision. We validate the effectiveness ofcatELMoin both supervised and unsupervised scenarios by stacking the simplest models on top of our learned embeddings. For the supervised task, we choose the binding affinity prediction problem of TCR and epitope sequences and demonstrate notably significant performance gains (up by at least 14% AUC) compared to existing embedding models as well as the state-of-the-art methods. Additionally, we also show that our learned embeddings reduce more than 93% annotation cost while achieving comparable results to the state-of-the-art methods. In TCR clustering task (unsupervised),catELMoidentifies TCR clusters that are more homogeneous and complete about their binding epitopes. Altogether, ourcatELMotrained without any explicit supervision interprets TCR sequences better and negates the need for complex deep neural network architectures.

Publisher

Cold Spring Harbor Laboratory

Reference58 articles.

1. The T cell antigen receptor: the Swiss army knife of the immune system

2. T-cell antigen receptor genes and T-cell recognition

3. How T cells 'see' antigen

4. Use of T cell epitopes for vaccine development;Current drug targets-Infectious disorders,2001

5. T-cell-receptor gene therapy

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Quantitative approaches for decoding the specificity of the human T cell repertoire;Frontiers in Immunology;2023-09-07

2. Antigen‐specific and cross‐reactive T cells in protection and disease;Immunological Reviews;2023-05-20