Abstract
AbstractAntigen presentation by Major Histocompatibility Complex (MHC) is a cornerstone of the cellular immune system, and characterization of the rules defining which peptides are suitable for MHC presentation is of key interest for understanding T cell immunogenicity and the development of T cell driven therapeutics. The collective set of peptides presented on a cell surface by MHC molecules is known as the immunopeptidome. Due to the vast MHC polymorphism, this peptidome is unique between individuals. Current state-of-the-art liquid chromatography mass spectrometry (LC-MS) technologies allow the identification of large peptidomes specific for a given host or cell line, and numerous studies have proven this data to be a rich source of information for learning the rules of MHC-mediated antigen presentation. The data obtained with MS techniques is usually poly-specific – i.e. it contains multiple sequence motifs matching the different MHC molecules expressed in the system under investigation. Deconvolution of this poly-specificity has proven a challenge, and imposes a substantial limitation for learning MHC specific rules for antigen presentation from immunopeptidome data. This since each ligand first must be unambiguously associated to its presenting MHC molecule(s) within the haplotype of the cell line.Here, we describe NNAlign_MA, a method that is able to address this challenge. NNAlign_MA is capable of taking a mixed training set composed of single-allele (SA) data (peptides assigned to single MHCs) and multi-allele (MA) data (peptides with multiple options for MHCs assignments) as input, and in a fully automated manner deconvolute the individual MHC restriction of all MA sequences while learning the binding specificities of all the MHCs present in the training set.The NNAlign_MA method was benchmarked on a large and diverse dataset, covering human MHC class I and class II, and bovine class I (BoLA) data. For all cases, the method was demonstrated to have unprecedented performance compared to state-of-the-art methods, achieving a complete deconvolution of binding motifs contained within poly-specificity MS eluted ligand data and an improved predictive performance for identification of both eluted ligands and T cell epitopes.Given its very high flexibility and ease of use, we expect NNAlign_MA to serve as an effective tool to increase our understanding of the rules for MHC antigen presentation and guide the development of novel T cell-based therapeutics.
Publisher
Cold Spring Harbor Laboratory
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献