RESUMEN
MHC peptide binding and presentation is the most selective event defining the landscape of T cell epitopes. Consequently, understanding the diversity of MHC alleles in a given population and the parameters that define the set of ligands that can be bound and presented by each of these alleles (the immunopeptidome) has an enormous impact on our capacity to predict and manipulate the potential of protein Ags to elicit functional T cell responses. Liquid chromatography-mass spectrometry analysis of MHC-eluted ligand data has proven to be a powerful technique for identifying such peptidomes, and methods integrating such data for prediction of Ag presentation have reached a high level of accuracy for both MHC class I and class II. In this study, we demonstrate how these techniques and prediction methods can be readily extended to the bovine leukocyte Ag class II DR locus (BoLA-DR). BoLA-DR binding motifs were characterized by eluted ligand data derived from bovine cell lines expressing a range of DRB3 alleles prevalent in Holstein-Friesian populations. The model generated (NetBoLAIIpan, available as a Web server at www.cbs.dtu.dk/services/NetBoLAIIpan) was shown to have unprecedented predictive power to identify known BoLA-DR-restricted CD4 epitopes. In summary, the results demonstrate the power of an integrated approach combining advanced mass spectrometry peptidomics with immunoinformatics for characterization of the BoLA-DR Ag presentation system and provide a prediction tool that can be used to assist in rational evaluation and selection of bovine CD4 T cell epitopes.
Asunto(s)
Presentación de Antígeno , Linfocitos T CD4-Positivos/inmunología , Biología Computacional/métodos , Epítopos de Linfocito T/inmunología , Antígenos de Histocompatibilidad Clase II/inmunología , Péptidos/inmunología , Alelos , Animales , Secuencia de Bases , Linfocitos T CD4-Positivos/parasitología , Bovinos , Células Cultivadas , Simulación por Computador , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Antígenos de Histocompatibilidad Clase II/genética , Ligandos , Espectrometría de Masas/métodos , Unión Proteica , Theileria annulata , Theileria parva , Theileriosis/inmunología , Theileriosis/parasitologíaRESUMEN
The set of peptides presented on a cell's surface by MHC molecules is known as the immunopeptidome. Current mass spectrometry technologies allow for identification of large peptidomes, and studies have proven these data to be a rich source of information for learning the rules of MHC-mediated antigen presentation. Immunopeptidomes are usually poly-specific, containing multiple sequence motifs matching the MHC molecules expressed in the system under investigation. Motif deconvolution -the process of associating each ligand to its presenting MHC molecule(s)- is therefore a critical and challenging step in the analysis of MS-eluted MHC ligand data. Here, we describe NNAlign_MA, a computational method designed to address this challenge and fully benefit from large, poly-specific data sets of MS-eluted ligands. NNAlign_MA simultaneously performs the tasks of (1) clustering peptides into individual specificities; (2) automatic annotation of each cluster to an MHC molecule; and (3) training of a prediction model covering all MHCs present in the training set. NNAlign_MA was benchmarked on large and diverse data sets, covering class I and class II data. In all cases, the method was demonstrated to outperform state-of-the-art methods, effectively expanding the coverage of alleles for which accurate predictions can be made, resulting in improved identification of both eluted ligands and T-cell epitopes. Given its high flexibility and ease of use, we expect NNAlign_MA to serve as an effective tool to increase our understanding of the rules of MHC antigen presentation and guide the development of novel T-cell-based therapeutics.
Asunto(s)
Algoritmos , Biología Computacional/métodos , Epítopos de Linfocito T/metabolismo , Antígenos de Histocompatibilidad Clase II/metabolismo , Antígenos de Histocompatibilidad Clase I/metabolismo , Secuencias de Aminoácidos , Animales , Benchmarking , Bovinos , Línea Celular , Bases de Datos de Proteínas , Conjuntos de Datos como Asunto , Humanos , Ligandos , Aprendizaje Automático , Espectrometría de Masas , Péptidos/metabolismo , Unión ProteicaRESUMEN
BACKGROUND: Major histocompatibility complex class II (MHC-II) molecules present peptide fragments to T cells for immune recognition. Current predictors for peptide to MHC-II binding are trained on binding affinity data, generated in vitro and therefore lacking information about antigen processing. METHODS: We generate prediction models of peptide to MHC-II binding trained with naturally eluted ligands derived from mass spectrometry in addition to peptide binding affinity data sets. RESULTS: We show that integrated prediction models incorporate identifiable rules of antigen processing. In fact, we observed detectable signals of protease cleavage at defined positions of the ligands. We also hypothesize a role of the length of the terminal ligand protrusions for trimming the peptide to the MHC presented ligand. CONCLUSIONS: The results of integrating binding affinity and eluted ligand data in a combined model demonstrate improved performance for the prediction of MHC-II ligands and T cell epitopes and foreshadow a new generation of improved peptide to MHC-II prediction tools accounting for the plurality of factors that determine natural presentation of antigens.
Asunto(s)
Antígenos HLA-DR/metabolismo , Antígenos de Histocompatibilidad Clase I/metabolismo , Modelos Teóricos , Péptidos/metabolismo , Animales , Presentación de Antígeno , Línea Celular , Humanos , Ligandos , RatonesRESUMEN
A key event in the generation of a cellular response against malicious organisms through the endocytic pathway is binding of peptidic antigens by major histocompatibility complex class II (MHC class II) molecules. The bound peptide is then presented on the cell surface where it can be recognized by T helper lymphocytes. NetMHCIIpan is a state-of-the-art method for the quantitative prediction of peptide binding to any human or mouse MHC class II molecule of known sequence. In this paper, we describe an updated version of the method with improved peptide binding register identification. Binding register prediction is concerned with determining the minimal core region of nine residues directly in contact with the MHC binding cleft, a crucial piece of information both for the identification and design of CD4(+) T cell antigens. When applied to a set of 51 crystal structures of peptide-MHC complexes with known binding registers, the new method NetMHCIIpan-3.1 significantly outperformed the earlier 3.0 version. We illustrate the impact of accurate binding core identification for the interpretation of T cell cross-reactivity using tetramer double staining with a CMV epitope and its variants mapped to the epitope binding core. NetMHCIIpan is publicly available at http://www.cbs.dtu.dk/services/NetMHCIIpan-3.1 .
Asunto(s)
Biología Computacional/métodos , Antígenos de Histocompatibilidad Clase II/inmunología , Antígenos de Histocompatibilidad Clase II/metabolismo , Redes Neurales de la Computación , Fragmentos de Péptidos/química , Fragmentos de Péptidos/metabolismo , Algoritmos , Secuencia de Aminoácidos , Animales , Sitios de Unión , Análisis por Conglomerados , Bases de Datos de Proteínas , Epítopos/inmunología , Antígenos de Histocompatibilidad Clase II/química , Humanos , Ratones , Modelos Moleculares , Datos de Secuencia Molecular , Unión Proteica , Homología de Secuencia de AminoácidoRESUMEN
The strongest genetic influence on immune control in HIV-1 infection is the HLA class I genotype. Rapid disease progression in B-clade infection has been linked to HLA-B*35 expression, in particular to the less common HLA-B*3502 and HLA-B*3503 subtypes but also to the most prevalent subtype, HLA-B*3501. In these studies we first demonstrated that whereas HLA-B*3501 is associated with a high viral set point in two further B-clade-infected cohorts, in Japan and Mexico, this association does not hold in two large C-clade-infected African cohorts. We tested the hypothesis that clade-specific differences in HLA associations with disease outcomes may be related to distinct targeting of critical CD8(+) T-cell epitopes. We observed that only one epitope was significantly targeted differentially, namely, the Gag-specific epitope NPPIPVGDIY (NY10, Gag positions 253 to 262) (P = 2 × 10(-5)). In common with two other HLA-B*3501-restricted epitopes, in Gag and Nef, that were not targeted differentially, a response toward NY10 was associated with a significantly lower viral set point. Nonimmunogenicity of NY10 in B-clade-infected subjects derives from the Gag-D260E polymorphism present in â¼90% of B-clade sequences, which critically reduces recognition of the Gag NY10 epitope. These data suggest that in spite of any inherent HLA-linked T-cell receptor repertoire differences that may exist, maximizing the breadth of the Gag-specific CD8(+) T-cell response, by the addition of even a single epitope, may be of overriding importance in achieving immune control of HIV infection. This distinction is of direct relevance to development of vaccines designed to optimize the anti-HIV CD8(+) T-cell response in all individuals, irrespective of HLA type.