RESUMEN
BACKGROUND: IgE is the least abundant immunoglobulin and tightly regulated, and IgE-producing B cells are rare. The cellular origin and evolution of IgE responses are poorly understood. OBJECTIVE: The cellular and clonal origin of IgE memory responses following mucosal allergen exposure by sublingual immunotherapy (SLIT) were investigated. METHODS: In a randomized double-blind, placebo-controlled, time course SLIT study, PBMCs and nasal biopsy samples were collected from 40 adults with seasonal allergic rhinitis at baseline and at 4, 8, 16, 28, and 52 weeks. RNA was extracted from PBMCs, sorted B cells, and nasal biopsy samples for heavy chain variable gene repertoire sequencing. Moreover, mAbs were derived from single B-cell transcriptomes. RESULTS: Combining heavy chain variable gene repertoire sequencing and single-cell transcriptomics yielded direct evidence of a parallel boost of 2 clonally and functionally related B-cell subsets of short-lived IgE+ plasmablasts and IgG+ memory B cells. Mucosal grass pollen allergen exposure by SLIT resulted in highly diverse IgE and IgGE repertoires. These were extensively mutated and appeared relatively stable as per heavy chain isotype, somatic hypermutations, and clonal composition. Single IgGE+ memory B-cell and IgE+ preplasmablast transcriptomes encoded antibodies that were specific for major grass pollen allergens and able to elicit basophil activation at very low allergen concentrations. CONCLUSION: For the first time, we have shown that on mucosal allergen exposure, human IgE memory resides in allergen-specific IgG+ memory B cells. These cells rapidly switch isotype, expand into short-lived IgE+ plasmablasts, and serve as a potential target for therapeutic intervention.
Asunto(s)
Alérgenos/inmunología , Linfocitos B/inmunología , Inmunoglobulina E/inmunología , Memoria Inmunológica , Polen/inmunología , Rinitis Alérgica Estacional/inmunología , Adulto , Linfocitos B/patología , Método Doble Ciego , Femenino , Humanos , Masculino , Rinitis Alérgica Estacional/patologíaRESUMEN
BACKGROUND: Ragweed frequently causes seasonal allergies in North America and Europe. In the United States, several related ragweed species with diverse geographical distribution cause allergic symptoms. Cross-reactivity towards related ragweed species of IgE and treatment-induced IgG4 has been demonstrated previously. However, less is known about the underlying T-cell cross-reactivity. METHODS: The allergen content of ragweed extracts was determined by mass spectrometry and related to T-cell epitopes of Amb a allergens (group 1, 3, 4, 5, 8 and 11) in 20 American ragweed allergic patients determined by FluoroSpot and proliferation assays. T-cell responses to 50 frequently recognized Amb a-derived T-cell epitopes and homologous peptides from western ragweed (Amb p), giant ragweed (Amb t) and mugwort (Art v) were investigated in an additional 11 American and 14 Slovakian ragweed allergic donors. RESULTS: Ragweed extracts contained all known allergens and isoallergens thereof. Donor T-cell responses were diverse and directed against all Amb a 1 isoallergens and to most minor allergens investigated. Similar response patterns were seen in American and Slovakia donors. Several epitopes were cross-reactive between isoallergens and ragweed species, some even including mugwort. T-cell cross-reactivity generally correlated with allergen sequence homology. CONCLUSION: T-cell epitopes of multiple allergens/isoallergens are involved in the diverse T-cell responses in ragweed allergic individuals. T-cell lines were highly cross-reactive to epitopes of related ragweed species without any apparent geographical response bias. These data support that different ragweed species can be considered an allergen homology group with Amb a as the representative species regarding diagnosis as well as allergy immunotherapy.
Asunto(s)
Alérgenos/inmunología , Ambrosia/inmunología , Epítopos de Linfocito T/inmunología , Hipersensibilidad Inmediata/inmunología , Linfocitos T/inmunología , Adulto , Reacciones Cruzadas , Femenino , Humanos , MasculinoRESUMEN
The immune epitope database analysis resource (IEDB-AR: http://tools.iedb.org) is a collection of tools for prediction and analysis of molecular targets of T- and B-cell immune responses (i.e. epitopes). Since its last publication in the NAR webserver issue in 2008, a new generation of peptide:MHC binding and T-cell epitope predictive tools have been added. As validated by different labs and in the first international competition for predicting peptide:MHC-I binding, their predictive performances have improved considerably. In addition, a new B-cell epitope prediction tool was added, and the homology mapping tool was updated to enable mapping of discontinuous epitopes onto 3D structures. Furthermore, to serve a wider range of users, the number of ways in which IEDB-AR can be accessed has been expanded. Specifically, the predictive tools can be programmatically accessed using a web interface and can also be downloaded as software packages.
Asunto(s)
Epítopos de Linfocito B/química , Epítopos de Linfocito T/química , Programas Informáticos , Antígenos de Histocompatibilidad Clase I/metabolismo , Antígenos de Histocompatibilidad Clase II/metabolismo , Humanos , Internet , Péptidos/inmunología , Homología Estructural de ProteínaRESUMEN
Mycobacterium avium subspecies paratuberculosis (MAP) is a global concern in modern livestock production worldwide. The available vaccines against paratuberculosis do not offer optimal protection and interfere with the diagnosis of bovine tuberculosis. The aim of this study was to identify immunogenic MAP-specific peptides that do not interfere with the diagnosis of bovine tuberculosis. Initially, 119 peptides were selected by either (1) identifying unique MAP peptides that were predicted to bind to bovine major histocompatibility complex class II (MHC-predicted peptides) or (2) selecting hydrophobic peptides unique to MAP within proteins previously shown to be immunogenic (hydrophobic peptides). Subsequent testing of peptide-specific CD4+ T-cell lines from MAP-infected, adult goats vaccinated with peptides in cationic liposome adjuvant pointed to 23 peptides as being most immunogenic. These peptides were included in a second vaccine trial where three groups of eight healthy goat kids were vaccinated with 14 MHC-predicted peptides, nine hydrophobic peptides, or no peptides in o/w emulsion adjuvant. The majority of the MHC-predicted (93%) and hydrophobic peptides (67%) induced interferon-gamma (IFN-γ) responses in at least one animal. Similarly, 86% of the MHC-predicted and 89% of the hydrophobic peptides induced antibody responses in at least one goat. The immunization of eight healthy heifers with all 119 peptides formulated in emulsion adjuvant identified more peptides as immunogenic, as peptide specific IFN-γ and antibody responses in at least one heifer was found toward 84% and 24% of the peptides, respectively. No peptide-induced reactivity was found with commercial ELISAs for detecting antibodies against Mycobacterium bovis or MAP or when performing tuberculin skin testing for bovine tuberculosis. The vaccinated animals experienced adverse reactions at the injection site; thus, it is recommend that future studies make improvements to the vaccine formulation. In conclusion, immunogenic MAP-specific peptides that appeared promising for use in a vaccine against paratuberculosis without interfering with surveillance and trade tests for bovine tuberculosis were identified by in silico analysis and ex vivo generation of CD4+ T-cell lines and validated by the immunization of goats and cattle. Future studies should test different peptide combinations in challenge trials to determine their protective effect and identify the most MHC-promiscuous vaccine candidates.
Asunto(s)
Mycobacterium avium subsp. paratuberculosis , Paratuberculosis , Tuberculosis Bovina , Animales , Femenino , Bovinos , Paratuberculosis/prevención & control , Emulsiones , Vacunas Bacterianas , Interferón gamma/metabolismo , Anticuerpos Antibacterianos , Adyuvantes Inmunológicos , Cabras , Línea CelularRESUMEN
The identification of peptides binding to major histocompatibility complexes (MHC) is a critical step in the understanding of T cell immune responses. The human MHC genomic region (HLA) is extremely polymorphic comprising several thousand alleles, many encoding a distinct molecule. The potentially unique specificities remain experimentally uncharacterized for the vast majority of HLA molecules. Likewise, for nonhuman species, only a minor fraction of the known MHC molecules have been characterized. Here, we describe a tool, MHCcluster, to functionally cluster MHC molecules based on their predicted binding specificity. The method has a flexible web interface that allows the user to include any MHC of interest in the analysis. The output consists of a static heat map and graphical tree-based visualizations of the functional relationship between MHC variants and a dynamic TreeViewer interface where both the functional relationship and the individual binding specificities of MHC molecules are visualized. We demonstrate that conventional sequence-based clustering will fail to identify the functional relationship between molecules, when applied to MHC system, and only through the use of the predicted binding specificity can a correct clustering be found. Clustering of prevalent HLA-A and HLA-B alleles using MHCcluster confirms the presence of 12 major specificity groups (supertypes) some however with highly divergent specificities. Importantly, some HLA molecules are shown not to fit any supertype classification. Also, we use MHCcluster to show that chimpanzee MHC class I molecules have a reduced functional diversity compared to that of HLA class I molecules. MHCcluster is available at www.cbs.dtu.dk/services/MHCcluster-2.0.
Asunto(s)
Antígenos de Histocompatibilidad/genética , Inmunidad Celular , Complejo Mayor de Histocompatibilidad/genética , Linfocitos T/inmunología , Algoritmos , Animales , Sitios de Unión , Humanos , Internet , Pan troglodytes/genética , Pan troglodytes/inmunología , Unión Proteica , Programas Informáticos , Interfaz Usuario-ComputadorRESUMEN
The major histocompatibility complex (MHC) genes are the most polymorphic genes found in the vertebrate genome, and they encode proteins that play an essential role in the adaptive immune response. Many songbirds (passerines) have been shown to have a large number of transcribed MHC class I genes compared to most mammals. To elucidate the reason for this large number of genes, we compared 14 MHC class I alleles (α1-α3 domains), from great reed warbler, house sparrow and tree sparrow, via phylogenetic analysis, homology modelling and in silico peptide-binding predictions to investigate their functional and genetic relationships. We found more pronounced clustering of the MHC class I allomorphs (allele specific proteins) in regards to their function (peptide-binding specificities) compared to their genetic relationships (amino acid sequences), indicating that the high number of alleles is of functional significance. The MHC class I allomorphs from house sparrow and tree sparrow, species that diverged 10 million years ago (MYA), had overlapping peptide-binding specificities, and these similarities across species were also confirmed in phylogenetic analyses based on amino acid sequences. Notably, there were also overlapping peptide-binding specificities in the allomorphs from house sparrow and great reed warbler, although these species diverged 30 MYA. This overlap was not found in a tree based on amino acid sequences. Our interpretation is that convergent evolution on the level of the protein function, possibly driven by selection from shared pathogens, has resulted in allomorphs with similar peptide-binding repertoires, although trans-species evolution in combination with gene conversion cannot be ruled out.
Asunto(s)
Aves/genética , Biología Computacional/métodos , Evolución Molecular , Antígenos de Histocompatibilidad Clase I/genética , Péptidos , Secuencia de Aminoácidos , Animales , Sitios de Unión , Aves/clasificación , Aves/inmunología , Análisis por Conglomerados , Simulación por Computador , Antígenos de Histocompatibilidad Clase I/química , Antígenos de Histocompatibilidad Clase I/metabolismo , Modelos Moleculares , Datos de Secuencia Molecular , Péptidos/química , Péptidos/metabolismo , Filogenia , Unión Proteica , Conformación Proteica , Alineación de SecuenciaRESUMEN
The interaction between antibodies and antigens is one of the most important immune system mechanisms for clearing infectious organisms from the host. Antibodies bind to antigens at sites referred to as B-cell epitopes. Identification of the exact location of B-cell epitopes is essential in several biomedical applications such as; rational vaccine design, development of disease diagnostics and immunotherapeutics. However, experimental mapping of epitopes is resource intensive making in silico methods an appealing complementary approach. To date, the reported performance of methods for in silico mapping of B-cell epitopes has been moderate. Several issues regarding the evaluation data sets may however have led to the performance values being underestimated: Rarely, all potential epitopes have been mapped on an antigen, and antibodies are generally raised against the antigen in a given biological context not against the antigen monomer. Improper dealing with these aspects leads to many artificial false positive predictions and hence to incorrect low performance values. To demonstrate the impact of proper benchmark definitions, we here present an updated version of the DiscoTope method incorporating a novel spatial neighborhood definition and half-sphere exposure as surface measure. Compared to other state-of-the-art prediction methods, Discotope-2.0 displayed improved performance both in cross-validation and in independent evaluations. Using DiscoTope-2.0, we assessed the impact on performance when using proper benchmark definitions. For 13 proteins in the training data set where sufficient biological information was available to make a proper benchmark redefinition, the average AUC performance was improved from 0.791 to 0.824. Similarly, the average AUC performance on an independent evaluation data set improved from 0.712 to 0.727. Our results thus demonstrate that given proper benchmark definitions, B-cell epitope prediction methods achieve highly significant predictive performances suggesting these tools to be a powerful asset in rational epitope discovery. The updated version of DiscoTope is available at www.cbs.dtu.dk/services/DiscoTope-2.0.
Asunto(s)
Linfocitos B/inmunología , Benchmarking , Epítopos/inmunología , Epítopos/química , Humanos , Modelos Moleculares , Oportunidad RelativaRESUMEN
A key role in cell-mediated immunity is dedicated to the major histocompatibility complex (MHC) molecules that bind peptides for presentation on the cell surface. Several in silico methods capable of predicting peptide binding to MHC class I have been developed. The accuracy of these methods depends on the data available characterizing the binding specificity of the MHC molecules. It has, moreover, been demonstrated that consensus methods defined as combinations of two or more different methods led to improved prediction accuracy. This plethora of methods makes it very difficult for the non-expert user to choose the most suitable method for predicting binding to a given MHC molecule. In this study, we have therefore made an in-depth analysis of combinations of three state-of-the-art MHC-peptide binding prediction methods (NetMHC, NetMHCpan and PickPocket). We demonstrate that a simple combination of NetMHC and NetMHCpan gives the highest performance when the allele in question is included in the training and is characterized by at least 50 data points with at least ten binders. Otherwise, NetMHCpan is the best predictor. When an allele has not been characterized, the performance depends on the distance to the training data. NetMHCpan has the highest performance when close neighbours are present in the training set, while the combination of NetMHCpan and PickPocket outperforms either of the two methods for alleles with more remote neighbours. The final method, NetMHCcons, is publicly available at www.cbs.dtu.dk/services/NetMHCcons , and allows the user in an automatic manner to obtain the most accurate predictions for any given MHC molecule.
Asunto(s)
Biología Computacional/métodos , Antígenos de Histocompatibilidad Clase I/química , Complejo Mayor de Histocompatibilidad , Programas Informáticos , Algoritmos , Alelos , Simulación por Computador , Consenso , Antígenos de Histocompatibilidad Clase I/metabolismo , Humanos , Internet , Complejo Mayor de Histocompatibilidad/genética , Complejo Mayor de Histocompatibilidad/inmunología , Péptidos/química , Péptidos/metabolismo , Unión Proteica/inmunología , Reproducibilidad de los ResultadosRESUMEN
CPHmodels-3.0 is a web server predicting protein 3D structure by use of single template homology modeling. The server employs a hybrid of the scoring functions of CPHmodels-2.0 and a novel remote homology-modeling algorithm. A query sequence is first attempted modeled using the fast CPHmodels-2.0 profile-profile scoring function suitable for close homology modeling. The new computational costly remote homology-modeling algorithm is only engaged provided that no suitable PDB template is identified in the initial search. CPHmodels-3.0 was benchmarked in the CASP8 competition and produced models for 94% of the targets (117 out of 128), 74% were predicted as high reliability models (87 out of 117). These achieved an average RMSD of 4.6 A when superimposed to the 3D structure. The remaining 26% low reliably models (30 out of 117) could superimpose to the true 3D structure with an average RMSD of 9.3 A. These performance values place the CPHmodels-3.0 method in the group of high performing 3D prediction tools. Beside its accuracy, one of the important features of the method is its speed. For most queries, the response time of the server is <20 min. The web server is available at http://www.cbs.dtu.dk/services/CPHmodels/.
Asunto(s)
Programas Informáticos , Homología Estructural de Proteína , Algoritmos , Internet , Pliegue de Proteína , Reproducibilidad de los Resultados , Análisis de Secuencia de Proteína , Interfaz Usuario-ComputadorRESUMEN
SUMMARY: Major histocompatibility complex class II (MHC-II) molecules sample peptides from the extracellular space, allowing the immune system to detect the presence of foreign microbes from this compartment. To be able to predict the immune response to given pathogens, a number of methods have been developed to predict peptide-MHC binding. However, few methods other than the pioneering TEPITOPE/ProPred method have been developed for MHC-II. Despite recent progress in method development, the predictive performance for MHC-II remains significantly lower than what can be obtained for MHC-I. One reason for this is that the MHC-II molecule is open at both ends allowing binding of peptides extending out of the groove. The binding core of MHC-II-bound peptides is therefore not known a priori and the binding motif is hence not readily discernible. Recent progress has been obtained by including the flanking residues in the predictions. All attempts to make ab initio predictions based on protein structure have failed to reach predictive performances similar to those that can be obtained by data-driven methods. Thousands of different MHC-II alleles exist in humans. Recently developed pan-specific methods have been able to make reasonably accurate predictions for alleles that were not included in the training data. These methods can be used to define supertypes (clusters) of MHC-II alleles where alleles within each supertype have similar binding specificities. Furthermore, the pan-specific methods have been used to make a graphical atlas such as the MHCMotifviewer, which allows for visual comparison of specificities of different alleles.
Asunto(s)
Biología Computacional/métodos , Epítopos/inmunología , Epítopos/metabolismo , Antígenos de Histocompatibilidad Clase II/inmunología , Antígenos de Histocompatibilidad Clase II/metabolismo , Animales , Epítopos/química , Epítopos/genética , Antígenos de Histocompatibilidad Clase II/química , Antígenos de Histocompatibilidad Clase II/genética , Humanos , Unión Proteica/inmunologíaRESUMEN
SUMMARY: Over the last decade, in silico models of the major histocompatibility complex (MHC) class I pathway have developed significantly. Before, peptide binding could only be reliably modelled for a few major human or mouse histocompatibility molecules; now, high-accuracy predictions are available for any human leucocyte antigen (HLA) -A or -B molecule with known protein sequence. Furthermore, peptide binding to MHC molecules from several non-human primates, mouse strains and other mammals can now be predicted. In this review, a number of different prediction methods are briefly explained, highlighting the most useful and historically important. Selected case stories, where these 'reverse immunology' systems have been used in actual epitope discovery, are briefly reviewed. We conclude that this new generation of epitope discovery systems has become a highly efficient tool for epitope discovery, and recommend that the less accurate prediction systems of the past be abandoned, as these are obsolete.
Asunto(s)
Biología Computacional/métodos , Epítopos de Linfocito T/inmunología , Epítopos de Linfocito T/metabolismo , Antígenos de Histocompatibilidad Clase I/inmunología , Antígenos de Histocompatibilidad Clase I/metabolismo , Animales , Epítopos de Linfocito T/química , Antígenos de Histocompatibilidad Clase I/química , Humanos , Unión Proteica/inmunologíaRESUMEN
Reliable predictions of immunogenic peptides are essential in rational vaccine design and can minimize the experimental effort needed to identify epitopes. In this work, we describe a pan-specific major histocompatibility complex (MHC) class I epitope predictor, NetCTLpan. The method integrates predictions of proteasomal cleavage, transporter associated with antigen processing (TAP) transport efficiency, and MHC class I binding affinity into a MHC class I pathway likelihood score and is an improved and extended version of NetCTL. The NetCTLpan method performs predictions for all MHC class I molecules with known protein sequence and allows predictions for 8-, 9-, 10-, and 11-mer peptides. In order to meet the need for a low false positive rate, the method is optimized to achieve high specificity. The method was trained and validated on large datasets of experimentally identified MHC class I ligands and cytotoxic T lymphocyte (CTL) epitopes. It has been reported that MHC molecules are differentially dependent on TAP transport and proteasomal cleavage. Here, we did not find any consistent signs of such MHC dependencies, and the NetCTLpan method is implemented with fixed weights for proteasomal cleavage and TAP transport for all MHC molecules. The predictive performance of the NetCTLpan method was shown to outperform other state-of-the-art CTL epitope prediction methods. Our results further confirm the importance of using full-type human leukocyte antigen restriction information when identifying MHC class I epitopes. Using the NetCTLpan method, the experimental effort to identify 90% of new epitopes can be reduced by 15% and 40%, respectively, when compared to the NetMHCpan and NetCTL methods. The method and benchmark datasets are available at http://www.cbs.dtu.dk/services/NetCTLpan/.
Asunto(s)
Transportadoras de Casetes de Unión a ATP/metabolismo , Epítopos de Linfocito T , Antígenos de Histocompatibilidad Clase I/inmunología , Linfocitos T Citotóxicos/inmunología , Humanos , Complejo de la Endopetidasa Proteasomal/fisiología , Transporte de ProteínasRESUMEN
MOTIVATION: MHC:peptide binding plays a central role in activating the immune surveillance. Computational approaches to determine T-cell epitopes restricted to any given major histocompatibility complex (MHC) molecule are of special practical value in the development of for instance vaccines with broad population coverage against emerging pathogens. Methods have recently been published that are able to predict peptide binding to any human MHC class I molecule. In contrast to conventional allele-specific methods, these methods do allow for extrapolation to uncharacterized MHC molecules. These pan-specific human lymphocyte antigen (HLA) predictors have not previously been compared using independent evaluation sets. RESULT: A diverse set of quantitative peptide binding affinity measurements was collected from Immune Epitope database (IEDB), together with a large set of HLA class I ligands from the SYFPEITHI database. Based on these datasets, three different pan-specific HLA web-accessible predictors NetMHCpan, adaptive double threading (ADT) and kernel-based inter-allele peptide binding prediction system (KISS) were evaluated. The performance of the pan-specific predictors was also compared with a well performing allele-specific MHC class I predictor, NetMHC, as well as a consensus approach integrating the predictions from the NetMHC and NetMHCpan methods. CONCLUSIONS: The benchmark demonstrated that pan-specific methods do provide accurate predictions also for previously uncharacterized MHC molecules. The NetMHCpan method trained to predict actual binding affinities was consistently top ranking both on quantitative (affinity) and binary (ligand) data. However, the KISS method trained to predict binary data was one of the best performing methods when benchmarked on binary data. Finally, a consensus method integrating predictions from the two best performing methods was shown to improve the prediction accuracy.
Asunto(s)
Biología Computacional/métodos , Antígenos de Histocompatibilidad Clase I/inmunología , Alelos , Área Bajo la Curva , Bases de Datos de Proteínas , Antígenos HLA/inmunología , Antígenos de Histocompatibilidad Clase I/química , Humanos , Ligandos , Péptidos/inmunología , Unión Proteica , Estadísticas no ParamétricasRESUMEN
NetMHC-3.0 is trained on a large number of quantitative peptide data using both affinity data from the Immune Epitope Database and Analysis Resource (IEDB) and elution data from SYFPEITHI. The method generates high-accuracy predictions of major histocompatibility complex (MHC): peptide binding. The predictions are based on artificial neural networks trained on data from 55 MHC alleles (43 Human and 12 non-human), and position-specific scoring matrices (PSSMs) for additional 67 HLA alleles. As only the MHC class I prediction server is available, predictions are possible for peptides of length 8-11 for all 122 alleles. artificial neural network predictions are given as actual IC(50) values whereas PSSM predictions are given as a log-odds likelihood scores. The output is optionally available as download for easy post-processing. The training method underlying the server is the best available, and has been used to predict possible MHC-binding peptides in a series of pathogen viral proteomes including SARS, Influenza and HIV, resulting in an average of 75-80% confirmed MHC binders. Here, the performance is further validated and benchmarked using a large set of newly published affinity data, non-redundant to the training set. The server is free of use and available at: http://www.cbs.dtu.dk/services/NetMHC.
Asunto(s)
Antígenos HLA/metabolismo , Antígenos de Histocompatibilidad Clase I/metabolismo , Péptidos/inmunología , Programas Informáticos , Alelos , Animales , Epítopos/química , Antígenos HLA/genética , Haplorrinos/genética , Antígenos de Histocompatibilidad Clase I/genética , Humanos , Internet , Ratones , Péptidos/químicaRESUMEN
We present a new release of the immune epitope database analysis resource (IEDB-AR, http://tools.immuneepitope.org), a repository of web-based tools for the prediction and analysis of immune epitopes. New functionalities have been added to most of the previously implemented tools, and a total of eight new tools were added, including two B-cell epitope prediction tools, four T-cell epitope prediction tools and two analysis tools.
Asunto(s)
Epítopos de Linfocito B/química , Epítopos de Linfocito T/química , Programas Informáticos , Gráficos por Computador , Bases de Datos Factuales , Epítopos de Linfocito B/inmunología , Epítopos de Linfocito T/inmunología , Antígenos de Histocompatibilidad Clase I/metabolismo , Antígenos de Histocompatibilidad Clase II/metabolismo , Internet , Péptidos/química , Péptidos/inmunología , Proteínas/química , Proteínas/inmunologíaRESUMEN
BACKGROUND: Estimation of the reliability of specific real value predictions is nontrivial and the efficacy of this is often questionable. It is important to know if you can trust a given prediction and therefore the best methods associate a prediction with a reliability score or index. For discrete qualitative predictions, the reliability is conventionally estimated as the difference between output scores of selected classes. Such an approach is not feasible for methods that predict a biological feature as a single real value rather than a classification. As a solution to this challenge, we have implemented a method that predicts the relative surface accessibility of an amino acid and simultaneously predicts the reliability for each prediction, in the form of a Z-score. RESULTS: An ensemble of artificial neural networks has been trained on a set of experimentally solved protein structures to predict the relative exposure of the amino acids. The method assigns a reliability score to each surface accessibility prediction as an inherent part of the training process. This is in contrast to the most commonly used procedures where reliabilities are obtained by post-processing the output. CONCLUSION: The performance of the neural networks was evaluated on a commonly used set of sequences known as the CB513 set. An overall Pearson's correlation coefficient of 0.72 was obtained, which is comparable to the performance of the currently best public available method, Real-SPINE. Both methods associate a reliability score with the individual predictions. However, our implementation of reliability scores in the form of a Z-score is shown to be the more informative measure for discriminating good predictions from bad ones in the entire range from completely buried to fully exposed amino acids. This is evident when comparing the Pearson's correlation coefficient for the upper 20% of predictions sorted according to reliability. For this subset, values of 0.79 and 0.74 are obtained using our and the compared method, respectively. This tendency is true for any selected subset.
Asunto(s)
Proteínas/química , Solventes/química , Algoritmos , Biología Computacional , Bases de Datos de Proteínas , Redes Neurales de la ComputaciónRESUMEN
UNLABELLED: Several accurate prediction systems have been developed for prediction of class I major histocompatibility complex (MHC):peptide binding. Most of these are trained on binding affinity data of primarily 9mer peptides. Here, we show how prediction methods trained on 9mer data can be used for accurate binding affinity prediction of peptides of length 8, 10 and 11. The method gives the opportunity to predict peptides with a different length than nine for MHC alleles where no such peptides have been measured. As validation, the performance of this approach is compared to predictors trained on peptides of the peptide length in question. In this validation, the approximation method has an accuracy that is comparable to or better than methods trained on a peptide length identical to the predicted peptides. AVAILABILITY: The algorithm has been implemented in the web-accessible servers NetMHC-3.0: http://www.cbs.dtu.dk/services/NetMHC-3.0, and NetMHCpan-1.1: http://www.cbs.dtu.dk/services/NetMHCpan-1.1
Asunto(s)
Algoritmos , Antígenos de Histocompatibilidad Clase I/química , Péptidos/química , Mapeo de Interacción de Proteínas/métodos , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Inteligencia Artificial , Sitios de Unión , Unión Proteica , Reproducibilidad de los Resultados , Sensibilidad y EspecificidadRESUMEN
CD4 positive T helper cells control many aspects of specific immunity. These cells are specific for peptides derived from protein antigens and presented by molecules of the extremely polymorphic major histocompatibility complex (MHC) class II system. The identification of peptides that bind to MHC class II molecules is therefore of pivotal importance for rational discovery of immune epitopes. HLA-DR is a prominent example of a human MHC class II. Here, we present a method, NetMHCIIpan, that allows for pan-specific predictions of peptide binding to any HLA-DR molecule of known sequence. The method is derived from a large compilation of quantitative HLA-DR binding events covering 14 of the more than 500 known HLA-DR alleles. Taking both peptide and HLA sequence information into account, the method can generalize and predict peptide binding also for HLA-DR molecules where experimental data is absent. Validation of the method includes identification of endogenously derived HLA class II ligands, cross-validation, leave-one-molecule-out, and binding motif identification for hitherto uncharacterized HLA-DR molecules. The validation shows that the method can successfully predict binding for HLA-DR molecules-even in the absence of specific data for the particular molecule in question. Moreover, when compared to TEPITOPE, currently the only other publicly available prediction method aiming at providing broad HLA-DR allelic coverage, NetMHCIIpan performs equivalently for alleles included in the training of TEPITOPE while outperforming TEPITOPE on novel alleles. We propose that the method can be used to identify those hitherto uncharacterized alleles, which should be addressed experimentally in future updates of the method to cover the polymorphism of HLA-DR most efficiently. We thus conclude that the presented method meets the challenge of keeping up with the MHC polymorphism discovery rate and that it can be used to sample the MHC "space," enabling a highly efficient iterative process for improving MHC class II binding predictions.
Asunto(s)
Antígenos HLA-DR/metabolismo , Mapeo de Interacción de Proteínas/métodos , Algoritmos , Alelos , Secuencia de Aminoácidos/fisiología , Sitios de Unión/genética , Sitios de Unión/inmunología , Bases de Datos de Proteínas , Antígenos HLA-DR/genética , Antígenos HLA-DR/inmunología , Humanos , Complejo Mayor de Histocompatibilidad/genética , Datos de Secuencia Molecular , Valor Predictivo de las Pruebas , Unión Proteica/inmunología , Reproducibilidad de los Resultados , Alineación de Secuencia , Análisis de Secuencia de ProteínaRESUMEN
Human leukocyte antigen class I (HLA-I) molecules are highly polymorphic peptide receptors, which select and present endogenously derived peptide epitopes to CD8+ cytotoxic T cells (CTL). The specificity of the HLA-I system is an important component of the overall specificity of the CTL immune system. Unfortunately, the large and rapidly increasing number of known HLA-I molecules seriously complicates a comprehensive analysis of the specificities of the entire HLA-I system (as of June 2008, the international HLA registry holds >1,650 unique HLA-I protein entries). In an attempt to reduce this complexity, it has been suggested to cluster the different HLA-I molecules into "supertypes" of largely overlapping peptide-binding specificities. Obviously, the HLA supertype concept is only valuable if membership can be assigned with reasonable accuracy. The supertype assignment of HLA-A*3001, a common HLA haplotype in populations of African descent, has variously been assigned to the A1, A3, or A24 supertypes. Using a biochemical HLA-A*3001 binding assay, and a large panel of nonamer peptides and peptide libraries, we here demonstrate that the specificity of HLA-A*3001 most closely resembles that of the HLA-A3 supertype. We discuss approaches to supertype assignment and underscore the importance of experimental verification.
Asunto(s)
Genes MHC Clase I , Antígenos HLA-A/química , Oligopéptidos/metabolismo , Secuencias de Aminoácidos , Secuencia de Aminoácidos , Técnicas Químicas Combinatorias , Antígenos HLA-A/genética , Antígenos HLA-A/metabolismo , Humanos , Familia de Multigenes , Biblioteca de Péptidos , Filogenia , Unión Proteica , Especificidad por SustratoRESUMEN
MOTIVATION: Immunological bioinformatics methods are applicable to a broad range of scientific areas. The specifics of how and where they might be implemented have recently been reviewed in the literature. However, the background and concerns for selecting between the different available methods have so far not been adequately covered. SUMMARY: Before using predictions systems, it is necessary to not only understand how the methods are constructed but also their strength and limitations. The prediction systems in humoral epitope discovery are still in their infancy, but have reached a reasonable level of predictive strength. In cellular immunology, MHC class I binding predictions are now very strong and cover most of the known HLA specificities. These systems work well for epitope discovery, and predictions of the MHC class I pathway have been further improved by integration with state-of-the-art prediction tools for proteasomal cleavage and TAP binding. By comparison, class II MHC binding predictions have not developed to a comparable accuracy level, but new tools have emerged that deliver significantly improved predictions not only in terms of accuracy, but also in MHC specificity coverage. Simulation systems and mathematical modeling are also now beginning to reach a level where these methods will be able to answer more complex immunological questions.