RESUMEN
T cell recognition of a cognate peptide-major histocompatibility complex (pMHC) presented on the surface of infected or malignant cells is of the utmost importance for mediating robust and long-term immune responses. Accurate predictions of cognate pMHC targets for T cell receptors would greatly facilitate identification of vaccine targets for both pathogenic diseases and personalized cancer immunotherapies. Predicting immunogenic peptides therefore has been at the center of intensive research for the past decades but has proven challenging. Although numerous models have been proposed, performance of these models has not been systematically evaluated and their success rate in predicting epitopes in the context of human pathology has not been measured and compared. In this study, we evaluated the performance of several publicly available models, in identifying immunogenic CD8+ T cell targets in the context of pathogens and cancers. We found that for predicting immunogenic peptides from an emerging virus such as severe acute respiratory syndrome coronavirus 2, none of the models perform substantially better than random or offer considerable improvement beyond HLA ligand prediction. We also observed suboptimal performance for predicting cancer neoantigens. Through investigation of potential factors associated with ill performance of models, we highlight several data- and model-associated issues. In particular, we observed that cross-HLA variation in the distribution of immunogenic and non-immunogenic peptides in the training data of the models seems to substantially confound the predictions. We additionally compared key parameters associated with immunogenicity between pathogenic peptides and cancer neoantigens and observed evidence for differences in the thresholds of binding affinity and stability, which suggested the need to modulate different features in identifying immunogenic pathogen versus cancer peptides. Overall, we demonstrate that accurate and reliable predictions of immunogenic CD8+ T cell targets remain unsolved; thus, we hope our work will guide users and model developers regarding potential pitfalls and unsettled questions in existing immunogenicity predictors.
Asunto(s)
COVID-19 , Neoplasias , Linfocitos T CD8-positivos/metabolismo , Simulación por Computador , Epítopos de Linfocito T , Humanos , PéptidosRESUMEN
The conditions and extent of cross-protective immunity between severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and common-cold human coronaviruses (HCoVs) remain open despite several reports of pre-existing T cell immunity to SARS-CoV-2 in individuals without prior exposure. Using a pool of functionally evaluated SARS-CoV-2 peptides, we report a map of 126 immunogenic peptides with high similarity to 285 MHC-presented peptides from at least one HCoV. Employing this map of SARS-CoV-2-non-homologous and homologous immunogenic peptides, we observe several immunogenic peptides with high similarity to human proteins, some of which have been reported to have elevated expression in severe COVID-19 patients. After combining our map with SARS-CoV-2-specific TCR repertoire data from COVID-19 patients and healthy controls, we show that public repertoires for the majority of convalescent patients are dominated by TCRs cognate to non-homologous SARS-CoV-2 peptides. We find that for a subset of patients, >50% of their public SARS-CoV-2-specific repertoires consist of TCRs cognate to homologous SARS-CoV-2-HCoV peptides. Further analysis suggests that this skewed distribution of TCRs cognate to homologous or non-homologous peptides in COVID-19 patients is likely to be HLA-dependent. Finally, we provide 10 SARS-CoV-2 peptides with known cognate TCRs that are conserved across multiple coronaviruses and are predicted to be recognized by a high proportion of the global population. These findings may have important implications for COVID-19 heterogeneity, vaccine-induced immune responses, and robustness of immunity to SARS-CoV-2 and its variants.
Asunto(s)
COVID-19 , SARS-CoV-2 , Linfocitos T CD8-positivos , Reacciones Cruzadas , Epítopos de Linfocito T , Humanos , Péptidos , Receptores de Antígenos de Linfocitos T , Glicoproteína de la Espiga del CoronavirusRESUMEN
T cell recognition of SARS-CoV-2 antigens after vaccination and/or natural infection has played a central role in resolving SARS-CoV-2 infections and generating adaptive immune memory. However, the clinical impact of SARS-CoV-2-specific T cell responses is variable and the mechanisms underlying T cell interaction with target antigens are not fully understood. This is especially true given the virus' rapid evolution, which leads to new variants with immune escape capacity. In this study, we used the Omicron variant as a model organism and took a systems approach to evaluate the impact of mutations on CD8+ T cell immunogenicity. We computed an immunogenicity potential score for each SARS-CoV-2 peptide antigen from the ancestral strain and Omicron, capturing both antigen presentation and T cell recognition probabilities. By comparing ancestral vs. Omicron immunogenicity scores, we reveal a divergent and heterogeneous landscape of impact for CD8+ T cell recognition of mutated targets in Omicron variants. While T cell recognition of Omicron peptides is broadly preserved, we observed mutated peptides with deteriorated immunogenicity that may assist breakthrough infection in some individuals. We then combined our scoring scheme with an in silico mutagenesis, to characterise the position- and residue-specific theoretical mutational impact on immunogenicity. While we predict many escape trajectories from the theoretical landscape of substitutions, our study suggests that Omicron mutations in T cell epitopes did not develop under cell-mediated pressure. Our study provides a generalisable platform for fostering a deeper understanding of existing and novel variant impact on antigen-specific vaccine- and/or infection-induced T cell immunity.
RESUMEN
BACKGROUND: T-cells play a crucial role in the adaptive immune system by triggering responses against cancer cells and pathogens, while maintaining tolerance against self-antigens, which has sparked interest in the development of various T-cell-focused immunotherapies. However, the identification of antigens recognised by T-cells is low-throughput and laborious. To overcome some of these limitations, computational methods for predicting CD8 + T-cell epitopes have emerged. Despite recent developments, most immunogenicity algorithms struggle to learn features of peptide immunogenicity from small datasets, suffer from HLA bias and are unable to reliably predict pathology-specific CD8 + T-cell epitopes. METHODS: We developed TRAP (T-cell recognition potential of HLA-I presented peptides), a robust deep learning workflow for predicting CD8 + T-cell epitopes from MHC-I presented pathogenic and self-peptides. TRAP uses transfer learning, deep learning architecture and MHC binding information to make context-specific predictions of CD8 + T-cell epitopes. TRAP also detects low-confidence predictions for peptides that differ significantly from those in the training datasets to abstain from making incorrect predictions. To estimate the immunogenicity of pathogenic peptides with low-confidence predictions, we further developed a novel metric, RSAT (relative similarity to autoantigens and tumour-associated antigens), as a complementary to 'dissimilarity to self' from cancer studies. RESULTS: TRAP was used to identify epitopes from glioblastoma patients as well as SARS-CoV-2 peptides, and it outperformed other algorithms in both cancer and pathogenic settings. TRAP was especially effective at extracting immunogenicity-associated properties from restricted data of emerging pathogens and translating them onto related species, as well as minimising the loss of likely epitopes in imbalanced datasets. We also demonstrated that the novel metric termed RSAT was able to estimate immunogenic of pathogenic peptides of various lengths and species. TRAP implementation is available at: https://github.com/ChloeHJ/TRAP . CONCLUSIONS: This study presents a novel computational workflow for accurately predicting CD8 + T-cell epitopes to foster a better understanding of antigen-specific T-cell response and the development of effective clinical therapeutics.
Asunto(s)
COVID-19 , Aprendizaje Profundo , Humanos , Epítopos de Linfocito T , Flujo de Trabajo , SARS-CoV-2 , Linfocitos T CD8-positivosRESUMEN
While individuals infected with coronavirus disease 2019 (COVID-19) manifested a broad range in susceptibility and severity to the disease, the pre-existing immune memory to related pathogens cross-reactive against SARS-CoV-2 can influence the disease outcome in COVID-19. Here, we investigated the potential extent of T cell cross-reactivity against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) that can be conferred by other coronaviruses and influenza virus, and generated an in silico map of public and private CD8+ T cell epitopes between coronaviruses. We observed 794 predicted SARS-CoV-2 epitopes of which 52% were private and 48% were public. Ninety-nine percent of the public epitopes were shared with SARS-CoV and 5.4% were shared with either one of four common coronaviruses, 229E, HKU1, NL63, and OC43. Moreover, to assess the potential risk of self-reactivity and/or diminished T cell response for peptides identical or highly similar to the host, we identified predicted epitopes with high sequence similarity with human proteome. Lastly, we compared predicted epitopes from coronaviruses with epitopes from influenza virus deposited in IEDB, and found only a small number of peptides with limited potential for cross-reactivity between the two virus families. We believe our comprehensive in silico profile of private and public epitopes across coronaviruses would facilitate design of vaccines, and provide insights into the presence of pre-existing coronavirus-specific memory CD8+ T cells that may influence immune responses against SARS-CoV-2.
Asunto(s)
Linfocitos T CD8-positivos/inmunología , Coronavirus/inmunología , Reacciones Cruzadas , SARS-CoV-2/inmunología , Secuencia de Aminoácidos , Vacunas contra la COVID-19/inmunología , Simulación por Computador , Bases de Datos Factuales , Epítopos de Linfocito T/inmunología , Humanos , Orthomyxoviridae/inmunologíaRESUMEN
Novel adjuvant technologies have a key role in the development of next-generation vaccines, due to their capacity to modulate the duration, strength and quality of the immune response. The AS01 adjuvant is used in the malaria vaccine RTS,S/AS01 and in the licensed herpes-zoster vaccine (Shingrix) where the vaccine has proven its ability to generate protective responses with both robust humoral and T-cell responses. For many years, animal models have provided insights into adjuvant mode-of-action (MoA), generally through investigating individual genes or proteins. Furthermore, modeling and simulation techniques can be utilized to integrate a variety of different data types; ranging from serum biomarkers to large scale "omics" datasets. In this perspective we present a framework to create a holistic integration of pre-clinical datasets and immunological literature in order to develop an evidence-based hypothesis of AS01 adjuvant MoA, creating a unified view of multiple experiments. Furthermore, we highlight how holistic systems-knowledge can serve as a basis for the construction of models and simulations supporting exploration of key questions surrounding adjuvant MoA. Using the Systems-Biology-Graphical-Notation, a tool for graphical representation of biological processes, we have captured high-level cellular behaviors and interactions, and cytokine dynamics during the early immune response, which are substantiated by a series of diagrams detailing cellular dynamics. Through explicitly describing AS01 MoA we have built a consensus of understanding across multiple experiments, and so we present a framework to integrate modeling approaches into exploring adjuvant MoA, in order to guide experimental design, interpret results and inform rational design of vaccines.