RESUMO
Increasing evidence indicates CD4+ T cells can recognize cancer-specific antigens and control tumor growth. However, it remains difficult to predict the antigens that will be presented by human leukocyte antigen class II molecules (HLA-II), hindering efforts to optimally target them therapeutically. Obstacles include inaccurate peptide-binding prediction and unsolved complexities of the HLA-II pathway. To address these challenges, we developed an improved technology for discovering HLA-II binding motifs and conducted a comprehensive analysis of tumor ligandomes to learn processing rules relevant in the tumor microenvironment. We profiled >40 HLA-II alleles and showed that binding motifs were highly sensitive to HLA-DM, a peptide-loading chaperone. We also revealed that intratumoral HLA-II presentation was dominated by professional antigen-presenting cells (APCs) rather than cancer cells. Integrating these observations, we developed algorithms that accurately predicted APC ligandomes, including peptides from phagocytosed cancer cells. These tools and biological insights will enable improved HLA-II-directed cancer therapies.
Assuntos
Células Apresentadoras de Antígenos/imunologia , Linfócitos T CD4-Positivos/imunologia , Vacinas Anticâncer/imunologia , Mapeamento de Epitopos/métodos , Antígenos HLA/metabolismo , Antígenos de Histocompatibilidade Classe II/genética , Imunoterapia/métodos , Espectrometria de Massas/métodos , Neoplasias/terapia , Algoritmos , Alelos , Apresentação de Antígeno , Antígenos de Neoplasias/imunologia , Antígenos de Neoplasias/metabolismo , Conjuntos de Dados como Assunto , Antígenos HLA/genética , Antígenos HLA-D/metabolismo , Humanos , Neoplasias/imunologia , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas/genética , SoftwareRESUMO
The enzyme-linked immunosorbent spot (ELISpot) assay is a powerful in vitro immunoassay that enables cost-effective quantification of antigen-specific T-cell reactivity. It is used widely in the context of cancer and infectious diseases to validate the immunogenicity of predicted epitopes. While technological advances have kept pace with the demand for increased throughput, efforts to increase scale are bottlenecked by current assay design and deconvolution methods, which have remained largely unchanged. Current methods for designing pooled ELISpot experiments offer limited flexibility of assay parameters, lack support for high-throughput scenarios and do not consider peptide identity during pool assignment. We introduce the ACE Configurator for ELISpot (ACE) to address these gaps. ACE generates optimized peptide-pool assignments from highly customizable user inputs and handles the deconvolution of positive peptides using assay readouts. In this study, we present a novel sequence-aware pooling strategy, powered by a fine-tuned ESM-2 model that groups immunologically similar peptides, reducing the number of false positives and subsequent confirmatory assays compared to existing combinatorial approaches. To validate ACE's performance on real-world datasets, we conducted a comprehensive benchmark study, contextualizing design choices with their impact on prediction quality. Our results demonstrate ACE's capacity to further increase precision of identified immunogenic peptides, directly optimizing experimental efficiency. ACE is freely available as an executable with a graphical user interface and command-line interfaces at https://github.com/pirl-unc/ace.
Assuntos
Benchmarking , Imunoadsorventes , Epitopos , PeptídeosRESUMO
MOTIVATION: Elimination of cancer cells by T cells is a critical mechanism of anti-tumor immunity and cancer immunotherapy response. T cells recognize cancer cells by engagement of T cell receptors with peptide epitopes presented by major histocompatibility complex molecules on the cancer cell surface. Peptide epitopes can be derived from antigen proteins coded for by multiple genomic sources. Bioinformatics tools used to identify tumor-specific epitopes via analysis of DNA and RNA-sequencing data have largely focused on epitopes derived from somatic variants, though a smaller number have evaluated potential antigens from other genomic sources. RESULTS: We report here an open-source workflow utilizing the Nextflow DSL2 workflow manager, Landscape of Effective Neoantigens Software (LENS), which predicts tumor-specific and tumor-associated antigens from single nucleotide variants, insertions and deletions, fusion events, splice variants, cancer-testis antigens, overexpressed self-antigens, viruses, and endogenous retroviruses. The primary advantage of LENS is that it expands the breadth of genomic sources of discoverable tumor antigens using genomics data. Other advantages include modularity, extensibility, ease of use, and harmonization of relative expression level and immunogenicity prediction across multiple genomic sources. We present an analysis of 115 acute myeloid leukemia samples to demonstrate the utility of LENS. We expect LENS will be a valuable platform and resource for T cell epitope discovery bioinformatics, especially in cancers with few somatic variants where tumor-specific epitopes from alternative genomic sources are an elevated priority. AVAILABILITY AND IMPLEMENTATION: More information about LENS, including code, workflow documentation, and instructions, can be found at (https://gitlab.com/landscape-of-effective-neoantigens-software).
Assuntos
Neoplasias , Masculino , Humanos , Antígenos de Neoplasias/genética , Epitopos de Linfócito T/genética , Peptídeos , SoftwareRESUMO
BACKGROUND: Early in the pandemic, we designed a SARS-CoV-2 peptide vaccine containing epitope regions optimized for concurrent B cell, CD4+ T cell, and CD8+ T cell stimulation. The rationale for this design was to drive both humoral and cellular immunity with high specificity while avoiding undesired effects such as antibody-dependent enhancement (ADE). METHODS: We explored the set of computationally predicted SARS-CoV-2 HLA-I and HLA-II ligands, examining protein source, concurrent human/murine coverage, and population coverage. Beyond MHC affinity, T cell vaccine candidates were further refined by predicted immunogenicity, sequence conservation, source protein abundance, and coverage of high frequency HLA alleles. B cell epitope regions were chosen from linear epitope mapping studies of convalescent patient serum, followed by filtering for surface accessibility, sequence conservation, spatial localization near functional domains of the spike glycoprotein, and avoidance of glycosylation sites. RESULTS: From 58 initial candidates, three B cell epitope regions were identified. From 3730 (MHC-I) and 5045 (MHC-II) candidate ligands, 292 CD8+ and 284 CD4+ T cell epitopes were identified. By combining these B cell and T cell analyses, as well as a manufacturability heuristic, we proposed a set of 22 SARS-CoV-2 vaccine peptides for use in subsequent murine studies. We curated a dataset of ~ 1000 observed T cell epitopes from convalescent COVID-19 patients across eight studies, showing 8/15 recurrent epitope regions to overlap with at least one of our candidate peptides. Of the 22 candidate vaccine peptides, 16 (n = 10 T cell epitope optimized; n = 6 B cell epitope optimized) were manually selected to decrease their degree of sequence overlap and then synthesized. The immunogenicity of the synthesized vaccine peptides was validated using ELISpot and ELISA following murine vaccination. Strong T cell responses were observed in 7/10 T cell epitope optimized peptides following vaccination. Humoral responses were deficient, likely due to the unrestricted conformational space inhabited by linear vaccine peptides. CONCLUSIONS: Overall, we find our selection process and vaccine formulation to be appropriate for identifying T cell epitopes and eliciting T cell responses against those epitopes. Further studies are needed to optimize prediction and induction of B cell responses, as well as study the protective capacity of predicted T and B cell epitopes.
Assuntos
Vacinas contra COVID-19/administração & dosagem , COVID-19/prevenção & controle , Biologia Computacional/métodos , Epitopos de Linfócito B/química , Epitopos de Linfócito T/química , Sequência de Aminoácidos , Animais , COVID-19/virologia , Vacinas contra COVID-19/química , Epitopos de Linfócito B/imunologia , Epitopos de Linfócito T/imunologia , Feminino , Antígenos de Histocompatibilidade Classe I/química , Antígenos de Histocompatibilidade Classe I/metabolismo , Antígenos de Histocompatibilidade Classe II/química , Antígenos de Histocompatibilidade Classe II/metabolismo , Humanos , Masculino , Camundongos , Camundongos Endogâmicos BALB C , Peptídeos/química , Peptídeos/imunologia , SARS-CoV-2/isolamento & purificação , SARS-CoV-2/metabolismo , Glicoproteína da Espícula de Coronavírus/química , Glicoproteína da Espícula de Coronavírus/imunologiaRESUMO
MHCflurry is an open source package for peptide/MHC I binding affinity prediction. Its command-line and programmatic interfaces make it well-suited for integration into high-throughput bioinformatic pipelines. Users can download models fit to publicly available data or train predictors on their own affinity measurements or mass spec datasets. This chapter gives a tutorial on essential MHCflurry functionality, including generating predictions, training new models, and using the MHCflurry Python interface. MHCflurry is available at https://github.com/openvax/mhcflurry .
Assuntos
Antígenos de Histocompatibilidade Classe I/metabolismo , Peptídeos/metabolismo , Software , Biologia Computacional/métodos , Humanos , Ligantes , Peptídeos/química , Ligação ProteicaRESUMO
OpenVax is a computational workflow for identifying somatic variants, predicting neoantigens, and selecting the contents of personalized cancer vaccines. It is a Dockerized end-to-end pipeline that takes as input raw tumor/normal sequencing data. It is currently used in three clinical trials (NCT02721043, NCT03223103, and NCT03359239). In this chapter, we describe how to install and use OpenVax, as well as how to interpret the generated results.
Assuntos
Antígenos de Neoplasias/genética , Genômica/métodos , Neoplasias/genética , Software , Vacinas Anticâncer/genética , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Fluxo de TrabalhoRESUMO
Computational prediction of the peptides presented on major histocompatibility complex (MHC) class I proteins is an important tool for studying T cell immunity. The data available to develop such predictors have expanded with the use of mass spectrometry to identify naturally presented MHC ligands. In addition to elucidating binding motifs, the identified ligands also reflect the antigen processing steps that occur prior to MHC binding. Here, we developed an integrated predictor of MHC class I presentation that combines new models for MHC class I binding and antigen processing. Considering only peptides first predicted by the binding model to bind strongly to MHC, the antigen processing model is trained to discriminate published mass spectrometry-identified MHC class I ligands from unobserved peptides. The integrated model outperformed the two individual components as well as NetMHCpan 4.0 and MixMHCpred 2.0.2 on held-out mass spectrometry experiments. Our predictors are implemented in the open source MHCflurry package, version 2.0 (github.com/openvax/mhcflurry).
Assuntos
Alelos , Apresentação de Antígeno/genética , Antígenos de Histocompatibilidade Classe I/genética , Peptídeos/química , HumanosRESUMO
There is an urgent need for a vaccine with efficacy against SARS-CoV-2. We hypothesize that peptide vaccines containing epitope regions optimized for concurrent B cell, CD4+ T cell, and CD8+ T cell stimulation would drive both humoral and cellular immunity with high specificity, potentially avoiding undesired effects such as antibody-dependent enhancement (ADE). Additionally, such vaccines can be rapidly manufactured in a distributed manner. In this study, we combine computational prediction of T cell epitopes, recently published B cell epitope mapping studies, and epitope accessibility to select candidate peptide vaccines for SARS-CoV-2. We begin with an exploration of the space of possible T cell epitopes in SARS-CoV-2 with interrogation of predicted HLA-I and HLA-II ligands, overlap between predicted ligands, protein source, as well as concurrent human/murine coverage. Beyond MHC affinity, T cell vaccine candidates were further refined by predicted immunogenicity, viral source protein abundance, sequence conservation, coverage of high frequency HLA alleles and co-localization of CD4+ and CD8+ T cell epitopes. B cell epitope regions were chosen from linear epitope mapping studies of convalescent patient serum, followed by filtering to select regions with surface accessibility, high sequence conservation, spatial localization near functional domains of the spike glycoprotein, and avoidance of glycosylation sites. From 58 initial candidates, three B cell epitope regions were identified. By combining these B cell and T cell analyses, as well as a manufacturability heuristic, we propose a set of SARS-CoV-2 vaccine peptides for use in subsequent murine studies and clinical trials.
RESUMO
Tumor cells accumulate aberrations not present in normal cells, leading to presentation of neoantigens on MHC molecules on their surface. These non-self neoantigens distinguish tumor cells from normal cells to the immune system and are thus targets for cancer immunotherapy. The rapid development of molecular profiling platforms, such as next-generation sequencing, has enabled the generation of large datasets characterizing tumor cells. The simultaneous development of algorithms has enabled rapid and accurate processing of these data. Bioinformatic software tools encoding the algorithms can be strung together in a workflow to identify neoantigens. Here, with a focus on high-throughput sequencing, we review state-of-the art bioinformatic tools along with the steps and challenges involved in neoantigen identification and recognition.
Assuntos
Antígenos de Neoplasias/imunologia , Biologia Computacional/métodos , Neoplasias/imunologia , Apresentação de Antígeno/imunologia , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Predicting the binding affinity of major histocompatibility complex I (MHC I) proteins and their peptide ligands is important for vaccine design. We introduce an open-source package for MHC I binding prediction, MHCflurry. The software implements allele-specific neural networks that use a novel architecture and peptide encoding scheme. When trained on affinity measurements, MHCflurry outperformed the standard predictors NetMHC 4.0 and NetMHCpan 3.0 overall and particularly on non-9-mer peptides in a benchmark of ligands identified by mass spectrometry. The released predictor, MHCflurry 1.2.0, uses mass spectrometry datasets for model selection and showed competitive accuracy with standard tools, including the recently released NetMHCpan 4.0, on a small benchmark of affinity measurements. MHCflurry's prediction speed exceeded 7,000 predictions per second, 396 times faster than NetMHCpan 4.0. MHCflurry is freely available to use, retrain, or extend, includes Python library and command line interfaces, may be installed using package managers, and applies software development best practices.
Assuntos
Previsões/métodos , Antígenos de Histocompatibilidade Classe I/genética , Ligação Proteica/imunologia , Algoritmos , Animais , Genes MHC Classe I/genética , Genes MHC Classe I/fisiologia , Antígenos de Histocompatibilidade Classe I/fisiologia , Humanos , Ligantes , Redes Neurais de Computação , Peptídeos/química , Ligação Proteica/fisiologia , SoftwareRESUMO
This paper describes the sequencing protocol and computational pipeline for the PGV-001 personalized vaccine trial. PGV-001 is a therapeutic peptide vaccine targeting neoantigens identified from patient tumor samples. Peptides are selected by a computational pipeline that identifies mutations from tumor/normal exome sequencing and ranks mutant sequences by a combination of predicted Class I MHC affinity and abundance estimated from tumor RNA. The personalized genomic vaccine (PGV) pipeline is modular and consists of independently usable tools and software libraries. We hope that the functionality of these tools may extend beyond the specifics of the PGV-001 trial and enable other research groups in their own neoantigen investigations.
RESUMO
Predictions of patient outcomes after a given therapy are fundamental to medical practice. We employ a machine learning approach towards predicting the outcomes after stereotactic radiosurgery for cerebral arteriovenous malformations (AVMs). Using three prospective databases, a machine learning approach of feature engineering and model optimization was implemented to create the most accurate predictor of AVM outcomes. Existing prognostic systems were scored for purposes of comparison. The final predictor was secondarily validated on an independent site's dataset not utilized for initial construction. Out of 1,810 patients, 1,674 to 1,291 patients depending upon time threshold, with 23 features were included for analysis and divided into training and validation sets. The best predictor had an average area under the curve (AUC) of 0.71 compared to existing clinical systems of 0.63 across all time points. On the heldout dataset, the predictor had an accuracy of around 0.74 at across all time thresholds with a specificity and sensitivity of 62% and 85% respectively. This machine learning approach was able to provide the best possible predictions of AVM radiosurgery outcomes of any method to date, identify a novel radiobiological feature (3D surface dose), and demonstrate a paradigm for further development of prognostic tools in medical care.