RESUMO
T cell-mediated immunity plays an important role in controlling SARS-CoV-2 infection, but the repertoire of naturally processed and presented viral epitopes on class I human leukocyte antigen (HLA-I) remains uncharacterized. Here, we report the first HLA-I immunopeptidome of SARS-CoV-2 in two cell lines at different times post infection using mass spectrometry. We found HLA-I peptides derived not only from canonical open reading frames (ORFs) but also from internal out-of-frame ORFs in spike and nucleocapsid not captured by current vaccines. Some peptides from out-of-frame ORFs elicited T cell responses in a humanized mouse model and individuals with COVID-19 that exceeded responses to canonical peptides, including some of the strongest epitopes reported to date. Whole-proteome analysis of infected cells revealed that early expressed viral proteins contribute more to HLA-I presentation and immunogenicity. These biological insights, as well as the discovery of out-of-frame ORF epitopes, will facilitate selection of peptides for immune monitoring and vaccine development.
Assuntos
Epitopos de Linfócito T/imunologia , Antígenos de Histocompatibilidade Classe I/imunologia , Fases de Leitura Aberta/genética , Peptídeos/imunologia , Proteoma/imunologia , SARS-CoV-2/imunologia , Células A549 , Alelos , Sequência de Aminoácidos , Animais , Apresentação de Antígeno/imunologia , COVID-19/imunologia , COVID-19/virologia , Feminino , Células HEK293 , Humanos , Cinética , Masculino , Camundongos , Peptídeos/química , Linfócitos T/imunologiaRESUMO
CD4+ T cell responses are exquisitely antigen specific and directed toward peptide epitopes displayed by human leukocyte antigen class II (HLA-II) on antigen-presenting cells. Underrepresentation of diverse alleles in ligand databases and an incomplete understanding of factors affecting antigen presentation in vivo have limited progress in defining principles of peptide immunogenicity. Here, we employed monoallelic immunopeptidomics to identify 358,024 HLA-II binders, with a particular focus on HLA-DQ and HLA-DP. We uncovered peptide-binding patterns across a spectrum of binding affinities and enrichment of structural antigen features. These aspects underpinned the development of context-aware predictor of T cell antigens (CAPTAn), a deep learning model that predicts peptide antigens based on their affinity to HLA-II and full sequence of their source proteins. CAPTAn was instrumental in discovering prevalent T cell epitopes from bacteria in the human microbiome and a pan-variant epitope from SARS-CoV-2. Together CAPTAn and associated datasets present a resource for antigen discovery and the unraveling genetic associations of HLA alleles with immunopathologies.
Assuntos
COVID-19 , Aprendizado Profundo , Humanos , Captana , SARS-CoV-2 , Antígenos HLA , Epitopos de Linfócito T , PeptídeosRESUMO
Increasing evidence indicates CD4+ T cells can recognize cancer-specific antigens and control tumor growth. However, it remains difficult to predict the antigens that will be presented by human leukocyte antigen class II molecules (HLA-II), hindering efforts to optimally target them therapeutically. Obstacles include inaccurate peptide-binding prediction and unsolved complexities of the HLA-II pathway. To address these challenges, we developed an improved technology for discovering HLA-II binding motifs and conducted a comprehensive analysis of tumor ligandomes to learn processing rules relevant in the tumor microenvironment. We profiled >40 HLA-II alleles and showed that binding motifs were highly sensitive to HLA-DM, a peptide-loading chaperone. We also revealed that intratumoral HLA-II presentation was dominated by professional antigen-presenting cells (APCs) rather than cancer cells. Integrating these observations, we developed algorithms that accurately predicted APC ligandomes, including peptides from phagocytosed cancer cells. These tools and biological insights will enable improved HLA-II-directed cancer therapies.
Assuntos
Células Apresentadoras de Antígenos/imunologia , Linfócitos T CD4-Positivos/imunologia , Vacinas Anticâncer/imunologia , Mapeamento de Epitopos/métodos , Antígenos HLA/metabolismo , Antígenos de Histocompatibilidade Classe II/genética , Imunoterapia/métodos , Espectrometria de Massas/métodos , Neoplasias/terapia , Algoritmos , Alelos , Apresentação de Antígeno , Antígenos de Neoplasias/imunologia , Antígenos de Neoplasias/metabolismo , Conjuntos de Dados como Assunto , Antígenos HLA/genética , Antígenos HLA-D/metabolismo , Humanos , Neoplasias/imunologia , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas/genética , SoftwareRESUMO
Identification of human leukocyte antigen (HLA)-bound peptides by liquid chromatography-tandem mass spectrometry (LC-MS/MS) is poised to provide a deep understanding of rules underlying antigen presentation. However, a key obstacle is the ambiguity that arises from the co-expression of multiple HLA alleles. Here, we have implemented a scalable mono-allelic strategy for profiling the HLA peptidome. By using cell lines expressing a single HLA allele, optimizing immunopurifications, and developing an application-specific spectral search algorithm, we identified thousands of peptides bound to 16 different HLA class I alleles. These data enabled the discovery of subdominant binding motifs and an integrative analysis quantifying the contribution of factors critical to epitope presentation, such as protein cleavage and gene expression. We trained neural-network prediction algorithms with our large dataset (>24,000 peptides) and outperformed algorithms trained on datasets of peptides with measured affinities. We thus demonstrate a strategy for systematically learning the rules of endogenous antigen presentation.
Assuntos
Algoritmos , Apresentação de Antígeno/imunologia , Perfilação da Expressão Gênica/métodos , Antígenos de Histocompatibilidade Classe I/imunologia , Espectrometria de Massas em Tandem/métodos , Alelos , Linhagem Celular , Cromatografia Líquida/métodos , Epitopos , Antígenos de Histocompatibilidade Classe I/genética , Humanos , Redes Neurais de Computação , Peptídeos/imunologia , Domínios e Motivos de Interação entre Proteínas/imunologiaRESUMO
Over the past 30 years, immunopeptidomics has grown alongside improvements in mass spectrometry technology, genomics, transcriptomics, T cell receptor sequencing, and immunological assays to identify and characterize the targets of activated T cells. Together, multiple research groups with expertise in immunology, biochemistry, chemistry, and peptide mass spectrometry have come together to enable the isolation and sequence identification of endogenous major histocompatibility complex (MHC)-bound peptides. The idea to apply highly sensitive mass spectrometry techniques to study the landscape of peptide antigens presented by cell surface MHCs was innovative and continues to be successfully used and improved upon to deepen our understanding of how peptide antigens are processed and presented to T cells. Multiple research groups were involved in this bringing immunopeptidomics to the forefront of translational research, and we will highlight the contributions of one of the earliest developers, Professor Donald F. Hunt, and his research group at the University of Virginia. The Hunt laboratory applied cutting edge mass spectroscopy-based immunopeptidomics to study cancer, autoimmunity, transplant rejection, and infectious diseases. Across these diverse research areas, the Hunt laboratory and collaborators would characterize previously unknown MHC peptide-binding motifs and identify immunologically active antigens using ultra sensitive mass spectrometry techniques. Amazingly, many of the MHC-bound peptide antigens discovered in collaborations with the Hunt laboratory were sequenced by mass spectrometry before the completion of the human genome using manual de novo sequencing. In this perspective article, we will chronicle the work of the Hunt laboratory and their many collaborators that would be a major part of the foundation for mass spectrometry-based immunopeptidomics and its application to immunology research.
Assuntos
Espectrometria de Massas , Peptídeos , Proteômica , Animais , Humanos , Apresentação de Antígeno/imunologia , Complexo Principal de Histocompatibilidade/imunologia , Espectrometria de Massas/história , Espectrometria de Massas/métodos , Neoplasias/imunologia , Peptídeos/imunologia , Peptídeos/metabolismo , Proteômica/história , Proteômica/métodos , História do Século XX , História do Século XXIRESUMO
Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of noncanonical sites of ribosome translation outside the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7000 noncanonical ORFs are translated, which, at first glance, has the potential to expand the number of human protein CDSs by 30%, from â¼19,500 annotated CDSs to over 26,000 annotated CDSs. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of noncanonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome but searching for guidance on how to proceed. Here, we discuss the current state of noncanonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein coding."
Assuntos
Biossíntese de Proteínas , Proteoma , Humanos , Proteoma/metabolismo , Proteômica/métodos , Perfil de Ribossomos , Ribossomos/metabolismo , Fases de Leitura AbertaRESUMO
Comprehensive and in-depth identification of the human leukocyte antigen class I (HLA-I) and class II (HLA-II) tumor immunopeptidome can inform the development of cancer immunotherapies. Mass spectrometry (MS) is a powerful technology for direct identification of HLA peptides from patient-derived tumor samples or cell lines. However, achieving sufficient coverage to detect rare and clinically relevant antigens requires highly sensitive MS-based acquisition methods and large amounts of sample. While immunopeptidome depth can be increased by off-line fractionation prior to MS, its use is impractical when analyzing limited amounts of primary tissue biopsies. To address this challenge, we developed and applied a high-throughput, sensitive, and single-shot MS-based immunopeptidomics workflow that leverages trapped ion mobility time-of-flight MS on the Bruker timsTOF single-cell proteomics system (SCP). We demonstrate greater than twofold improved coverage of HLA immunopeptidomes relative to prior methods with up to 15,000 distinct HLA-I and HLA-II peptides from 4e7 cells. Our optimized single-shot MS acquisition method on the timsTOF SCP maintains high coverage, eliminates the need for off-line fractionation, and reduces input requirements to as few as 1e6 A375 cells for >800 distinct HLA-I peptides. This depth is sufficient to identify HLA-I peptides derived from cancer-testis antigen and noncanonical proteins. We also apply our optimized single-shot SCP acquisition methods to tumor-derived samples, enabling sensitive, high-throughput, and reproducible immunopeptidome profiling with detection of clinically relevant peptides from less than 4e7 cells or 15 mg wet weight tissue.
Assuntos
Antígenos de Histocompatibilidade Classe I , Neoplasias , Masculino , Humanos , Antígenos de Histocompatibilidade Classe I/metabolismo , Espectrometria de Massas/métodos , Neoplasias/metabolismo , Peptídeos/metabolismo , Linhagem CelularRESUMO
Immunotherapies have emerged to treat diseases by selectively modulating a patient's immune response. Although the roles of T and B cells in adaptive immunity have been well studied, it remains difficult to select targets for immunotherapeutic strategies. Because human leukocyte antigen class II (HLA-II) peptides activate CD4+ T cells and regulate B cell activation, proliferation, and differentiation, these peptide antigens represent a class of potential immunotherapy targets and biomarkers. To better understand the molecular basis of how HLA-II antigen presentation is involved in disease progression and treatment, systematic HLA-II peptidomics combined with multiomic analyses of diverse cell types in healthy and diseased states is required. For this reason, MS-based innovations that facilitate investigations into the interplay between disease pathologies and the presentation of HLA-II peptides to CD4+ T cells will aid in the development of patient-focused immunotherapies.
Assuntos
Antígenos de Histocompatibilidade Classe II/imunologia , Imunoterapia , Peptídeos/imunologia , Animais , Apresentação de Antígeno , Genômica , Antígenos de Histocompatibilidade Classe II/genética , Humanos , Espectrometria de Massas , Peptídeos/genéticaRESUMO
MS is the most effective method to directly identify peptides presented on human leukocyte antigen (HLA) molecules. However, current standard approaches often use 500 million or more cells as input to achieve high coverage of the immunopeptidome, and therefore, these methods are not compatible with the often limited amounts of tissue available from clinical tumor samples. Here, we evaluated microscaled basic reversed-phase fractionation to separate HLA peptide samples offline followed by ion mobility coupled to LC-MS/MS for analysis. The combination of these two separation methods enabled identification of 20% to 50% more peptides compared with samples analyzed without either prior fractionation or use of ion mobility alone. We demonstrate coverage of HLA immunopeptidomes with up to 8107 distinct peptides starting with as few as 100 million cells. The increased sensitivity obtained using our methods can provide data useful to improve HLA-binding prediction algorithms as well as to enable detection of clinically relevant epitopes such as neoantigens.
Assuntos
Antígenos de Neoplasias/análise , Antígenos de Histocompatibilidade Classe I/análise , Peptídeos/análise , Linhagem Celular , Fracionamento Químico , Cromatografia Líquida , Humanos , Espectrometria de Mobilidade Iônica , Neoplasias/química , Espectrometria de Massas em TandemRESUMO
Mass spectrometry with data-independent acquisition (DIA) is a promising method to improve the comprehensiveness and reproducibility of targeted and discovery proteomics, in theory by systematically measuring all peptide precursors in a biological sample. However, the analytical challenges involved in discriminating between peptides with similar sequences in convoluted spectra have limited its applicability in important cases, such as the detection of single-nucleotide polymorphisms (SNPs) and alternative site localizations in phosphoproteomics data. We report Specter (https://github.com/rpeckner-broad/Specter), an open-source software tool that uses linear algebra to deconvolute DIA mixture spectra directly through comparison to a spectral library, thus circumventing the problems associated with typical fragment-correlation-based approaches. We validate the sensitivity of Specter and its performance relative to that of other methods, and show that Specter is able to successfully analyze cases involving highly similar peptides that are typically challenging for DIA analysis methods.
Assuntos
Espectrometria de Massas/métodos , Proteômica , Biblioteca de Peptídeos , Peptídeos/análise , Polimorfismo de Nucleotídeo Único , Proteoma , Reprodutibilidade dos Testes , SoftwareRESUMO
A challenge in developing personalized cancer immunotherapies is the prediction of putative cancer-specific antigens. Currently, predictive algorithms are used to infer binding of peptides to human leukocyte antigen (HLA) heterodimers to aid in the selection of putative epitope targets. One drawback of current epitope prediction algorithms is that they are trained on datasets containing biochemical HLA-peptide binding data that may not completely capture the rules associated with endogenous processing and presentation. The field of MS has made great improvements in instrumentation speed and sensitivity, chromatographic resolution, and proteogenomic database search strategies to facilitate the identification of HLA-ligands from a variety of cell types and tumor tissues. As such, these advances have enabled MS profiling of HLA-binding peptides to be a tractable, orthogonal approach to lower throughput biochemical assays for generating comprehensive datasets to train epitope prediction algorithms. In this review, we will highlight the progress made in the field of HLA-ligand profiling enabled by MS and its impact on current and future epitope prediction strategies.
Assuntos
Biologia Computacional/métodos , Epitopos/imunologia , Antígenos HLA/imunologia , Espectrometria de Massas/métodos , Proteogenômica/métodos , Epitopos/metabolismo , Antígenos HLA/metabolismo , HumanosRESUMO
Profiling post-translational modifications represents an alternative dimension to gene expression data in characterizing cellular processes. Many cellular responses to drugs are mediated by changes in cellular phosphosignaling. We sought to develop a common platform on which phosphosignaling responses could be profiled across thousands of samples, and created a targeted MS assay that profiles a reduced-representation set of phosphopeptides that we show to be strong indicators of responses to chemical perturbagens.To develop the assay, we investigated the coordinate regulation of phosphosites in samples derived from three cell lines treated with 26 different bioactive small molecules. Phosphopeptide analytes were selected from these discovery studies by clustering and picking 1 to 2 proxy members from each cluster. A quantitative, targeted parallel reaction monitoring assay was developed to directly measure 96 reduced-representation probes. Sample processing for proteolytic digestion, protein quantification, peptide desalting, and phosphopeptide enrichment have been fully automated, making possible the simultaneous processing of 96 samples in only 3 days, with a plate phosphopeptide enrichment variance of 12%. This highly reproducible process allowed â¼95% of the reduced-representation phosphopeptide probes to be detected in â¼200 samples.The performance of the assay was evaluated by measuring the probes in new samples generated under treatment conditions from discovery experiments, recapitulating the observations of deeper experiments using a fraction of the analytical effort. We measured these probes in new experiments varying the treatments, cell types, and timepoints to demonstrate generalizability. We demonstrated that the assay is sensitive to disruptions in common signaling pathways (e.g. MAPK, PI3K/mTOR, and CDK). The high-throughput, reduced-representation phosphoproteomics assay provides a platform for the comparison of perturbations across a range of biological conditions, suitable for profiling thousands of samples. We believe the assay will prove highly useful for classification of known and novel drug and genetic mechanisms through comparison of phosphoproteomic signatures.
Assuntos
Células-Tronco Embrionárias/metabolismo , Fosfoproteínas/análise , Proteômica/métodos , Bibliotecas de Moléculas Pequenas/farmacologia , Animais , Células Cultivadas , Células-Tronco Embrionárias/citologia , Ensaios de Triagem em Larga Escala , Humanos , Células MCF-7 , Camundongos , Fosfoproteínas/efeitos dos fármacos , Transdução de SinaisRESUMO
Targeted synthetic vaccines have the potential to transform our response to viral outbreaks, yet the design of these vaccines requires a comprehensive knowledge of viral immunogens. Here, we report severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) peptides that are naturally processed and loaded onto human leukocyte antigen-II (HLA-II) complexes in infected cells. We identify over 500 unique viral peptides from canonical proteins as well as from overlapping internal open reading frames. Most HLA-II peptides colocalize with known CD4+ T cell epitopes in coronavirus disease 2019 patients, including 2 reported immunodominant regions in the SARS-CoV-2 membrane protein. Overall, our analyses show that HLA-I and HLA-II pathways target distinct viral proteins, with the structural proteins accounting for most of the HLA-II peptidome and nonstructural and noncanonical proteins accounting for the majority of the HLA-I peptidome. These findings highlight the need for a vaccine design that incorporates multiple viral elements harboring CD4+ and CD8+ T cell epitopes to maximize vaccine effectiveness.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Epitopos de Linfócito T , Antígenos de Histocompatibilidade Classe I , Antígenos HLA , Antígenos de Histocompatibilidade , Linfócitos T CD8-Positivos , PeptídeosRESUMO
A major scientific drive is to characterize the protein-coding genome as it provides the primary basis for the study of human health. But the fundamental question remains: what has been missed in prior genomic analyses? Over the past decade, the translation of non-canonical open reading frames (ncORFs) has been observed across human cell types and disease states, with major implications for proteomics, genomics, and clinical science. However, the impact of ncORFs has been limited by the absence of a large-scale understanding of their contribution to the human proteome. Here, we report the collaborative efforts of stakeholders in proteomics, immunopeptidomics, Ribo-seq ORF discovery, and gene annotation, to produce a consensus landscape of protein-level evidence for ncORFs. We show that at least 25% of a set of 7,264 ncORFs give rise to translated gene products, yielding over 3,000 peptides in a pan-proteome analysis encompassing 3.8 billion mass spectra from 95,520 experiments. With these data, we developed an annotation framework for ncORFs and created public tools for researchers through GENCODE and PeptideAtlas. This work will provide a platform to advance ncORF-derived proteins in biomedical discovery and, beyond humans, diverse animals and plants where ncORFs are similarly observed.
RESUMO
Unveiling the complete proteome of viruses is crucial to our understanding of the viral life cycle and interaction with the host. We developed Massively Parallel Ribosome Profiling (MPRP) to experimentally determine open reading frames (ORFs) in 20,170 designed oligonucleotides across 679 human-associated viral genomes. We identified 5,381 ORFs, including 4,208 non-canonical ORFs, and show successful detection of both annotated coding sequences (CDSs) and reported non-canonical ORFs. By examining immunopeptidome datasets of infected cells, we found class I human leukocyte antigen (HLA-I) peptides originating from non-canonical ORFs identified through MPRP. By inspecting ribosome occupancies on the 5'UTR and CDS regions of annotated viral genes, we identified hundreds of upstream ORFs (uORFs) that negatively regulate the synthesis of canonical viral proteins. The unprecedented source of viral ORFs across a wide range of viral families, including highly pathogenic viruses, expands the repertoire of vaccine targets and exposes new cis-regulatory sequences in viral genomes.
RESUMO
Ribosome profiling (Ribo-seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of non-canonical sites of ribosome translation outside of the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7,000 non-canonical open reading frames (ORFs) are translated, which, at first glance, has the potential to expand the number of human protein-coding sequences by 30%, from â¼19,500 annotated CDSs to over 26,000. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of non-canonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome, but searching for guidance on how to proceed. Here, we discuss the current state of non-canonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein-coding". In brief: The human genome encodes thousands of non-canonical open reading frames (ORFs) in addition to protein-coding genes. As a nascent field, many questions remain regarding non-canonical ORFs. How many exist? Do they encode proteins? What level of evidence is needed for their verification? Central to these debates has been the advent of ribosome profiling (Ribo-seq) as a method to discern genome-wide ribosome occupancy, and immunopeptidomics as a method to detect peptides that are processed and presented by MHC molecules and not observed in traditional proteomics experiments. This article provides a synthesis of the current state of non-canonical ORF research and proposes standards for their future investigation and reporting. Highlights: Combined use of Ribo-seq and proteomics-based methods enables optimal confidence in detecting non-canonical ORFs and their protein products.Ribo-seq can provide more sensitive detection of non-canonical ORFs, but data quality and analytical pipelines will impact results.Non-canonical ORF catalogs are diverse and span both high-stringency and low-stringency ORF nominations.A framework for standardized non-canonical ORF evidence will advance the research field.
RESUMO
Targeted synthetic vaccines have the potential to transform our response to viral outbreaks; yet the design of these vaccines requires a comprehensive knowledge of viral immunogens, including T-cell epitopes. Having previously mapped the SARS-CoV-2 HLA-I landscape, here we report viral peptides that are naturally processed and loaded onto HLA-II complexes in infected cells. We identified over 500 unique viral peptides from canonical proteins, as well as from overlapping internal open reading frames (ORFs), revealing, for the first time, the contribution of internal ORFs to the HLA-II peptide repertoire. Most HLA-II peptides co-localized with the known CD4+ T cell epitopes in COVID-19 patients. We also observed that two reported immunodominant regions in the SARS-CoV-2 membrane protein are formed at the level of HLA-II presentation. Overall, our analyses show that HLA-I and HLA-II pathways target distinct viral proteins, with the structural proteins accounting for most of the HLA-II peptidome and non-structural and non-canonical proteins accounting for the majority of the HLA-I peptidome. These findings highlight the need for a vaccine design that incorporates multiple viral elements harboring CD4+ and CD8+ T cell epitopes to maximize the vaccine effectiveness.
RESUMO
Serial multi-omic analysis of proteome, phosphoproteome, and acetylome provides insights into changes in protein expression, cell signaling, cross-talk and epigenetic pathways involved in disease pathology and treatment. However, ubiquitylome and HLA peptidome data collection used to understand protein degradation and antigen presentation have not together been serialized, and instead require separate samples for parallel processing using distinct protocols. Here we present MONTE, a highly sensitive multi-omic native tissue enrichment workflow, that enables serial, deep-scale analysis of HLA-I and HLA-II immunopeptidome, ubiquitylome, proteome, phosphoproteome, and acetylome from the same tissue sample. We demonstrate that the depth of coverage and quantitative precision of each 'ome is not compromised by serialization, and the addition of HLA immunopeptidomics enables the identification of peptides derived from cancer/testis antigens and patient specific neoantigens. We evaluate the technical feasibility of the MONTE workflow using a small cohort of patient lung adenocarcinoma tumors.
Assuntos
Neoplasias Pulmonares , Proteoma , Masculino , Humanos , Proteoma/metabolismo , Fluxo de Trabalho , Peptídeos , Proteômica/métodosRESUMO
Immunopeptidome profiling of infected cells is a powerful technique for detecting viral peptides that are naturally processed and loaded onto class I human leukocyte antigens (HLAs-I). Here, we provide a protocol for preparing samples for immunopeptidome profiling that can inactivate enveloped viruses while still preserving the integrity of the HLA-I complex. We detail steps for lysate preparation of infected cells followed by HLA-I immunoprecipitation and virus inactivation. We further describe peptide purification for mass spectrometry outside a high-containment facility. For complete details on the use and execution of this protocol, please refer to Weingarten-Gabbay et al. (2021).1.