The Choice of Search Engine Affects Sequencing Depth and HLA Class I Allele-Specific Peptide Repertoires.

Parker, Robert; Tailor, Arun; Peng, Xu; Nicastri, Annalisa; Zerweck, Johannes; Reimer, Ulf; Wenschuh, Holger; Schnatbaum, Karsten; Ternette, Nicola

Parker, Robert; Tailor, Arun; Peng, Xu; Nicastri, Annalisa; Zerweck, Johannes; Reimer, Ulf; Wenschuh, Holger; Schnatbaum, Karsten; Ternette, Nicola.

Afiliación

Parker R; Nuffield Department of Medicine, Centre for Cellar and Medical Physiology, University of Oxford, Oxford, UK. Electronic address: robert.parker@ndm.ox.ac.uk.
Tailor A; Nuffield Department of Medicine, Centre for Cellar and Medical Physiology, University of Oxford, Oxford, UK.
Peng X; Nuffield Department of Medicine, Centre for Cellar and Medical Physiology, University of Oxford, Oxford, UK.
Nicastri A; Nuffield Department of Medicine, Centre for Cellar and Medical Physiology, University of Oxford, Oxford, UK.
Zerweck J; JPT Peptide Technologies GmbH, Berlin, Germany.
Reimer U; JPT Peptide Technologies GmbH, Berlin, Germany.
Wenschuh H; JPT Peptide Technologies GmbH, Berlin, Germany.
Schnatbaum K; JPT Peptide Technologies GmbH, Berlin, Germany.
Ternette N; Nuffield Department of Medicine, Centre for Cellar and Medical Physiology, University of Oxford, Oxford, UK. Electronic address: nicola.ternette@ndm.ox.ac.uk.

Mol Cell Proteomics ; 20: 100124, 2021.

Article en En | MEDLINE | ID: mdl-34303857

ABSTRACT

ABSTRACT

Standardization of immunopeptidomics experiments across laboratories is a pressing issue within the field, and currently a variety of different methods for sample preparation and data analysis tools are applied. Here, we compared different software packages to interrogate immunopeptidomics datasets and found that Peaks reproducibly reports substantially more peptide sequences (~30-70%) compared with Maxquant, Comet, and MS-GF+ at a global false discovery rate (FDR) of <1%. We noted that these differences are driven by search space and spectral ranking. Furthermore, we observed differences in the proportion of peptides binding the human leukocyte antigen (HLA) alleles present in the samples, indicating that sequence-related differences affected the performance of each tested engine. Utilizing data from single HLA allele expressing cell lines, we observed significant differences in amino acid frequency among the peptides reported, with a broadly higher representation of hydrophobic amino acids L, I, P, and V reported by Peaks. We validated these results using data generated with a synthetic library of 2000 HLA-associated peptides from four common HLA alleles with distinct anchor residues. Our investigation highlights that search engines create a bias in peptide sequence depth and peptide amino acid composition, and resulting data should be interpreted with caution.

Asunto(s)

Antígenos de Histocompatibilidad Clase I/química; Péptidos/química; Motor de Búsqueda; Alelos; Secuencia de Aminoácidos; Antígenos de Histocompatibilidad Clase I/genética; Humanos; Espectrometría de Masas; Biblioteca de Péptidos; Péptidos/genética; Proteómica/métodos

Palabras clave

HLA; MHC; MS search engine; database search; de novo sequencing; human leukocyte antigen; immunopeptidomics; major histocompatibility complex; peptide sequence annotation; peptide spectrum match

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Péptidos / Antígenos de Histocompatibilidad Clase I / Motor de Búsqueda Límite: Humans Idioma: En Revista: Mol Cell Proteomics Asunto de la revista: BIOLOGIA MOLECULAR / BIOQUIMICA Año: 2021 Tipo del documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google