Pesquisa | BVS Integralidade em Saúde

1.

Comparison of N-Glycopeptide to Released N-Glycan Abundances and the Influence of Glycopeptide Mass and Charge States on N-Linked Glycosylation of IgG Antibodies.

Remoroza, Concepcion A; Burke, Meghan C; Mak, Tytus D; Sheetlin, Sergey L; Mirokhin, Yuri A; Cooper, Brian T; Goecker, Zachary C; Lowenthal, Mark S; Yang, Xiaoyu; Wang, Guanghui; Tchekhovskoi, Dmitrii V; Stein, Stephen E.

J Proteome Res ; 23(4): 1443-1457, 2024 Apr 05.

Artigo em Inglês | MEDLINE | ID: mdl-38450643

RESUMO

We report the comparison of mass-spectral-based abundances of tryptic glycopeptides to fluorescence abundances of released labeled glycans and the effects of mass and charge state and in-source fragmentation on glycopeptide abundances. The primary glycoforms derived from Rituximab, NISTmAb, Evolocumab, and Infliximab were high-mannose and biantennary complex galactosylated and fucosylated N-glycans. Except for Evolocumab, in-source ions derived from the loss of HexNAc or HexNAc-Hex sugars are prominent for other therapeutic IgGs. After excluding in-source fragmentation of glycopeptide ions from the results, a linear correlation was observed between fluorescently labeled N-glycan and glycopeptide abundances over a dynamic range of 500. Different charge states of human IgG-derived glycopeptides containing a wider variety of abundant attached glycans were also investigated to examine the effects of the charge state on ion abundances. These revealed a linear dependence of glycopeptide abundance on the mass of the glycan with higher charge states favoring higher-mass glycans. Findings indicate that the mass spectrometry-based bottom-up approach can provide results as accurate as those of glycan release studies while revealing the origin of each attached glycan. These site-specific relative abundances are conveniently displayed and compared using previously described glycopeptide abundance distribution spectra "GADS" representations. Mass spectrometry data are available from the MAssIVE repository (MSV000093562).

Assuntos

Imunoglobulina G , Espectrometria de Massas em Tandem , Humanos , Glicosilação , Glicopeptídeos/análise , Polissacarídeos/química , Íons

2.

Determining Site-Specific Glycan Profiles of Recombinant SARS-CoV-2 Spike Proteins from Multiple Sources.

Burke, Meghan C; Liu, Yi; Remoroza, Concepcion; Mirokhin, Yuri A; Sheetlin, Sergey L; Tchekhovskoi, Dmitrii V; Wang, Guanghui; Yang, Xiaoyu; Stein, Stephen E.

J Proteome Res ; 22(10): 3225-3241, 2023 Oct 06.

Artigo em Inglês | MEDLINE | ID: mdl-37647588

RESUMO

Glycopeptide Abundance Distribution Spectra (GADS) were recently introduced as a means of representing, storing, and comparing glycan profiles of intact glycopeptides. Here, using that representation, an extensive analysis is made of multiple commercial sources of the recombinant SARS-CoV-2 spike protein, each containing 22 N-linked glycan sites (sequons). Multiple proteases are used along with variable energy fragmentation followed by ion trap confirmation. This enables a detailed examination of the reproducibility of the method across multiple types of variability. These results show that GADS are consistent between replicates and laboratories for sufficiently abundant glycopeptides. Derived GADS enable the examination and comparison of the glycan profiles between commercial sources of the spike protein. Multiple distinct glycopeptide distributions, generated by multiple proteases, confirm these profiles. Comparisons of GADS derived from 11 sources of recombinant spike protein reveal that sources for which protein expression methods were the same produced near-identical glycan profiles, thereby demonstrating the ability of this method to measure GADS of sufficient reliability to distinguish different glycoform distributions between commercial vendors and potentially to reliably determine and compare differences in glycosylation for any glycoprotein under different conditions of production. All mass spectrometry data files have been deposited in the MassIVE repository under the identifier MSV000091776.

3.

Mass Spectral Library Methods for Analysis of Site-Specific N-Glycosylation: Application to Human Milk Proteins.

Remoroza, Concepcion A; Burke, Meghan C; Yang, Xiaoyu; Sheetlin, Sergey; Mirokhin, Yuri; Markey, Sanford P; Tchekhovskoi, Dmitrii V; Stein, Stephen E.

J Proteome Res ; 21(10): 2421-2434, 2022 Oct 07.

Artigo em Inglês | MEDLINE | ID: mdl-36112477

RESUMO

We present a mass spectral library-based method for analyzing site-specific N-linked protein glycosylation. Its operation and utility are illustrated by applying it to both newly measured and available proteomics data of human milk glycoproteins. It generates two varieties of mass spectral libraries. One contains glycopeptide abundance distribution spectra (GADS). The other contains tandem mass spectra of the underlying glycopeptides. Both originate from identified glycopeptides in proteolytic digests of human milk and purified glycoproteins, which include tenascin, lactoferrin, and several antibodies. Analysis was also applied to digests of a NIST human milk standard reference material (SRM), leading to a GADS library of N-glycopeptides, enabling the direct comparison of glycopeptide distributions for individual proteins. Tandem spectra underlying each glycopeptide GADS peak are combined to create a second type of library that contains spectra of the underlying glycopeptide spectra. These were acquired by higher-energy (stepped) collision dissociation fragmentation followed by ion-trap fragmentation. Spectra are annotated using MS_Piano, recently reported annotation software. This data, with extensions of a widely used spectral library search and display software, provides accessible mass spectral libraries.

Assuntos

Proteínas do Leite , Leite Humano , Glicopeptídeos/análise , Glicoproteínas/metabolismo , Glicosilação , Humanos , Lactoferrina/metabolismo , Proteínas do Leite/metabolismo , Leite Humano/química , Tenascina/metabolismo

4.

Representing and Comparing Site-Specific Glycan Abundance Distributions of Glycoproteins.

Remoroza, Concepcion A; Burke, Meghan C; Liu, Yi; Mirokhin, Yuri A; Tchekhovskoi, Dmitrii V; Yang, Xiaoyu; Stein, Stephen E.

J Proteome Res ; 20(9): 4475-4486, 2021 09 03.

Artigo em Inglês | MEDLINE | ID: mdl-34327998

RESUMO

A method for representing and comparing distributions of N-linked glycans located at specific sites on proteins is presented. The representation takes the form of a simple mass spectrum for a given peptide sequence, with each peak corresponding to a different glycopeptide. The mass (in place of m/z) of each peak is that of the glycan mass, and its abundance corresponds to its relative abundance in the electrospray MS1 spectrum. This provides a facile means of representing all identifiable glycopeptides arising from a single protein "sequon" on a specific sequence, thereby enabling the comparison and searching of these distributions as routinely done for mass spectra. Likewise, these reference glycopeptide abundance distribution spectra (GADS) can be stored in searchable libraries. A set of such libraries created from available data is provided along with an adapted version of the widely used NIST-MS library-search software. Since GADS contain only MS1 abundances and identifications, they are equally suitable for expressing collision-induced fragmentation and electron-transfer dissociation determinations of glycopeptide identity. Comparisons of GADS for N-glycosylated sites on several proteins, especially the SARS-CoV-2 spike protein, demonstrate the potential reproducibility of GADS and their utility for comparing site-specific distributions.

Assuntos

COVID-19 , Glicopeptídeos/metabolismo , Glicoproteínas , Glicosilação , Humanos , Polissacarídeos , Reprodutibilidade dos Testes , SARS-CoV-2 , Glicoproteína da Espícula de Coronavírus

5.

MS_Piano: A Software Tool for Annotating Peaks in CID Tandem Mass Spectra of Peptides and N-Glycopeptides.

Yang, Xiaoyu; Neta, Pedatsur; Mirokhin, Yuri A; Tchekhovskoi, Dmitrii V; Remoroza, Concepcion A; Burke, Meghan C; Liang, Yuxue; Markey, Sanford P; Stein, Stephen E.

J Proteome Res ; 20(9): 4603-4609, 2021 Sep 03.

Artigo em Inglês | MEDLINE | ID: mdl-34264676

RESUMO

Annotating product ion peaks in tandem mass spectra is essential for evaluating spectral quality and validating peptide identification. This task is more complex for glycopeptides and is crucial for the confident determination of glycosylation sites in glycoproteins. MS_Piano (Mass Spectrum Peptide Annotation) software was developed for reliable annotation of peaks in collision induced dissociation (CID) tandem mass spectra of peptides or N-glycopeptides for given peptide sequences, charge states, and optional modifications. The program annotates each peak in high or low resolution spectra with possible product ion(s) and the mass difference between the measured and theoretical m/z values. Spectral quality is measured by two major parameters: the ratio between the sum of unannotated vs all peak intensities in the top 20 peaks, and the intensity of the highest unannotated peak. The product ions of peptides, glycans, and glycopeptides in spectra are labeled in different class-type colors to facilitate interpretation. MS_Piano assists validating peptide and N-glycopeptide identification from database and library searches and provides quality control and optimizes search reliability in custom developed peptide mass spectral libraries. The software is freely available in .exe and .dll formats for the Windows operating system.

Assuntos

Glicopeptídeos , Proteômica , Reprodutibilidade dos Testes , Software , Espectrometria de Massas em Tandem

6.

Creation and filtering of a recurrent spectral library of CHO cell metabolites and media components.

Telu, Kelly H; Marupaka, Ramesh; Andriamaharavo, Nirina R; Simón-Manso, Yamil; Liang, Yuxue; Mirokhin, Yuri A; Bukhari, Tallat H; Preston, Renae J; Kashi, Lila; Kelman, Zvi; Stein, Stephen E.

Biotechnol Bioeng ; 118(4): 1491-1510, 2021 04.

Artigo em Inglês | MEDLINE | ID: mdl-33404064

RESUMO

This paper reports the first implementation of a new type of mass spectral library for the analysis of Chinese hamster ovary (CHO) cell metabolites that allows users to quickly identify most compounds in any complex metabolite sample. We also describe an annotation methodology developed to filter out artifacts and low-quality spectra from recurrent unidentified spectra of metabolites. CHO cells are commonly used to produce biological therapeutics. Metabolic profiles of CHO cells and media can be used to monitor process variability and look for markers that discriminate between batches of product. We have created a comprehensive library of both identified and unidentified metabolites derived from CHO cells that can be used in conjunction with tandem mass spectrometry to identify metabolites. In addition, we present a workflow that can be used for assigning confidence to a NIST MS/MS Library search match based on prior probability of general utility. The goal of our work is to annotate and identify (when possible), all liquid chromatography-mass spectrometry generated metabolite ions as well as create automatable library building and identification pipelines for use by others in the field.

Assuntos

Metaboloma , Metabolômica , Bibliotecas de Moléculas Pequenas , Animais , Células CHO , Cricetulus , Meios de Cultura/química

7.

Mass Spectral Library of Acylcarnitines Derived from Human Urine.

Yan, Xinjian; Markey, Sanford P; Marupaka, Ramesh; Dong, Qian; Cooper, Brian T; Mirokhin, Yuri A; Wallace, William E; Stein, Stephen E.

Anal Chem ; 92(9): 6521-6528, 2020 05 05.

Artigo em Inglês | MEDLINE | ID: mdl-32271007

RESUMO

We describe the creation of a mass spectral library of acylcarnitines and conjugated acylcarnitines from the LC-MS/MS analysis of six NIST urine reference materials. To recognize acylcarnitines, we conducted in-depth analyses of fragmentation patterns of acylcarnitines and developed a set of rules, derived from spectra in the NIST17 Tandem MS Library and those identified in urine, using the newly developed hybrid search method. Acylcarnitine tandem spectra were annotated with fragments from carnitine and acyl moieties as well as neutral loss peaks from precursors. Consensus spectra were derived from spectra having similar retention time, fragmentation pattern, and the same precursor m/z and collision energy. The library contains 157 different precursor masses, 586 unique acylcarnitines, and 4â¯332 acylcarnitine consensus spectra. Furthermore, from spectra that partially satisfied the fragmentation rules of acylcarnitines, we identified 125 conjugated acylcarnitines represented by 987 consensus spectra, which appear to originate from Phase II biotransformation reactions. To our knowledge, this is the first report of conjugated acylcarnitines. The mass spectra provided by this work may be useful for clinical screening of acylcarnitines as well as for studying relationships among fragmentation patterns, collision energies, structures, and retention times of acylcarnitines. Further, these methods are extensible to other classes of metabolites.

Assuntos

Carnitina/análogos & derivados , Carnitina/química , Carnitina/metabolismo , Carnitina/urina , Cromatografia Líquida , Humanos , Estrutura Molecular , Espectrometria de Massas em Tandem

8.

Increasing the Coverage of a Mass Spectral Library of Milk Oligosaccharides Using a Hybrid-Search-Based Bootstrapping Method and Milks from a Wide Variety of Mammals.

Remoroza, Concepcion Africano; Liang, Yuxue; Mak, Tytus D; Mirokhin, Yuri; Sheetlin, Sergey L; Yang, Xiaoyu; San Andres, Joice V; Power, Michael L; Stein, Stephen E.

Anal Chem ; 92(15): 10316-10326, 2020 08 04.

Artigo em Inglês | MEDLINE | ID: mdl-32639750

RESUMO

This study significantly expands both the scope and method of identification for construction of a previously reported tandem mass spectral library of 74 human milk oligosaccharides (HMOs) derived from results of combined LC-MS/MS experiments and comprehensive structural analysis of HMOs. In the present work, a hybrid search "bootstrap" identification method was employed that substantially broadens the coverage of milk oligosaccharides and thereby increases utility use of a spectrum library-based method for the rapid tentative identification of all distinguishable glycans in milk. This involved hybrid searching of the previous library, which was itself constructed using the hybrid search of oligosaccharide spectra in the NIST 17 Tandem MS Library. The general approach appears applicable to library construction of other classes of compounds. The coverage of oligosaccharides was significantly extended using milks from a variety of mammals, including bovine, Asian buffalo, African lion, and goat. This new method led to the identification of another 145 oligosaccharides, including an additional 80 HMOs from reanalysis of human milk. The newly identified compounds were added to a freely available mass spectral reference database of 219 milk oligosaccharides. We also provide suggestions to overcome several limitations and pitfalls in the interpretation of spectra of unknown oligosaccharides.

Assuntos

Mamíferos , Leite Humano/química , Leite/química , Oligossacarídeos/química , Bibliotecas de Moléculas Pequenas , Animais , Humanos , Especificidade da Espécie , Espectrometria de Massas em Tandem

9.

False Discovery Rate Estimation for Hybrid Mass Spectral Library Search Identifications in Bottom-up Proteomics.

Burke, Meghan C; Zhang, Zheng; Mirokhin, Yuri A; Tchekovskoi, Dmitrii V; Liang, Yuxue; Stein, Stephen E.

J Proteome Res ; 18(9): 3223-3234, 2019 09 06.

Artigo em Inglês | MEDLINE | ID: mdl-31364354

RESUMO

We present a method for FDR estimation of mass spectral library search identifications made by a recently developed method for peptide identification, the hybrid search, based on an extension of the target-decoy approach. In addition to estimating confidence for a given identification, this allows users to compare and integrate identifications from the hybrid mass spectral library search method with other peptide identification methods, such as a sequence database-based method. In addition to a score, each hybrid score is associated with a "DeltaMass" value, which is the difference in mass of the search and library peptide, which can correspond to the mass of a modification. We explored the relation between FDR and DeltaMass using 100 concatenated random decoy libraries and discovered that a small number of DeltaMass values were especially likely to result from decoy searches. Using these values, FDR values could be adjusted for these specific values and a reliable FDR generated for any DeltaMass value. Finally, using this method, we find and examine common, reliable identifications made by the hybrid search for a range of proteomic studies.

Assuntos

Bases de Dados de Proteínas , Peptídeos/genética , Proteômica/métodos , Algoritmos , Biblioteca de Peptídeos , Peptídeos/classificação , Software , Espectrometria de Massas em Tandem

10.

Hybrid Search: A Method for Identifying Metabolites Absent from Tandem Mass Spectrometry Libraries.

Cooper, Brian T; Yan, Xinjian; Simón-Manso, Yamil; Tchekhovskoi, Dmitrii V; Mirokhin, Yuri A; Stein, Stephen E.

Anal Chem ; 91(21): 13924-13932, 2019 11 05.

Artigo em Inglês | MEDLINE | ID: mdl-31600070

RESUMO

Metabolomics has a critical need for better tools for mass spectral identification. Common metabolites may be identified by searching libraries of tandem mass spectra, which offers important advantages over other approaches to identification. But tandem libraries are not nearly complete enough to represent the full molecular diversity present in complex biological samples. We present a novel hybrid search method that can help identify metabolites not in the library by similarity to compounds that are. We call it "hybrid" searching because it combines conventional, direct peak matching with the logical equivalent of neutral-loss matching. A successful hybrid search requires the library to contain "cognates" of the unknown: similar compounds with a structural difference confined to a single region of the molecule, that does not substantially alter its fragmentation behavior. We demonstrate that the hybrid search is highly likely to find similar compounds under such circumstances.

Assuntos

Bases de Dados Factuais , Metabolômica/métodos , Espectrometria de Massas em Tandem , Fragmentos de Peptídeos/química , Proteômica/métodos

11.

Mass Spectrometry Fingerprints of Small-Molecule Metabolites in Biofluids: Building a Spectral Library of Recurrent Spectra for Urine Analysis.

Simón-Manso, Yamil; Marupaka, Ramesh; Yan, Xinjian; Liang, Yuxue; Telu, Kelly H; Mirokhin, Yuri; Stein, Stephen E.

Anal Chem ; 91(18): 12021-12029, 2019 09 17.

Artigo em Inglês | MEDLINE | ID: mdl-31424920

RESUMO

A large fraction of ions observed in electrospray liquid chromatography-mass spectrometry (LC-ESI-MS) experiments of biological samples remain unidentified. One of the main reasons for this is that spectral libraries of pure compounds fail to account for the complexity of the metabolite profiling of complex materials. Recently, the NIST Mass Spectrometry Data Center has been developing a novel type of searchable mass spectral library that includes all recurrent unidentified spectra found in the sample profile. These libraries, in conjunction with the NIST tandem mass spectral library, allow analysts to explore most of the chemical space accessible to LC-MS analysis. In this work, we demonstrate how these libraries can provide a reliable fingerprint of the material by applying them to a variety of urine samples, including an extremely altered urine from cancer patients undergoing total body irradiation. The same workflow is applicable to any other biological fluid. The selected class of acylcarnitines is examined in detail, and derived libraries and related software are freely available. They are intended to serve as online resources for continuing community review and improvement.

Assuntos

Líquidos Corporais/química , Carnitina/análogos & derivados , Neoplasias/urina , Bibliotecas de Moléculas Pequenas/análise , Carnitina/urina , Cromatografia Líquida , Humanos , Espectrometria de Massas , Software

12.

Reverse and Random Decoy Methods for False Discovery Rate Estimation in High Mass Accuracy Peptide Spectral Library Searches.

Zhang, Zheng; Burke, Meghan; Mirokhin, Yuri A; Tchekhovskoi, Dmitrii V; Markey, Sanford P; Yu, Wen; Chaerkady, Raghothama; Hess, Sonja; Stein, Stephen E.

J Proteome Res ; 17(2): 846-857, 2018 02 02.

Artigo em Inglês | MEDLINE | ID: mdl-29281288

RESUMO

Spectral library searching (SLS) is an attractive alternative to sequence database searching (SDS) for peptide identification due to its speed, sensitivity, and ability to include any selected mass spectra. While decoy methods for SLS have been developed for low mass accuracy peptide spectral libraries, it is not clear that they are optimal or directly applicable to high mass accuracy spectra. Therefore, we report the development and validation of methods for high mass accuracy decoy libraries. Two types of decoy libraries were found to be suitable for this purpose. The first, referred to as Reverse, constructs spectra by reversing a library's peptide sequences except for the C-terminal residue. The second, termed Random, randomly replaces all non-C-terminal residues and either retains the original C-terminal residue or replaces it based on the amino-acid frequency of the library's C-terminus. In both cases the m/z values of fragment ions are shifted accordingly. Determination of FDR is performed in a manner equivalent to SDS, concatenating a library with its decoy prior to a search. The utility of Reverse and Random libraries for target-decoy SLS in estimating false-positives and FDRs was demonstrated using spectra derived from a recently published synthetic human proteome project (Zolg, D. P.; et al. Nat. Methods 2017, 14, 259-262). For data sets from two large-scale label-free and iTRAQ experiments, these decoy building methods yielded highly similar score thresholds and spectral identifications at 1% FDR. The results were also found to be equivalent to those of using the decoy-free PeptideProphet algorithm. Using these new methods for FDR estimation, MSPepSearch, which is freely available search software, led to 18% more identifications at 1% FDR and 23% more at 0.1% FDR when compared with other widely used SDS engines coupled to postprocessing approaches such as Percolator. An application of these methods for FDR estimation for the recently reported "hybrid" library search (Burke, M. C.; et al. J. Proteome Res. 2017, 16, 1924-1935) method is also made. The application of decoy methods for high mass accuracy SLS permits the merging of these results with those of SDS, thereby increasing the assignment of more peptides, leading to deeper proteome coverage.

Assuntos

Algoritmos , Aminoácidos/química , Processamento de Imagem Assistida por Computador/estatística & dados numéricos , Bibliotecas Especializadas/métodos , Biblioteca de Peptídeos , Peptídeos/análise , Sequência de Aminoácidos , Humanos , Peptídeos/química , Software , Espectrometria de Massas em Tandem

13.

Creating a Mass Spectral Reference Library for Oligosaccharides in Human Milk.

Remoroza, Connie A; Mak, Tytus D; De Leoz, Maria Lorna A; Mirokhin, Yuri A; Stein, Stephen E.

Anal Chem ; 90(15): 8977-8988, 2018 08 07.

Artigo em Inglês | MEDLINE | ID: mdl-29969231

RESUMO

We report the development and availability of a mass spectral reference library for oligosaccharides in human milk. This represents a new variety of spectral library that includes consensus spectra of compounds annotated through various data analysis methods, a concept that can be extended to other varieties of biological fluids. Oligosaccharides from the NIST Standard Reference Material (SRM) 1953, composed of human milk pooled from 100 breastfeeding mothers, were identified and characterized using hydrophilic interaction liquid chromatography electrospray ionization tandem mass spectrometry (HILIC-ESI-MS/MS) and the NIST 17 Tandem MS Library. Consensus reference spectra were generated, incorporated into a searchable library, and matched using the newly developed hybrid search algorithm to elucidate unknown oligosaccharides. The NIST hybrid search program facilitates the structural assignment of complex oligosaccharides especially when reference standards are not commercially available. High accuracy mass measurement for precursor and product ions, as well as the relatively high MS/MS signal intensities of various oligosaccharide precursors with Fourier transform ion trap (FT-IT) and higher energy dissociation (HCD) fragmentation techniques, enabled the assignment of multiple free and underivatized fucosyllacto- and sialyllacto-oligosaccharide spectra. Neutral and sialylated isomeric oligosaccharides have distinct retention times, allowing the identification of 74 oligosaccharides in the reference material. This collection of newly characterized spectra based on a searchable, reference MS library of annotated oligosaccharides can be applied to analyze similar compounds in other types of milk or any biological fluid containing milk oligosaccharides.

Assuntos

Leite Humano/química , Oligossacarídeos/análise , Espectrometria de Massas por Ionização por Electrospray/métodos , Espectrometria de Massas em Tandem/métodos , Cromatografia Líquida de Alta Pressão/métodos , Cromatografia Líquida de Alta Pressão/normas , Humanos , Padrões de Referência , Espectrometria de Massas por Ionização por Electrospray/normas , Espectrometria de Massas em Tandem/normas

14.

The Hybrid Search: A Mass Spectral Library Search Method for Discovery of Modifications in Proteomics.

Burke, Meghan C; Mirokhin, Yuri A; Tchekhovskoi, Dmitrii V; Markey, Sanford P; Heidbrink Thompson, Jenny; Larkin, Christopher; Stein, Stephen E.

J Proteome Res ; 16(5): 1924-1935, 2017 05 05.

Artigo em Inglês | MEDLINE | ID: mdl-28367633

RESUMO

We present a mass spectral library-based method to identify tandem mass spectra of peptides that contain unanticipated modifications and amino acid variants. We describe this as a "hybrid" method because it combines matching both ion m/z and mass losses. The mass loss is the difference between the mass of an ion peak and the mass of its precursor. This difference, termed DeltaMass, is used to shift the product ions in the library spectrum that contain the modification, thereby allowing library product ions that contain the unexpected modification to match the query spectrum. Clustered unidentified spectra from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and Chinese hamster ovary cells were used to evaluate this method. The results demonstrate the ability of the hybrid method to identify unanticipated modifications, insertions, and deletions, which may include those due to an incomplete protein sequence database or to search settings that exclude the correct identification, in high-resolution tandem mass spectra without regard to their precursor mass. This has been made possible by indexing of the m/z value of each fragment ion and its difference in mass from its precursor ion.

Assuntos

Algoritmos , Bases de Dados de Proteínas , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Animais , Células CHO , Linhagem Celular Tumoral , Cricetulus , Bases de Dados Factuais , Humanos , Íons , Peso Molecular , Proteômica/normas

15.

Interconversion of Peptide Mass Spectral Libraries Derivatized with iTRAQ or TMT Labels.

Zhang, Zheng; Yang, Xiaoyu; Mirokhin, Yuri A; Tchekhovskoi, Dmitrii V; Ji, Weihua; Markey, Sanford P; Roth, Jeri; Neta, Pedatsur; Hizal, Deniz Baycin; Bowen, Michael A; Stein, Stephen E.

J Proteome Res ; 15(9): 3180-7, 2016 09 02.

Artigo em Inglês | MEDLINE | ID: mdl-27386737

RESUMO

Derivitization of peptides with isobaric tags such as iTRAQ and TMT is widely employed in proteomics due to their compatibility with multiplex quantitative measurements. We recently made publicly available a large peptide library derived from iTRAQ 4-plex labeled spectra. This resource has not been used for identifying peptides labeled with related tags with different masses, because values for virtually all masses of precursor and most product ions would differ for ions containing the different tags as well as containing different tag-specific peaks. We describe a method for interconverting spectra from iTRAQ 4-plex to TMT (6- and 10-plex) and to iTRAQ 8-plex. We interconvert spectra by appropriately mass shifting sequence ions and discarding derivative-specific peaks. After this "cleaning" of search spectra, we demonstrate that the converted libraries perform well in terms of peptide spectral matches. This is demonstrated by comparing results using sequence database searches as well as by comparing search effectiveness using original and converted libraries. At 1% FDR TMT labeled query spectra match 97% as many spectra against a converted iTRAQ library as compared to an original TMT library. Overall this interconversion strategy provides a practical way to extend results from one derivatization method to others that share related chemistry and do not significantly alter fragmentation profiles.

Assuntos

Biblioteca de Peptídeos , Proteômica/métodos , Bases de Dados de Proteínas , Espectrometria de Massas , Peso Molecular , Coloração e Rotulagem

16.

A Description of the Clinical Proteomic Tumor Analysis Consortium (CPTAC) Common Data Analysis Pipeline.

Rudnick, Paul A; Markey, Sanford P; Roth, Jeri; Mirokhin, Yuri; Yan, Xinjian; Tchekhovskoi, Dmitrii V; Edwards, Nathan J; Thangudu, Ratna R; Ketchum, Karen A; Kinsinger, Christopher R; Mesri, Mehdi; Rodriguez, Henry; Stein, Stephen E.

J Proteome Res ; 15(3): 1023-32, 2016 Mar 04.

Artigo em Inglês | MEDLINE | ID: mdl-26860878

RESUMO

The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has produced large proteomics data sets from the mass spectrometric interrogation of tumor samples previously analyzed by The Cancer Genome Atlas (TCGA) program. The availability of the genomic and proteomic data is enabling proteogenomic study for both reference (i.e., contained in major sequence databases) and nonreference markers of cancer. The CPTAC laboratories have focused on colon, breast, and ovarian tissues in the first round of analyses; spectra from these data sets were produced from 2D liquid chromatography-tandem mass spectrometry analyses and represent deep coverage. To reduce the variability introduced by disparate data analysis platforms (e.g., software packages, versions, parameters, sequence databases, etc.), the CPTAC Common Data Analysis Platform (CDAP) was created. The CDAP produces both peptide-spectrum-match (PSM) reports and gene-level reports. The pipeline processes raw mass spectrometry data according to the following: (1) peak-picking and quantitative data extraction, (2) database searching, (3) gene-based protein parsimony, and (4) false-discovery rate-based filtering. The pipeline also produces localization scores for the phosphopeptide enrichment studies using the PhosphoRS program. Quantitative information for each of the data sets is specific to the sample processing, with PSM and protein reports containing the spectrum-level or gene-level ("rolled-up") precursor peak areas and spectral counts for label-free or reporter ion log-ratios for 4plex iTRAQ. The reports are available in simple tab-delimited formats and, for the PSM-reports, in mzIdentML. The goal of the CDAP is to provide standard, uniform reports for all of the CPTAC data to enable comparisons between different samples and cancer types as well as across the major omics fields.

Assuntos

Neoplasias/diagnóstico , Neoplasias/metabolismo , Proteômica , Biomarcadores Tumorais/metabolismo , Humanos , Proteoma/metabolismo

17.

Tandem mass spectral libraries of peptides in digests of individual proteins: Human Serum Albumin (HSA).

Dong, Qian; Yan, Xinjian; Kilpatrick, Lisa E; Liang, Yuxue; Mirokhin, Yuri A; Roth, Jeri S; Rudnick, Paul A; Stein, Stephen E.

Mol Cell Proteomics ; 13(9): 2435-49, 2014 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-24889059

RESUMO

This work presents a method for creating a mass spectral library containing tandem spectra of identifiable peptide ions in the tryptic digestion of a single protein. Human serum albumin (HSA(1)) was selected for this purpose owing to its ubiquity, high level of characterization and availability of digest data. The underlying experimental data consisted of â¼3000 one-dimensional LC-ESI-MS/MS runs with ion-trap fragmentation. In order to generate a wide range of peptides, studies covered a broad set of instrument and digestion conditions using multiple sources of HSA and trypsin. Computer methods were developed to enable the reliable identification and reference spectrum extraction of all peptide ions identifiable by current sequence search methods. This process made use of both MS2 (tandem) spectra and MS1 (electrospray) data. Identified spectra were generated for 2918 different peptide ions, using a variety of manually-validated filters to ensure spectrum quality and identification reliability. The resulting library was composed of 10% conventional tryptic and 29% semitryptic peptide ions, along with 42% tryptic peptide ions with known or unknown modifications, which included both analytical artifacts and post-translational modifications (PTMs) present in the original HSA. The remaining 19% contained unexpected missed-cleavages or were under/over alkylated. The methods described can be extended to create equivalent spectral libraries for any target protein. Such libraries have a number of applications in addition to their known advantages of speed and sensitivity, including the ready re-identification of known PTMs, rejection of artifact spectra and a means of assessing sample and digestion quality.

Assuntos

Biblioteca de Peptídeos , Albumina Sérica/química , Cromatografia Líquida , Humanos , Proteólise , Espectrometria de Massas por Ionização por Electrospray , Espectrometria de Massas em Tandem , Tripsina/química

18.

Correction to Mass Spectrometry Fingerprints of Small-Molecule Metabolites in Biofluids: Building a Spectral Library of Recurrent Spectra for Urine Analysis.

Simón-Manso, Yamil; Marupaka, Ramesh; Yan, Xinjian; Liang, Yuxue; Telu, Kelly H; Mirokhin, Yuri; Stein, Stephen E.

Anal Chem ; 92(1): 1643, 2020 Jan 07.

Artigo em Inglês | MEDLINE | ID: mdl-31859486

19.

Creation of libraries of recurring mass spectra from large data sets assisted by a dual-column workflow.

Mallard, W Gary; Andriamaharavo, N Rabe; Mirokhin, Yuri A; Halket, John M; Stein, Stephen E.

Anal Chem ; 86(20): 10231-8, 2014 Oct 21.

Artigo em Inglês | MEDLINE | ID: mdl-25233296

RESUMO

An analytical methodology has been developed for extracting recurrent unidentified spectra (RUS) from large GC/MS data sets. Spectra were first extracted from original data files by the Automated Mass Spectral Deconvolution and Identification System (AMDIS; Stein, S. E. J. Am. Soc. Mass Spectrom. 1999 , 10 , 770 - 781 ) using settings designed to minimize spurious spectra, followed by searching the NIST library with all unidentified spectra. The spectra that could not be identified were then filtered to remove poorly deconvoluted data and clustered. The results were assumed to be unidentified components. This was tested by requiring each unidentified spectrum to be found in two chromatographic columns with slightly different stationary phases. This methodology has been applied to a large set of pediatric urine samples. A library of spectra and retention indices for derivatized urine components, both identified and recurrent unidentified, has been created and is available for download.

Assuntos

Cromatografia Gasosa-Espectrometria de Massas , Bibliotecas de Moléculas Pequenas/química , Ácido Cítrico/química , Humanos , Urina/química

20.

Sensitive Method for the Confident Identification of Genetically Variant Peptides in Human Hair Keratin.

Zhang, Zheng; Burke, Meghan C; Wallace, William E; Liang, Yuxue; Sheetlin, Sergey L; Mirokhin, Yuri A; Tchekhovskoi, Dmitrii V; Stein, Stephen E.

J Forensic Sci ; 65(2): 406-420, 2020 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-31670846

RESUMO

Recent reports have demonstrated that genetically variant peptides derived from human hair shaft proteins can be used to differentiate individuals of different biogeographic origins. We report a method involving direct extraction of hair shaft proteins more sensitive than previously published methods regarding GVP detection. It involves one step for protein extraction and was found to provide reproducible results. A detailed proteomic analysis of this data is presented that led to the following four results: (i) A peptide spectral library was created and made available for download. It contains all identified peptides from this work, including GVPs that, when appropriately expanded with diverse hair-derived peptides, can provide a routine, reliable, and sensitive means of analyzing hair digests; (ii) an analysis of artifact peptides arising from side reactions is also made using a new method for finding unexpected modifications; (iii) detailed analysis of the gel-based method employed clearly shows the high degree of cross-linking or protein association involved in hair digestion, with major GVPs eluting over a wide range of high molecular weights while others apparently arise from distinct non-cross-linked proteins; and (v) finally, we show that some of the specific GVP identifications depend on the sample preparation method.

Assuntos

Cabelo/metabolismo , Queratinas Específicas do Cabelo/metabolismo , Peptídeos/metabolismo , Proteoma/metabolismo , Artefatos , Cromatografia Líquida , Bases de Dados de Proteínas , Medicina Legal , Humanos , Masculino , Espectrometria de Massas , Proteômica , Reprodutibilidade dos Testes

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

Detalhe da pesquisa