Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 61
Filtrar
1.
ACS Chem Biol ; 15(10): 2692-2701, 2020 10 16.
Artigo em Inglês | MEDLINE | ID: mdl-32809798

RESUMO

Various biological processes at the cellular level are regulated by glycosylation which is a highly microheterogeneous post-translational modification (PTM) on proteins and lipids. The dynamic nature of glycosylation can be studied through metabolic incorporation of non-natural sugars into glycan epitopes and their detection using bio-orthogonal probes. However, this approach possesses a significant drawback due to nonspecific background reactions and ambiguity of non-natural sugar metabolism. Here, we report a probe-free strategy for their direct detection by glycoproteomics and glycomics using mass spectrometry (MS). The method dramatically simplifies the detection of non-natural functional group bearing monosaccharides installed through promiscuous sialic acid, N-acetyl-d-galactosamine (GalNAc) and N-acetyl-d-glucosamine (GlcNAc) biosynthetic pathways. Multistage enrichment of glycoproteins by cellular fractionation, subsequent ZIC-HILIC (zwitterionic-hydrophilic interaction chromatography) based glycopeptide enrichment, and a spectral enrichment algorithm for the MS data processing enabled direct detection of non-natural monosaccharides that are incorporated at low abundance on the N/O-glycopeptides along with their natural counterparts. Our approach allowed the detection of both natural and non-natural sugar bearing glycopeptides, N- and O-glycopeptides, differentiation of non-natural monosaccharide types on the glycans and also their incorporation efficiency through quantitation. Through this, we could deduce interconversion of monosaccharides during their processing through glycan salvage pathway and subsequent incorporation into glycan chains. The study of glycosylation dynamics through this method can be conducted in high throughput, as few sample processing steps are involved, enabling understanding of glycosylation dynamics under various external stimuli and thereby could bolster the use of metabolic glycan engineering in glycosylation functional studies.


Assuntos
Glicopeptídeos/análise , Glicoproteínas de Membrana/análise , Espectrometria de Massas em Tandem/métodos , Algoritmos , Sequência de Carboidratos , Linhagem Celular Tumoral , Cromatografia Líquida , Glicômica , Glicopeptídeos/metabolismo , Glicosilação , Humanos , Células Jurkat , Glicoproteínas de Membrana/química , Glicoproteínas de Membrana/metabolismo , Fragmentos de Peptídeos/análise , Fragmentos de Peptídeos/metabolismo , Processamento de Proteína Pós-Traducional , Proteólise , Proteômica , Espectrometria de Massas em Tandem/estatística & dados numéricos
2.
J Clin Endocrinol Metab ; 105(3)2020 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-31720688

RESUMO

CONTEXT: The lack of sensitive and robust analytical methods has hindered the reliable quantification of estrogen metabolites in subjects with low concentrations. OBJECTIVE: To establish sex-specific reference ranges for estrone (E1) and estradiol (E2) throughout life and to evaluate sex-differences using the state-of-the-art liquid chromatography tandem mass spectrometry (LC-MS/MS) method for quantification of E1, E2, and estriol (E3). DESIGN: LC-MS/MS method development and construction of estrogen reference ranges. SETTINGS: Population-based cross-sectional cohorts from the greater Copenhagen and Aarhus areas. PARTICIPANTS: Healthy participants aged 3 months to 61 years (n = 1838). RESULTS: An isotope diluted LC-MS/MS method was developed and validated for measurements of serum E1, E2, and E3. Limits of detections (LODs) were 3 pmol/L (E1), 4 pmol/L (E2), and 12 pmol/L (E3), respectively. This sensitive method made it possible to differentiate between male and female concentration levels of E1 and E2 in children. In girls, E2 levels ranged from

Assuntos
Cromatografia Líquida/estatística & dados numéricos , Estradiol/sangue , Estrona/sangue , Fatores Sexuais , Espectrometria de Massas em Tandem/estatística & dados numéricos , Adolescente , Adulto , Criança , Pré-Escolar , Cromatografia Líquida/métodos , Feminino , Voluntários Saudáveis , Humanos , Lactente , Masculino , Pessoa de Meia-Idade , Valores de Referência , Sensibilidade e Especificidade , Espectrometria de Massas em Tandem/métodos , Adulto Jovem
3.
Proc Natl Acad Sci U S A ; 116(49): 24408-24412, 2019 12 03.
Artigo em Inglês | MEDLINE | ID: mdl-31740593

RESUMO

The gold standard for cystic fibrosis (CF) diagnosis is the determination of chloride concentration in sweat. Current testing methodology takes up to 3 h to complete and has recognized shortcomings on its diagnostic accuracy. We present an alternative method for the identification of CF by combining desorption electrospray ionization mass spectrometry and a machine-learning algorithm based on gradient boosted decision trees to analyze perspiration samples. This process takes as little as 2 min, and we determined its accuracy to be 98 ± 2% by cross-validation on analyzing 277 perspiration samples. With the introduction of statistical bootstrap, our method can provide a confidence estimate of our prediction, which helps diagnosis decision-making. We also identified important peaks by the feature selection algorithm and assigned the chemical structure of the metabolites by high-resolution and/or tandem mass spectrometry. We inspected the correlation between mild and severe CFTR gene mutation types and lipid profiles, suggesting a possible way to realize personalized medicine with this noninvasive, fast, and accurate method.


Assuntos
Algoritmos , Cloretos/análise , Fibrose Cística/diagnóstico , Espectrometria de Massas por Ionização por Electrospray/estatística & dados numéricos , Suor/química , Estudos de Casos e Controles , Regulador de Condutância Transmembrana em Fibrose Cística/genética , Humanos , Lipídeos/análise , Lipídeos/química , Lipídeos/genética , Aprendizado de Máquina , Mutação , Estudo de Prova de Conceito , Reprodutibilidade dos Testes , Espectrometria de Massas por Ionização por Electrospray/métodos , Espectrometria de Massas em Tandem/métodos , Espectrometria de Massas em Tandem/estatística & dados numéricos
4.
Nat Biotechnol ; 37(4): 469-479, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30936560

RESUMO

Although mass spectrometry is well suited to identifying thousands of potential protein post-translational modifications (PTMs), it has historically been biased towards just a few. To measure the entire set of PTMs across diverse proteomes, software must overcome the dual challenges of covering enormous search spaces and distinguishing correct from incorrect spectrum interpretations. Here, we describe TagGraph, a computational tool that overcomes both challenges with an unrestricted string-based search method that is as much as 350-fold faster than existing approaches, and a probabilistic validation model that we optimized for PTM assignments. We applied TagGraph to a published human proteomic dataset of 25 million mass spectra and tripled confident spectrum identifications compared to its original analysis. We identified thousands of modification types on almost 1 million sites in the proteome. We show alternative contexts for highly abundant yet understudied PTMs such as proline hydroxylation, and its unexpected association with cancer mutations. By enabling broad characterization of PTMs, TagGraph informs as to how their functions and regulation intersect.


Assuntos
Bases de Dados de Proteínas/estatística & dados numéricos , Processamento de Proteína Pós-Traducional , Software , Espectrometria de Massas em Tandem/estatística & dados numéricos , Algoritmos , Sequência de Aminoácidos , Teorema de Bayes , Biotecnologia , Linhagem Celular Tumoral , Humanos , Hidroxilação , Modelos Estatísticos , Peptídeos/química , Peptídeos/genética , Proteoma , Proteômica/estatística & dados numéricos , Ferramenta de Busca , Alinhamento de Sequência/estatística & dados numéricos
5.
Brief Bioinform ; 20(1): 347-355, 2019 01 18.
Artigo em Inglês | MEDLINE | ID: mdl-30657890

RESUMO

Mass spectrometry (MS)-based proteomics has undergone rapid advancements in recent years, creating challenging problems for bioinformatics. We focus on four aspects where bioinformatics plays a crucial role (and proteomics is needed for clinical application): peptide-spectra matching (PSM) based on the new data-independent acquisition (DIA) paradigm, resolving missing proteins (MPs), dealing with biological and technical heterogeneity in data and statistical feature selection (SFS). DIA is a brute-force strategy that provides greater width and depth but, because it indiscriminately captures spectra such that signal from multiple peptides is mixed, getting good PSMs is difficult. We consider two strategies: simplification of DIA spectra to pseudo-data-dependent acquisition spectra or, alternatively, brute-force search of each DIA spectra against known reference libraries. The MP problem arises when proteins are never (or inconsistently) detected by MS. When observed in at least one sample, imputation methods can be used to guess the approximate protein expression level. If never observed at all, network/protein complex-based contextualization provides an independent prediction platform. Data heterogeneity is a difficult problem with two dimensions: technical (batch effects), which should be removed, and biological (including demography and disease subpopulations), which should be retained. Simple normalization is seldom sufficient, while batch effect-correction algorithms may create errors. Batch effect-resistant normalization methods are a viable alternative. Finally, SFS is vital for practical applications. While many methods exist, there is no best method, and both upstream (e.g. normalization) and downstream processing (e.g. multiple-testing correction) are performance confounders. We also discuss signal detection when class effects are weak.


Assuntos
Biologia Computacional/métodos , Proteômica/estatística & dados numéricos , Algoritmos , Biologia Computacional/estatística & dados numéricos , Bases de Dados de Proteínas/estatística & dados numéricos , Humanos , Peptídeos/química , Proteínas/química , Software , Espectrometria de Massas em Tandem/estatística & dados numéricos
6.
Anal Chem ; 91(2): 1335-1343, 2019 01 15.
Artigo em Inglês | MEDLINE | ID: mdl-30525483

RESUMO

Mass spectrometry (MS) has become a predominant choice for large-scale absolute protein quantification, but its quantification accuracy still has substantial room for improvement. A crucial issue is the bias between the peptide MS intensity and the actual peptide abundance, i.e., the fact that peptides with equal abundance may have different MS intensities. This bias is mainly caused by the diverse physicochemical properties of peptides. Here, we propose an algorithm for label-free absolute protein quantification, LFAQ, which can correct the biased MS intensities by using the predicted peptide quantitative factors for all identified peptides. When validated on data sets produced by different MS instruments and data acquisition modes, LFAQ presented accuracy and precision superior to those of existing methods. In particular, it reduced the quantification error by an average of 46% for low-abundance proteins. The advantages of LFAQ were further confirmed using the data from published papers.


Assuntos
Algoritmos , Peptídeos/análise , Proteínas de Saccharomyces cerevisiae/análise , Animais , Cromatografia Líquida/métodos , Células HEK293 , Humanos , Camundongos , Células RAW 264.7 , Saccharomyces cerevisiae/química , Espectrometria de Massas em Tandem/métodos , Espectrometria de Massas em Tandem/estatística & dados numéricos
7.
J Proteome Res ; 17(11): 3644-3656, 2018 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-30221945

RESUMO

To achieve accurate assignment of peptide sequences to observed fragmentation spectra, a shotgun proteomics database search tool must make good use of the very high-resolution information produced by state-of-the-art mass spectrometers. However, making use of this information while also ensuring that the search engine's scores are well calibrated, that is, that the score assigned to one spectrum can be meaningfully compared to the score assigned to a different spectrum, has proven to be challenging. Here we describe a database search score function, the "residue evidence" (res-ev) score, that achieves both of these goals simultaneously. We also demonstrate how to combine calibrated res-ev scores with calibrated XCorr scores to produce a "combined p value" score function. We provide a benchmark consisting of four mass spectrometry data sets, which we use to compare the combined p value to the score functions used by several existing search engines. Our results suggest that the combined p value achieves state-of-the-art performance, generally outperforming MS Amanda and Morpheus and performing comparably to MS-GF+. The res-ev and combined p-value score functions are freely available as part of the Tide search engine in the Crux mass spectrometry toolkit ( http://crux.ms ).


Assuntos
Algoritmos , Proteínas de Escherichia coli/química , Mapeamento de Peptídeos/estatística & dados numéricos , Peptídeos/química , Proteínas de Protozoários/química , Espectrometria de Massas em Tandem/estatística & dados numéricos , Glândulas Suprarrenais/química , Sequência de Aminoácidos , Organismos Aquáticos/química , Benchmarking , Calibragem , Misturas Complexas/química , Bases de Dados de Proteínas , Conjuntos de Dados como Assunto , Proteínas de Escherichia coli/classificação , Proteínas de Escherichia coli/isolamento & purificação , Humanos , Mapeamento de Peptídeos/métodos , Peptídeos/classificação , Peptídeos/isolamento & purificação , Plasmodium falciparum/química , Proteólise , Proteômica/métodos , Proteínas de Protozoários/classificação , Proteínas de Protozoários/isolamento & purificação , Software , Espectrometria de Massas em Tandem/métodos
8.
Brief Bioinform ; 19(5): 954-970, 2018 09 28.
Artigo em Inglês | MEDLINE | ID: mdl-28369237

RESUMO

While peptide identifications in mass spectrometry (MS)-based shotgun proteomics are mostly obtained using database search methods, high-resolution spectrum data from modern MS instruments nowadays offer the prospect of improving the performance of computational de novo peptide sequencing. The major benefit of de novo sequencing is that it does not require a reference database to deduce full-length or partial tag-based peptide sequences directly from experimental tandem mass spectrometry spectra. Although various algorithms have been developed for automated de novo sequencing, the prediction accuracy of proposed solutions has been rarely evaluated in independent benchmarking studies. The main objective of this work is to provide a detailed evaluation on the performance of de novo sequencing algorithms on high-resolution data. For this purpose, we processed four experimental data sets acquired from different instrument types from collision-induced dissociation and higher energy collisional dissociation (HCD) fragmentation mode using the software packages Novor, PEAKS and PepNovo. Moreover, the accuracy of these algorithms is also tested on ground truth data based on simulated spectra generated from peak intensity prediction software. We found that Novor shows the overall best performance compared with PEAKS and PepNovo with respect to the accuracy of correct full peptide, tag-based and single-residue predictions. In addition, the same tool outpaced the commercial competitor PEAKS in terms of running time speedup by factors of around 12-17. Despite around 35% prediction accuracy for complete peptide sequences on HCD data sets, taken as a whole, the evaluated algorithms perform moderately on experimental data but show a significantly better performance on simulated data (up to 84% accuracy). Further, we describe the most frequently occurring de novo sequencing errors and evaluate the influence of missing fragment ion peaks and spectral noise on the accuracy. Finally, we discuss the potential of de novo sequencing for now becoming more widely used in the field.


Assuntos
Algoritmos , Proteômica/métodos , Análise de Sequência de Proteína/métodos , Sequência de Aminoácidos , Animais , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados de Proteínas/estatística & dados numéricos , Humanos , Camundongos , Peptídeos/química , Proteômica/estatística & dados numéricos , Pyrococcus furiosus/genética , Saccharomyces cerevisiae/genética , Análise de Sequência de Proteína/estatística & dados numéricos , Sitios de Sequências Rotuladas , Software , Espectrometria de Massas em Tandem/métodos , Espectrometria de Massas em Tandem/estatística & dados numéricos
9.
Anal Chem ; 89(23): 12690-12697, 2017 12 05.
Artigo em Inglês | MEDLINE | ID: mdl-29125736

RESUMO

In tandem mass spectrometry (MS/MS)-based proteomics, search engines rely on comparison between an experimental MS/MS spectrum and the theoretical spectra of the candidate peptides. Hence, accurate prediction of the theoretical spectra of peptides appears to be particularly important. Here, we present pDeep, a deep neural network-based model for the spectrum prediction of peptides. Using the bidirectional long short-term memory (BiLSTM), pDeep can predict higher-energy collisional dissociation, electron-transfer dissociation, and electron-transfer and higher-energy collision dissociation MS/MS spectra of peptides with >0.9 median Pearson correlation coefficients. Further, we showed that intermediate layer of the neural network could reveal physicochemical properties of amino acids, for example the similarities of fragmentation behaviors between amino acids. We also showed the potential of pDeep to distinguish extremely similar peptides (peptides that contain isobaric amino acids, for example, GG = N, AG = Q, or even I = L), which were very difficult to distinguish using traditional search engines.


Assuntos
Aprendizado Profundo , Peptídeos/química , Espectrometria de Massas em Tandem , Bases de Dados de Proteínas/estatística & dados numéricos , Proteoma/química , Proteômica/métodos , Proteômica/estatística & dados numéricos , Espectrometria de Massas em Tandem/estatística & dados numéricos
10.
Mass Spectrom Rev ; 36(5): 634-648, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-27403644

RESUMO

Proteomics is a rapidly maturing field aimed at the high-throughput identification and quantification of all proteins in a biological system. The cornerstone of proteomic technology is tandem mass spectrometry of peptides resulting from the digestion of protein mixtures. The fragmentation pattern of each peptide ion is captured in its tandem mass spectrum, which enables its identification and acts as a fingerprint for the peptide. Spectral libraries are simply searchable collections of these fingerprints, which have taken on an increasingly prominent role in proteomic data analysis. This review describes the historical development of spectral libraries in proteomics, details the computational procedures behind library building and searching, surveys the current applications of spectral libraries, and discusses the outstanding challenges. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 36:634-648, 2017.


Assuntos
Bases de Dados de Proteínas , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Algoritmos , Animais , Mineração de Dados , História do Século XX , História do Século XXI , Peptídeos/análise , Peptídeos/química , Peptídeos/metabolismo , Filogenia , Processamento de Proteína Pós-Traducional , Software , Espectrometria de Massas em Tandem/história , Espectrometria de Massas em Tandem/estatística & dados numéricos , Interface Usuário-Computador
11.
J Am Soc Mass Spectrom ; 27(2): 194-210, 2016 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-26510657

RESUMO

Correct and rapid identification of microorganisms is the key to the success of many important applications in health and safety, including, but not limited to, infection treatment, food safety, and biodefense. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is challenging correct microbial identification because of the large number of choices present. To properly disentangle candidate microbes, one needs to go beyond apparent morphology or simple 'fingerprinting'; to correctly prioritize the candidate microbes, one needs to have accurate statistical significance in microbial identification. We meet these challenges by using peptidome profiles of microbes to better separate them and by designing an analysis method that yields accurate statistical significance. Here, we present an analysis pipeline that uses tandem MS (MS/MS) spectra for microbial identification or classification. We have demonstrated, using MS/MS data of 81 samples, each composed of a single known microorganism, that the proposed pipeline can correctly identify microorganisms at least at the genus and species levels. We have also shown that the proposed pipeline computes accurate statistical significances, i.e., E-values for identified peptides and unified E-values for identified microorganisms. The proposed analysis pipeline has been implemented in MiCId, a freely available software for Microorganism Classification and Identification. MiCId is available for download at http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads.html . Graphical Abstract ᅟ.


Assuntos
Bactérias/classificação , Espectrometria de Massas em Tandem/métodos , Espectrometria de Massas em Tandem/estatística & dados numéricos , Bactérias/química , Bases de Dados Factuais , Escherichia coli/classificação , Peptídeos/análise , Peptídeos/química , Pseudomonas aeruginosa/classificação , Software
12.
J Proteome Res ; 14(12): 5169-78, 2015 Dec 04.
Artigo em Inglês | MEDLINE | ID: mdl-26569054

RESUMO

In shotgun proteomics, peptides are typically identified using database searching, which involves scoring acquired tandem mass spectra against peptides derived from standard protein sequence databases such as Uniprot, Refseq, or Ensembl. In this strategy, the sensitivity of peptide identification is known to be affected by the size of the search space. Therefore, creating a targeted sequence database containing only peptides likely to be present in the analyzed sample can be a useful technique for improving the sensitivity of peptide identification. In this study, we describe how targeted peptide databases can be created based on the frequency of identification in the global proteome machine database (GPMDB), the largest publicly available repository of peptide and protein identification data. We demonstrate that targeted peptide databases can be easily integrated into existing proteome analysis workflows and describe a computational strategy for minimizing any loss of peptide identifications arising from potential search space incompleteness in the targeted search spaces. We demonstrate the performance of our workflow using several data sets of varying size and sample complexity.


Assuntos
Bases de Dados de Proteínas , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Bases de Dados de Ácidos Nucleicos/estatística & dados numéricos , Bases de Dados de Proteínas/estatística & dados numéricos , Células HeLa , Humanos , Células K562 , Peptídeos/química , Peptídeos/genética , Proteômica/estatística & dados numéricos , Ferramenta de Busca , Alinhamento de Sequência , Espectrometria de Massas em Tandem/estatística & dados numéricos , Fluxo de Trabalho
13.
J Proteome Res ; 14(11): 4450-62, 2015 Nov 06.
Artigo em Inglês | MEDLINE | ID: mdl-26412692

RESUMO

De novo sequencing of proteins and peptides is one of the most important problems in mass spectrometry-driven proteomics. A variety of methods have been developed to accomplish this task from a set of bottom-up tandem (MS/MS) mass spectra. However, a more recently emerged top-down technology, now gaining more and more popularity, opens new perspectives for protein analysis and characterization, implying a need for efficient algorithms to process this kind of MS/MS data. Here, we describe a method that allows for the retrieval, from a set of top-down MS/MS spectra, of long and accurate sequence fragments of the proteins contained in the sample. To this end, we outline a strategy for generating high-quality sequence tags from top-down spectra, and introduce the concept of a T-Bruijn graph by adapting to the case of tags the notion of an A-Bruijn graph widely used in genomics. The output of the proposed approach represents the set of amino acid strings spelled out by optimal paths in the connected components of a T-Bruijn graph. We illustrate its performance on top-down data sets acquired from carbonic anhydrase 2 (CAH2) and the Fab region of alemtuzumab.


Assuntos
Algoritmos , Peptídeos/isolamento & purificação , Proteômica/estatística & dados numéricos , Análise de Sequência de Proteína/estatística & dados numéricos , Espectrometria de Massas em Tandem/estatística & dados numéricos , Alemtuzumab , Sequência de Aminoácidos , Animais , Anticorpos Monoclonais Humanizados/química , Anidrase Carbônica II/química , Bovinos , Bases de Dados de Proteínas , Humanos , Fragmentos Fab das Imunoglobulinas/química , Dados de Sequência Molecular , Peptídeos/química , Proteômica/métodos , Coloração e Rotulagem/métodos
14.
Mol Cell Proteomics ; 14(5): 1411-8, 2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25713123

RESUMO

Proteomics by mass spectrometry technology is widely used for identifying and quantifying peptides and proteins. The breadth and sensitivity of peptide detection have been advanced by the advent of data-independent acquisition mass spectrometry. Analysis of such data, however, is challenging due to the complexity of fragment ion spectra that have contributions from multiple co-eluting precursor ions. We present SWATHProphet software that identifies and quantifies peptide fragment ion traces in data-independent acquisition data, provides accurate probabilities to ensure results are correct, and automatically detects and removes contributions to quantitation originating from interfering precursor ions. Integration in the widely used open source Trans-Proteomic Pipeline facilitates subsequent analyses such as combining results of multiple data sets together for improved discrimination using iProphet and inferring sample proteins using ProteinProphet. This novel development should greatly help make data-independent acquisition mass spectrometry accessible to large numbers of users.


Assuntos
Peptídeos/análise , Proteínas/análise , Proteinúria/urina , Software , Espectrometria de Massas em Tandem/estatística & dados numéricos , Humanos , Biblioteca de Peptídeos , Proteínas/química , Proteólise , Proteômica/métodos , Reprodutibilidade dos Testes , Tripsina/química
15.
Ann Clin Biochem ; 52(Pt 1): 18-38, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25313226

RESUMO

Clinical laboratory medicine has seen the introduction and evolution of liquid chromatography tandem mass spectrometry in routine clinical laboratories over the last 10-15 years. There still exists a wide diversity of assays from very esoteric and highly specialist manual assays to more simplified kit-based assays. The technology is not static as manufacturers are continually making improvements. Mass spectrometry is now commonly used in several areas of diagnostics including therapeutic drug monitoring, toxicology, endocrinology, paediatrics and microbiology. Some of the most high throughput analyses or common analytes include vitamin D, immunosuppressant monitoring, androgen measurement and newborn screening. It also offers flexibility for the measurement of analytes in a variety of different matrices which would prove difficult with immunoassays. Unlike immunoassays or high-pressure liquid chromatography assays using ultraviolet or fluorescence detection, mass spectrometry offers better specificity and reduced interferences if attention is paid to potential isobaric compounds. Furthermore, multiplexing, which enables multiple analytes to be measured with the same volume of serum is advantageous, and the requirement for large sample volumes is decreasing as instrument sensitivity increases. There are many emerging applications in the literature. Using mass spectrometry to identify novel isoforms or modified peptides is possible as is quantification of proteins and peptides, with or without protein digests. Future developments by the manufacturers may also include mechanisms to improve the throughput of samples and strategies to decrease the level of skill required by the operators.


Assuntos
Cromatografia Líquida/estatística & dados numéricos , Serviços de Laboratório Clínico , Laboratórios , Espectrometria de Massas em Tandem/estatística & dados numéricos , Androgênios/sangue , Androgênios/urina , Cromatografia Líquida/instrumentação , Humanos , Imunossupressores/sangue , Imunossupressores/urina , Recém-Nascido , Triagem Neonatal/instrumentação , Peptídeos/sangue , Peptídeos/urina , Sensibilidade e Especificidade , Espectrometria de Massas em Tandem/instrumentação , Vitamina D/sangue
16.
J Comput Biol ; 22(5): 353-66, 2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25423621

RESUMO

Tandem mass (MS/MS) spectrometry has become the method of choice for protein identification and has launched a quest for the identification of every translated protein and peptide. However, computational developments have lagged behind the pace of modern data acquisition protocols and have become a major bottleneck in proteomics analysis of complex samples. As it stands today, attempts to identify MS/MS spectra against large databases (e.g., the human microbiome or 6-frame translation of the human genome) face a search space that is 10-100 times larger than the human proteome, where it becomes increasingly challenging to separate between true and false peptide matches. As a result, the sensitivity of current state-of-the-art database search methods drops by nearly 38% to such low identification rates that almost 90% of all MS/MS spectra are left as unidentified. We address this problem by extending the generating function approach to rigorously compute the joint spectral probability of multiple spectra being matched to peptides with overlapping sequences, thus enabling the confident assignment of higher significance to overlapping peptide-spectrum matches (PSMs). We find that these joint spectral probabilities can be several orders of magnitude more significant than individual PSMs, even in the ideal case when perfect separation between signal and noise peaks could be achieved per individual MS/MS spectrum. After benchmarking this approach on a typical lysate MS/MS dataset, we show that the proposed intersecting spectral probabilities for spectra from overlapping peptides improve peptide identification by 30-62%.


Assuntos
Modelos Estatísticos , Peptídeos/análise , Proteômica/estatística & dados numéricos , Software , Espectrometria de Massas em Tandem/estatística & dados numéricos , Algoritmos , Simulação por Computador , Bases de Dados de Proteínas , Humanos , Biblioteca de Peptídeos , Proteômica/métodos
17.
Clin Chim Acta ; 437: 211-7, 2014 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-25086281

RESUMO

BACKGROUND: Disorders of homocysteine and B-vitamin metabolism represent a significant problem in clinical practice. Establishing the diagnosis requires specialized tests with demanding preanalytical requirements. To advance the detection of patients with these disorders, we developed a method for the simultaneous determination of cystathionine (Cysta), methionine (Met) and total homocysteine (tHcy) in dried blood spots (DBSs). METHODS: A punch from a DBS sample was mixed with a solution of isotopically labeled internal standards, and analytes were extracted using methanol/0.1% formic acid/0.5mol/L dithiothreitol. The extract was injected into an LC-MS/MS system operating in MRM mode. RESULTS: The analytical performance of the method employing DBS is adequate for its purpose and the type of sample. Compared with Cysta, tHcy and Met plasma levels, our method exhibited a negative bias between -3.8% and -42.2% due to the lower concentrations of these analytes in erythrocytes. The tHcy level and the Met/Cysta ratio in DBS enabled the clear detection of 12 patients with disorders of transsulfuration and with genetic and nutritional remethylation defects. CONCLUSIONS: The ease of collecting and transporting DBS samples may advance diagnostic procedures in patients with neuropsychiatric disorders and thromboembolism. Consequently, this approach may facilitate detection and simplify the monitoring of patients with homocystinuria.


Assuntos
Cistationina/sangue , Teste em Amostras de Sangue Seco/métodos , Homocisteína/sangue , Homocistinúria/sangue , Metionina/sangue , Espectrometria de Massas em Tandem/métodos , Adolescente , Criança , Pré-Escolar , Cromatografia Líquida/métodos , Cromatografia Líquida/estatística & dados numéricos , Gerenciamento Clínico , Teste em Amostras de Sangue Seco/estatística & dados numéricos , Feminino , Homocistinúria/diagnóstico , Humanos , Lactente , Recém-Nascido , Masculino , Espectrometria de Massas em Tandem/estatística & dados numéricos
18.
J Proteome Res ; 13(3): 1281-92, 2014 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-24571364

RESUMO

Researchers are increasingly turning to label-free MS1 intensity-based quantification strategies within HPLC-ESI-MS/MS workflows to reveal biological variation at the molecule level. Unfortunately, HPLC-ESI-MS/MS workflows using these strategies produce results with poor repeatability and reproducibility, primarily due to systematic bias and complex variability. While current global normalization strategies can mitigate systematic bias, they fail when faced with complex variability stemming from transient stochastic events during HPLC-ESI-MS/MS analysis. To address these problems, we developed a novel local normalization method, proximity-based intensity normalization (PIN), based on the analysis of compositional data. We evaluated PIN against common normalization strategies. PIN outperforms them in dramatically reducing variance and in identifying 20% more proteins with statistically significant abundance differences that other strategies missed. Our results show the PIN enables the discovery of statistically significant biological variation that otherwise is falsely reported or missed.


Assuntos
Peptídeos/análise , Proteômica/estatística & dados numéricos , Saliva/química , Proteínas e Peptídeos Salivares/isolamento & purificação , Cromatografia Líquida de Alta Pressão/estatística & dados numéricos , Humanos , Proteômica/métodos , Razão Sinal-Ruído , Espectrometria de Massas por Ionização por Electrospray/estatística & dados numéricos , Espectrometria de Massas em Tandem/estatística & dados numéricos
19.
J Proteome Res ; 12(12): 5548-57, 2013 Dec 06.
Artigo em Inglês | MEDLINE | ID: mdl-24147958

RESUMO

The tandem mass spectrum extraction of phosphopeptides is more difficult and error-prone than that of unmodified peptides due to their lower abundance, lower ionization efficiency, the cofragmentation with other high-abundance peptides, and the use of MS(3) on MS(2) fragments with neutral losses. However, there are still no established methods to evaluate its correctness. Here we propose to identify and correct these errors via the combinatorial use of multiple spectrum extraction tools. We evaluated five free and two commercial extraction tools using Mascot and phosphoproteomics raw data from LTQ FT Ultra, in which RawXtract 1.9.9.2 identified the highest number of unique phosphopeptides (peptide expectation value <0.05). Surprisingly, ProteoWizzard (v. 3.0.3476) extracted wrong precursor mass for most MS(3) spectra. Comparison of the top three free extraction tools showed that only 54% of the identified spectra were identified consistently from all three tools, indicating that some errors might happen during spectrum extraction. Manual check of 258 spectra not identified from all three tools revealed 405 errors of spectrum extraction with 7.4% in selecting wrong precursor charge, 50.6% in selecting wrong precursor mass, and 42.1% in exporting MS/MS fragments. We then corrected the errors by selecting the best extracted MGF file for each spectrum among the three tools for another database search. With the errors corrected, it results in the 22.4 and 12.2% increase in spectrum matches and unique peptide identification, respectively, compared with the best single method. Correction of errors in spectrum extraction improves both the sensitivity and confidence of phosphopeptide identification. Data analysis on nonphosphopeptide spectra indicates that this strategy applies to unmodified peptides as well. The identification of errors in spectrum extraction will promote the improvement of spectrum extraction tools in future.


Assuntos
Artefatos , Peptídeos/análise , Fosfopeptídeos/análise , Software , Espectrometria de Massas em Tandem/normas , Algoritmos , Bases de Dados de Proteínas , Humanos , Fosforilação , Proteômica , Espectrometria de Massas em Tandem/métodos , Espectrometria de Massas em Tandem/estatística & dados numéricos
20.
J Proteome Res ; 12(12): 5971-7, 2013 Dec 06.
Artigo em Inglês | MEDLINE | ID: mdl-24125593

RESUMO

The identification of phosphorylated proteins remains a challenge in proteomics, partially due to the difficulty in assigning tandem mass (MS/MS) spectra to their originating peptide sequences with correct phosphosite localization. Because of its advantages in efficiency and sensitivity, spectral library searching is a promising alternative to conventional sequence database searching. Our work aims to construct the largest collision-induced dissociation (CID) MS/MS spectral libraries of phosphorylated peptides in human (Homo sapiens) and four model organisms (Saccharomyces cerevisiae, Drosophila melanogaster, Caenorhabditis elegans, and Mus musculus) to date, to facilitate phosphorylated peptide identification by spectral library searching. We employed state-of-the-art search methods to published data and applied two recently published phosphorylation site localization tools (PhosphoRS and PTMProphet) to ascertain the phosphorylation sites. To further increase the coverage of this library, we predicted "semi-empirical" spectra for peptides containing known phosphorylation sites from the corresponding template unphosphorylated peptide spectra. The performance of the spectral libraries built were evaluated and found to be superior to conventional database searching in terms of sensitivity. Updated spectral libraries of phosphorylated peptides are made freely available for use with the spectral search engine SpectraST. The work flow being developed will be used to continuously update the libraries when new data become available.


Assuntos
Biblioteca de Peptídeos , Fosfopeptídeos/análise , Espectrometria de Massas em Tandem/normas , Animais , Caenorhabditis elegans/química , Caenorhabditis elegans/metabolismo , Linhagem Celular Tumoral , Drosophila melanogaster/química , Drosophila melanogaster/metabolismo , Humanos , Camundongos , Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/metabolismo , Espectrometria de Massas em Tandem/estatística & dados numéricos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA