Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nat Methods ; 8(7): 587-91, 2011 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-21572408

RESUMO

Tandem mass spectrometry (MS/MS) experiments yield multiple, nearly identical spectra of the same peptide in various laboratories, but proteomics researchers typically do not leverage the unidentified spectra produced in other labs to decode spectra they generate. We propose a spectral archives approach that clusters MS/MS datasets, representing similar spectra by a single consensus spectrum. Spectral archives extend spectral libraries by analyzing both identified and unidentified spectra in the same way and maintaining information about peptide spectra that are common across species and conditions. Thus archives offer both traditional library spectrum similarity-based search capabilities along with new ways to analyze the data. By developing a clustering tool, MS-Cluster, we generated a spectral archive from ∼1.18 billion spectra that greatly exceeds the size of existing spectral repositories. We advocate that publicly available data should be organized into spectral archives rather than be analyzed as disparate datasets, as is mostly the case today.


Assuntos
Bases de Dados Factuais , Peptídeos/análise , Proteínas/análise , Espectrometria de Massas em Tandem/métodos , Arquivos , Peptídeos/química , Proteínas/química , Proteômica/métodos
2.
J Proteome Res ; 12(6): 2846-57, 2013 Jun 07.
Artigo em Inglês | MEDLINE | ID: mdl-23679345

RESUMO

Full-length de novo sequencing of unknown proteins remains a challenging open problem. Traditional methods that sequence spectra individually are limited by short peptide length, incomplete peptide fragmentation, and ambiguous de novo interpretations. We address these issues by determining consensus sequences for assembled tandem mass (MS/MS) spectra from overlapping peptides (e.g., by using multiple enzymatic digests). We have combined electron-transfer dissociation (ETD) with collision-induced dissociation (CID) and higher-energy collision-induced dissociation (HCD) fragmentation methods to boost interpretation of long, highly charged peptides and take advantage of corroborating b/y/c/z ions in CID/HCD/ETD. Using these strategies, we show that triplet CID/HCD/ETD MS/MS spectra from overlapping peptides yield de novo sequences of average length 70 AA and as long as 200 AA at up to 99% sequencing accuracy.


Assuntos
Algoritmos , Fragmentos de Peptídeos/isolamento & purificação , Proteínas/química , Análise de Sequência de Proteína/métodos , Espectrometria de Massas em Tandem/métodos , Animais , Armoracia/química , Escherichia coli/química , Cavalos , Humanos , Camundongos , Proteólise , Sensibilidade e Especificidade
3.
J Proteome Res ; 8(5): 2241-52, 2009 May.
Artigo em Inglês | MEDLINE | ID: mdl-19231891

RESUMO

The analysis of the large volume of tandem mass spectrometry (MS/MS) proteomics data that is generated these days relies on automated algorithms that identify peptides from their mass spectra. An essential component of these algorithms is the scoring function used to evaluate the quality of peptide-spectrum matches (PSMs). In this paper, we present new approach to scoring of PSMs. We argue that since this problem is at its core a ranking task (especially in the case of de novo sequencing), it can be solved effectively using machine learning ranking algorithms. We developed a new discriminative boosting-based approach to scoring. Our scoring models draw upon a large set of diverse feature functions that measure different qualities of PSMs. Our method improves the performance of our de novo sequencing algorithm beyond the current state-of-the-art, and also greatly enhances the performance of database search programs. Furthermore, by increasing the efficiency of tag filtration and improving the sensitivity of PSM scoring, we make it practical to perform large-scale MS/MS analysis, such as proteogenomic search of a six-frame translation of the human genome (in which we achieve a reduction of the running time by a factor of 15 and a 60% increase in the number of identified peptides, compared to the InsPecT database search tool). Our scoring function is incorporated into PepNovo+ which is available for download or can be run online at http://bix.ucsd.edu.


Assuntos
Peptídeos/análise , Análise de Sequência de Proteína/métodos , Espectrometria de Massas em Tandem/métodos , Algoritmos , Bases de Dados de Proteínas , Peptídeos/química , Proteômica/métodos , Reprodutibilidade dos Testes
4.
J Proteome Res ; 8(5): 2226-40, 2009 May.
Artigo em Inglês | MEDLINE | ID: mdl-19256476

RESUMO

Accurate modeling of peptide fragmentation is necessary for the development of robust scoring functions for peptide-spectrum matches, which are the cornerstone of MS/MS-based identification algorithms. Unfortunately, peptide fragmentation is a complex process that can involve several competing chemical pathways, which makes it difficult to develop generative probabilistic models that describe it accurately. However, the vast amounts of MS/MS data being generated now make it possible to use data-driven machine learning methods to develop discriminative ranking-based models that predict the intensity ranks of a peptide's fragment ions. We use simple sequence-based features that get combined by a boosting algorithm into models that make peak rank predictions with high accuracy. In an accompanying manuscript, we demonstrate how these prediction models are used to significantly improve the performance of peptide identification algorithms. The models can also be useful in the design of optimal multiple reaction monitoring (MRM) transitions, in cases where there is insufficient experimental data to guide the peak selection process. The prediction algorithm can also be run independently through PepNovo+, which is available for download from http://bix.ucsd.edu/Software/PepNovo.html.


Assuntos
Algoritmos , Fragmentos de Peptídeos/análise , Espectrometria de Massas em Tandem/métodos , Fragmentos de Peptídeos/química , Peptídeos/análise , Peptídeos/química , Proteômica/métodos , Reprodutibilidade dos Testes , Análise de Sequência de Proteína/métodos
5.
Anal Chem ; 80(7): 2499-505, 2008 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-18302345

RESUMO

Recent advances in mass spectrometry instrumentation, such as FTICR and OrbiTrap, have made it possible to generate high-resolution spectra of entire proteins. While these methods offer new opportunities for performing "top-down" studies of proteins, the computational tools for analyzing top-down data are still scarce. In this paper we investigate the application of spectral alignment to the problem of identifying protein forms in top-down mass spectra (i.e., identifying the modifications, mutations, insertions, and deletions). We demonstrate how spectral alignment efficiently discovers protein forms even in the presence of numerous modifications and how the algorithm can be extended to discover positional isomers from spectra of mixtures of isobaric protein forms.


Assuntos
Espectrometria de Massas/instrumentação , Espectrometria de Massas/métodos , Algoritmos , Histonas/química , Humanos , Reprodutibilidade dos Testes
6.
J Proteome Res ; 7(1): 113-22, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18067247

RESUMO

Tandem mass spectrometry (MS/MS) experiments often generate redundant data sets containing multiple spectra of the same peptides. Clustering of MS/MS spectra takes advantage of this redundancy by identifying multiple spectra of the same peptide and replacing them with a single representative spectrum. Analyzing only representative spectra results in significant speed-up of MS/MS database searches. We present an efficient clustering approach for analyzing large MS/MS data sets (over 10 million spectra) with a capability to reduce the number of spectra submitted to further analysis by an order of magnitude. The MS/MS database search of clustered spectra results in fewer spurious hits to the database and increases number of peptide identifications as compared to regular nonclustered searches. Our open source software MS-Clustering is available for download at http://peptide.ucsd.edu or can be run online at http://proteomics.bioprojects.org/MassSpec.


Assuntos
Análise por Conglomerados , Peptídeos/análise , Proteômica/métodos , Espectrometria de Massas em Tandem , Sequência de Aminoácidos , Biologia Computacional , Dados de Sequência Molecular
7.
J Proteome Res ; 6(1): 114-23, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17203955

RESUMO

The recent proliferation of novel mass spectrometers such as Fourier transform, QTOF, and OrbiTrap marks a transition into the era of precision mass spectrometry, providing a 2 orders of magnitude boost to the mass resolution, as compared to low-precision ion-trap detectors. We investigate peptide de novo sequencing by precision mass spectrometry and explore some of the differences when compared to analysis of low-precision data. We demonstrate how the dramatically improved performance of de novo sequencing with precision mass spectrometry paves the way for novel approaches to peptide identification that are based on direct sequence lookups, rather than comparisons of spectra to a database. With the direct sequence lookup, it is not only possible to search a database very efficiently, but also to use the database in novel ways, such as searching for products of alternative splicing or products of fusion proteins in cancer. Our de novo sequencing software is available for download at http://peptide.ucsd.edu/.


Assuntos
Espectrometria de Massas/métodos , Peptídeos/química , Algoritmos , Processamento Alternativo , Biologia Computacional/métodos , Interpretação Estatística de Dados , Bases de Dados de Proteínas , Análise de Fourier , Genoma Humano , Genômica , Humanos , Proteínas de Neoplasias/química , Probabilidade , Análise de Sequência de Proteína , Espectroscopia de Infravermelho com Transformada de Fourier
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA