An automated proteogenomic method uses mass spectrometry to reveal novel genes in Zea mays.
Mol Cell Proteomics
; 13(1): 157-67, 2014 Jan.
Article
em En
| MEDLINE
| ID: mdl-24142994
New technologies in genomics and proteomics have influenced the emergence of proteogenomics, a field at the confluence of genomics, transcriptomics, and proteomics. First generation proteogenomic toolkits employ peptide mass spectrometry to identify novel protein coding regions. We extend first generation proteogenomic tools to achieve greater accuracy and enable the analysis of large, complex genomes. We apply our pipeline to Zea mays, which has a genome comparable in size to human. Our pipeline begins with the comparison of mass spectra to a putative translation of the genome. We select novel peptides, those that match a region of the genome that was not previously known to be protein coding, for grouping into refinement events. We present a novel, probabilistic framework for evaluating the accuracy of each event. Our calculated event probability, or eventProb, considers the number of supporting peptides and spectra, and the quality of each supporting peptide-spectrum match. Our pipeline predicts 165 novel protein-coding genes and proposes updated models for 741 additional genes.
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Zea mays
/
Genômica
/
Proteômica
Tipo de estudo:
Prognostic_studies
Limite:
Humans
Idioma:
En
Revista:
Mol Cell Proteomics
Assunto da revista:
BIOLOGIA MOLECULAR
/
BIOQUIMICA
Ano de publicação:
2014
Tipo de documento:
Article