Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
Mol Cell Proteomics ; 11(2): M111.012161, 2012 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-22052992

RESUMO

The goal of many shotgun proteomics experiments is to determine the protein complement of a complex biological mixture. For many mixtures, most methodological approaches fall significantly short of this goal. Existing solutions to this problem typically subdivide the task into two stages: first identifying a collection of peptides with a low false discovery rate and then inferring from the peptides a corresponding set of proteins. In contrast, we formulate the protein identification problem as a single optimization problem, which we solve using machine learning methods. This approach is motivated by the observation that the peptide and protein level tasks are cooperative, and the solution to each can be improved by using information about the solution to the other. The resulting algorithm directly controls the relevant error rate, can incorporate a wide variety of evidence and, for complex samples, provides 18-34% more protein identifications than the current state of the art approaches.


Assuntos
Inteligência Artificial , Misturas Complexas/análise , Modelos Estatísticos , Proteínas/análise , Proteômica , Espectrometria de Massas em Tandem/métodos , Algoritmos , Líquido Amniótico/química , Líquido Amniótico/metabolismo , Proteínas de Caenorhabditis elegans/metabolismo , Bases de Dados de Proteínas , Humanos , Refluxo Laringofaríngeo , Fragmentos de Peptídeos/análise , Proteínas de Saccharomyces cerevisiae/metabolismo , Software
2.
J Proteome Res ; 11(9): 4499-508, 2012 Sep 07.
Artigo em Inglês | MEDLINE | ID: mdl-22866926

RESUMO

The identification of proteins from spectra derived from a tandem mass spectrometry experiment involves several challenges: matching each observed spectrum to a peptide sequence, ranking the resulting collection of peptide-spectrum matches, assigning statistical confidence estimates to the matches, and identifying the proteins. The present work addresses algorithms to rank peptide-spectrum matches. Many of these algorithms, such as PeptideProphet, IDPicker, or Q-ranker, follow a similar methodology that includes representing peptide-spectrum matches as feature vectors and using optimization techniques to rank them. We propose a richer and more flexible feature set representation that is based on the parametrization of the SEQUEST XCorr score and that can be used by all of these algorithms. This extended feature set allows a more effective ranking of the peptide-spectrum matches based on the target-decoy strategy, in comparison to a baseline feature set devoid of these XCorr-based features. Ranking using the extended feature set gives 10-40% improvement in the number of distinct peptide identifications relative to a range of q-value thresholds. While this work is inspired by the model of the theoretical spectrum and the similarity measure between spectra used specifically by SEQUEST, the method itself can be applied to the output of any database search. Further, our approach can be trivially extended beyond XCorr to any linear operator that can serve as similarity score between experimental spectra and peptide sequences.


Assuntos
Inteligência Artificial , Mapeamento de Peptídeos/métodos , Espectrometria de Massas em Tandem/métodos , Algoritmos , Proteínas de Caenorhabditis elegans/química , Fragmentos de Peptídeos/química , Reprodutibilidade dos Testes , Proteínas de Saccharomyces cerevisiae/química , Software
3.
Proc Natl Acad Sci U S A ; 104(49): 19169-74, 2007 Dec 04.
Artigo em Inglês | MEDLINE | ID: mdl-18032599

RESUMO

We have developed a mathematical approach to the study of dynamical biological networks, based on combining large-scale numerical simulation with nonlinear "dimensionality reduction" methods. Our work was motivated by an interest in the complex organization of the signaling cascade centered on the neuronal phosphoprotein DARPP-32 (dopamine- and cAMP-regulated phosphoprotein of molecular weight 32,000). Our approach has allowed us to detect robust features of the system in the presence of noise. In particular, the global network topology serves to stabilize the net state of DARPP-32 phosphorylation in response to variation of the input levels of the neurotransmitters dopamine and glutamate, despite significant perturbation to the concentrations and levels of activity of a number of intermediate chemical species. Further, our results suggest that the entire topology of the network is needed to impart this stability to one portion of the network at the expense of the rest. This could have significant implications for systems biology, in that large, complex pathways may have properties that are not easily replicated with simple modules.


Assuntos
Simulação por Computador , Computação Matemática , Modelos Biológicos , Transdução de Sinais , Animais , Dopamina/metabolismo , Fosfoproteína 32 Regulada por cAMP e Dopamina/metabolismo , Ácido Glutâmico/metabolismo , Humanos , Fosforilação , Transmissão Sináptica
4.
OMICS ; 7(3): 253-68, 2003.
Artigo em Inglês | MEDLINE | ID: mdl-14583115

RESUMO

We collaborate in a research program aimed at creating a rigorous framework, experimental infrastructure, and computational environment for understanding, experimenting with, manipulating, and modifying a diverse set of fundamental biological processes at multiple scales and spatio-temporal modes. The novelty of our research is based on an approach that (i) requires coevolution of experimental science and theoretical techniques and (ii) exploits a certain universality in biology guided by a parsimonious model of evolutionary mechanisms operating at the genomic level and manifesting at the proteomic, transcriptomic, phylogenic, and other higher levels. Our current program in "systems biology" endeavors to marry large-scale biological experiments with the tools to ponder and reason about large, complex, and subtle natural systems. To achieve this ambitious goal, ideas and concepts are combined from many different fields: biological experimentation, applied mathematical modeling, computational reasoning schemes, and large-scale numerical and symbolic simulations. From a biological viewpoint, the basic issues are many: (i) understanding common and shared structural motifs among biological processes; (ii) modeling biological noise due to interactions among a small number of key molecules or loss of synchrony; (iii) explaining the robustness of these systems in spite of such noise; and (iv) cataloging multistatic behavior and adaptation exhibited by many biological processes.


Assuntos
Biologia Computacional/métodos , Evolução Molecular , Modelos Biológicos , Animais , Bioquímica/métodos , Células/citologia , Células/metabolismo , Humanos , Modelos Genéticos , Purinas/metabolismo , Software , Análise de Sistemas
5.
Free Radic Biol Med ; 51(6): 1221-34, 2011 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-21466848

RESUMO

Salinity reduces Ca(2+) availability, transport, and mobility to growing regions of the plant and supplemental Ca(2+) is known to reduce salinity damages. This study was undertaken to unravel some of the ameliorative mechanisms of Ca(2+) on salt stress at the cellular and tissue levels. Zea mays L. plants were grown in nutrient solution containing 1 or 80 mM NaCl with various Ca(2+) levels. Measurements of growth and physiological parameters, such as ion imbalance, indicated that the Ca(2+)-induced alleviation mechanisms differed between plant organs. Under salinity, H(2)O(2) levels increased in the leaf-growing tissue with increasing levels of supplemental Ca(2+) and reached the levels of control plants, whereas superoxide levels remained low at all Ca(2+) levels, indicating that Ca(2+) affected growth by increasing H(2)O(2) but not superoxide levels. Salinity completely abolished apoplastic peroxidase activity. Supplemental Ca(2+) increased its activity only slightly. However, under salinity, polyamine oxidase (PAO) activity was shifted toward the leaf base probably as an adaptive mechanism aimed at restoring normal levels of reactive oxygen species (ROS) at the expansion zone where NADPH oxidase could no longer provide the required ROS for growth. Interestingly, addition of Ca(2+) shifted the PAO-activity peak back to its original location in addition to its enhancement. The increase in PAO activity in conjunction with low levels of apoplastic peroxidase is supportive of cellular growth via nonenzymatic wall loosening derived by the increase in H(2)O(2) and less supportive of the peroxidase-mediated cross-linking of wall material. Thus extracellular Ca(2+) can modulate ROS levels at specific tissue localization and developmental stages thereby affecting cellular extension.


Assuntos
Cálcio/metabolismo , Folhas de Planta/metabolismo , Proteínas de Plantas/metabolismo , Espécies Reativas de Oxigênio/metabolismo , Salinidade , Peróxido de Hidrogênio/metabolismo , Estresse Oxidativo , Oxirredutases atuantes sobre Doadores de Grupo CH-NH/metabolismo , Peroxidase/metabolismo , Reguladores de Crescimento de Plantas/metabolismo , Raízes de Plantas/metabolismo , Zea mays , Poliamina Oxidase
6.
J Proteome Res ; 8(7): 3737-45, 2009 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-19385687

RESUMO

Shotgun proteomics coupled with database search software allows the identification of a large number of peptides in a single experiment. However, some existing search algorithms, such as SEQUEST, use score functions that are designed primarily to identify the best peptide for a given spectrum. Consequently, when comparing identifications across spectra, the SEQUEST score function Xcorr fails to discriminate accurately between correct and incorrect peptide identifications. Several machine learning methods have been proposed to address the resulting classification task of distinguishing between correct and incorrect peptide-spectrum matches (PSMs). A recent example is Percolator, which uses semisupervised learning and a decoy database search strategy to learn to distinguish between correct and incorrect PSMs identified by a database search algorithm. The current work describes three improvements to Percolator. (1) Percolator's heuristic optimization is replaced with a clear objective function, with intuitive reasons behind its choice. (2) Tractable nonlinear models are used instead of linear models, leading to improved accuracy over the original Percolator. (3) A method, Q-ranker, for directly optimizing the number of identified spectra at a specified q value is proposed, which achieves further gains.


Assuntos
Espectrometria de Massas/métodos , Peptídeos/química , Proteômica/métodos , Algoritmos , Animais , Inteligência Artificial , Quimotripsina/química , Biologia Computacional/métodos , Bases de Dados de Proteínas , Modelos Estatísticos , Análise de Sequência de Proteína/métodos , Software , Tripsina/química
7.
Proc Natl Acad Sci U S A ; 102(18): 6245-50, 2005 May 03.
Artigo em Inglês | MEDLINE | ID: mdl-15843460

RESUMO

Various biological processes exhibit characteristics that vary dramatically in response to different input conditions or changes in the history of the process itself. One of the examples studied here, the Ras-PKC-mitogen-activated protein kinase (MAPK) bistable pathway, follows two distinct dynamics (modes) depending on duration and strength of EGF stimulus. Similar examples are found in the behavior of the cell cycle and the immune system. A classification methodology, based on time-frequency analysis, was developed and tested on these systems to understand global behavior of biological processes. Contrary to most traditionally used statistical and spectral methods, our approach captures complex functional relations between parts of the systems in a simple way. The resulting algorithms are capable of analyzing and classifying sets of time-series data obtained from in vivo or in vitro experiments, or in silico simulation of biological processes. The method was found to be considerably stable under stochastic noise perturbation and, therefore, suitable for the analysis of real experimental data.


Assuntos
Algoritmos , Ciclo Celular/fisiologia , Imunidade/fisiologia , Modelos Teóricos , Transdução de Sinais/fisiologia , Biologia de Sistemas , Proteínas Quinases Ativadas por Mitógeno/metabolismo , Proteína Quinase C/metabolismo , Fatores de Tempo , Proteínas ras/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA