Búsqueda | Portal de Búsqueda de la BVS Ecuador

Direct maximization of protein identifications from tandem mass spectra.

Spivak, Marina; Weston, Jason; Tomazela, Daniela; MacCoss, Michael J; Noble, William Stafford.

Mol Cell Proteomics ; 11(2): M111.012161, 2012 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-22052992

RESUMEN

The goal of many shotgun proteomics experiments is to determine the protein complement of a complex biological mixture. For many mixtures, most methodological approaches fall significantly short of this goal. Existing solutions to this problem typically subdivide the task into two stages: first identifying a collection of peptides with a low false discovery rate and then inferring from the peptides a corresponding set of proteins. In contrast, we formulate the protein identification problem as a single optimization problem, which we solve using machine learning methods. This approach is motivated by the observation that the peptide and protein level tasks are cooperative, and the solution to each can be improved by using information about the solution to the other. The resulting algorithm directly controls the relevant error rate, can incorporate a wide variety of evidence and, for complex samples, provides 18-34% more protein identifications than the current state of the art approaches.

Asunto(s)

Inteligencia Artificial , Mezclas Complejas/análisis , Modelos Estadísticos , Proteínas/análisis , Proteómica , Espectrometría de Masas en Tándem/métodos , Algoritmos , Líquido Amniótico/química , Líquido Amniótico/metabolismo , Proteínas de Caenorhabditis elegans/metabolismo , Bases de Datos de Proteínas , Humanos , Reflujo Laringofaríngeo , Fragmentos de Péptidos/análisis , Proteínas de Saccharomyces cerevisiae/metabolismo , Programas Informáticos

Learning score function parameters for improved spectrum identification in tandem mass spectrometry experiments.

Spivak, Marina; Bereman, Michael S; Maccoss, Michael J; Noble, William Stafford.

J Proteome Res ; 11(9): 4499-508, 2012 Sep 07.

Artículo en Inglés | MEDLINE | ID: mdl-22866926

RESUMEN

The identification of proteins from spectra derived from a tandem mass spectrometry experiment involves several challenges: matching each observed spectrum to a peptide sequence, ranking the resulting collection of peptide-spectrum matches, assigning statistical confidence estimates to the matches, and identifying the proteins. The present work addresses algorithms to rank peptide-spectrum matches. Many of these algorithms, such as PeptideProphet, IDPicker, or Q-ranker, follow a similar methodology that includes representing peptide-spectrum matches as feature vectors and using optimization techniques to rank them. We propose a richer and more flexible feature set representation that is based on the parametrization of the SEQUEST XCorr score and that can be used by all of these algorithms. This extended feature set allows a more effective ranking of the peptide-spectrum matches based on the target-decoy strategy, in comparison to a baseline feature set devoid of these XCorr-based features. Ranking using the extended feature set gives 10-40% improvement in the number of distinct peptide identifications relative to a range of q-value thresholds. While this work is inspired by the model of the theoretical spectrum and the similarity measure between spectra used specifically by SEQUEST, the method itself can be applied to the output of any database search. Further, our approach can be trivially extended beyond XCorr to any linear operator that can serve as similarity score between experimental spectra and peptide sequences.

Asunto(s)

Inteligencia Artificial , Mapeo Peptídico/métodos , Espectrometría de Masas en Tándem/métodos , Algoritmos , Proteínas de Caenorhabditis elegans/química , Fragmentos de Péptidos/química , Reproducibilidad de los Resultados , Proteínas de Saccharomyces cerevisiae/química , Programas Informáticos

A mathematical tool for exploring the dynamics of biological networks.

Barbano, Paolo E; Spivak, Marina; Flajolet, Marc; Nairn, Angus C; Greengard, Paul; Greengard, Leslie.

Proc Natl Acad Sci U S A ; 104(49): 19169-74, 2007 Dec 04.

Artículo en Inglés | MEDLINE | ID: mdl-18032599

RESUMEN

We have developed a mathematical approach to the study of dynamical biological networks, based on combining large-scale numerical simulation with nonlinear "dimensionality reduction" methods. Our work was motivated by an interest in the complex organization of the signaling cascade centered on the neuronal phosphoprotein DARPP-32 (dopamine- and cAMP-regulated phosphoprotein of molecular weight 32,000). Our approach has allowed us to detect robust features of the system in the presence of noise. In particular, the global network topology serves to stabilize the net state of DARPP-32 phosphorylation in response to variation of the input levels of the neurotransmitters dopamine and glutamate, despite significant perturbation to the concentrations and levels of activity of a number of intermediate chemical species. Further, our results suggest that the entire topology of the network is needed to impart this stability to one portion of the network at the expense of the rest. This could have significant implications for systems biology, in that large, complex pathways may have properties that are not easily replicated with simple modules.

Asunto(s)

Simulación por Computador , Cómputos Matemáticos , Modelos Biológicos , Transducción de Señal , Animales , Dopamina/metabolismo , Fosfoproteína 32 Regulada por Dopamina y AMPc/metabolismo , Ácido Glutámico/metabolismo , Humanos , Fosforilación , Transmisión Sináptica

A sense of life: computational and experimental investigations with models of biochemical and evolutionary processes.

Mishra, Bud; Daruwala, Raoul-Sam; Zhou, Yi; Ugel, Nadia; Policriti, Alberto; Antoniotti, Marco; Paxia, Salvatore; Rejali, Marc; Rudra, Archisman; Cherepinsky, Vera; Silver, Naomi; Casey, William; Piazza, Carla; Simeoni, Marta; Barbano, Paolo; Spivak, Marina; Feng, Jiawu; Gill, Ofer; Venkatesh, Mysore; Cheng, Fang; Sun, Bing; Ioniata, Iuliana; Anantharaman, Thomas; Hubbard, E Jane Albert; Pnueli, Amir; Harel, David; Chandru, Vijay; Hariharan, Ramesh; Wigler, Michael; Park, Frank; Lin, Shih-Chieh; Lazebnik, Yuri; Winkler, Franz; Cantor, Charles R; Carbone, Alessandra; Gromov, Mikhael.

OMICS ; 7(3): 253-68, 2003.

Artículo en Inglés | MEDLINE | ID: mdl-14583115

RESUMEN

We collaborate in a research program aimed at creating a rigorous framework, experimental infrastructure, and computational environment for understanding, experimenting with, manipulating, and modifying a diverse set of fundamental biological processes at multiple scales and spatio-temporal modes. The novelty of our research is based on an approach that (i) requires coevolution of experimental science and theoretical techniques and (ii) exploits a certain universality in biology guided by a parsimonious model of evolutionary mechanisms operating at the genomic level and manifesting at the proteomic, transcriptomic, phylogenic, and other higher levels. Our current program in "systems biology" endeavors to marry large-scale biological experiments with the tools to ponder and reason about large, complex, and subtle natural systems. To achieve this ambitious goal, ideas and concepts are combined from many different fields: biological experimentation, applied mathematical modeling, computational reasoning schemes, and large-scale numerical and symbolic simulations. From a biological viewpoint, the basic issues are many: (i) understanding common and shared structural motifs among biological processes; (ii) modeling biological noise due to interactions among a small number of key molecules or loss of synchrony; (iii) explaining the robustness of these systems in spite of such noise; and (iv) cataloging multistatic behavior and adaptation exhibited by many biological processes.

Asunto(s)

Biología Computacional/métodos , Evolución Molecular , Modelos Biológicos , Animales , Bioquímica/métodos , Células/citología , Células/metabolismo , Humanos , Modelos Genéticos , Purinas/metabolismo , Programas Informáticos , Análisis de Sistemas

Involvement of calcium-mediated effects on ROS metabolism in the regulation of growth improvement under salinity.

Shoresh, Michal; Spivak, Marina; Bernstein, Nirit.

Free Radic Biol Med ; 51(6): 1221-34, 2011 Sep 15.

Artículo en Inglés | MEDLINE | ID: mdl-21466848

RESUMEN

Salinity reduces Ca(2+) availability, transport, and mobility to growing regions of the plant and supplemental Ca(2+) is known to reduce salinity damages. This study was undertaken to unravel some of the ameliorative mechanisms of Ca(2+) on salt stress at the cellular and tissue levels. Zea mays L. plants were grown in nutrient solution containing 1 or 80 mM NaCl with various Ca(2+) levels. Measurements of growth and physiological parameters, such as ion imbalance, indicated that the Ca(2+)-induced alleviation mechanisms differed between plant organs. Under salinity, H(2)O(2) levels increased in the leaf-growing tissue with increasing levels of supplemental Ca(2+) and reached the levels of control plants, whereas superoxide levels remained low at all Ca(2+) levels, indicating that Ca(2+) affected growth by increasing H(2)O(2) but not superoxide levels. Salinity completely abolished apoplastic peroxidase activity. Supplemental Ca(2+) increased its activity only slightly. However, under salinity, polyamine oxidase (PAO) activity was shifted toward the leaf base probably as an adaptive mechanism aimed at restoring normal levels of reactive oxygen species (ROS) at the expansion zone where NADPH oxidase could no longer provide the required ROS for growth. Interestingly, addition of Ca(2+) shifted the PAO-activity peak back to its original location in addition to its enhancement. The increase in PAO activity in conjunction with low levels of apoplastic peroxidase is supportive of cellular growth via nonenzymatic wall loosening derived by the increase in H(2)O(2) and less supportive of the peroxidase-mediated cross-linking of wall material. Thus extracellular Ca(2+) can modulate ROS levels at specific tissue localization and developmental stages thereby affecting cellular extension.

Asunto(s)

Calcio/metabolismo , Hojas de la Planta/metabolismo , Proteínas de Plantas/metabolismo , Especies Reactivas de Oxígeno/metabolismo , Salinidad , Peróxido de Hidrógeno/metabolismo , Estrés Oxidativo , Oxidorreductasas actuantes sobre Donantes de Grupo CH-NH/metabolismo , Peroxidasa/metabolismo , Reguladores del Crecimiento de las Plantas/metabolismo , Raíces de Plantas/metabolismo , Zea mays , Poliamino Oxidasa

Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets.

Spivak, Marina; Weston, Jason; Bottou, Léon; Käll, Lukas; Noble, William Stafford.

J Proteome Res ; 8(7): 3737-45, 2009 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-19385687

RESUMEN

Shotgun proteomics coupled with database search software allows the identification of a large number of peptides in a single experiment. However, some existing search algorithms, such as SEQUEST, use score functions that are designed primarily to identify the best peptide for a given spectrum. Consequently, when comparing identifications across spectra, the SEQUEST score function Xcorr fails to discriminate accurately between correct and incorrect peptide identifications. Several machine learning methods have been proposed to address the resulting classification task of distinguishing between correct and incorrect peptide-spectrum matches (PSMs). A recent example is Percolator, which uses semisupervised learning and a decoy database search strategy to learn to distinguish between correct and incorrect PSMs identified by a database search algorithm. The current work describes three improvements to Percolator. (1) Percolator's heuristic optimization is replaced with a clear objective function, with intuitive reasons behind its choice. (2) Tractable nonlinear models are used instead of linear models, leading to improved accuracy over the original Percolator. (3) A method, Q-ranker, for directly optimizing the number of identified spectra at a specified q value is proposed, which achieves further gains.

Asunto(s)

Espectrometría de Masas/métodos , Péptidos/química , Proteómica/métodos , Algoritmos , Animales , Inteligencia Artificial , Quimotripsina/química , Biología Computacional/métodos , Bases de Datos de Proteínas , Modelos Estadísticos , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Tripsina/química

A coherent framework for multiresolution analysis of biological networks with "memory": Ras pathway, cell cycle, and immune system.

Barbano, Paolo Emilio; Spivak, Marina; Feng, Jiawu; Antoniotti, Marco; Mishra, Bud.

Proc Natl Acad Sci U S A ; 102(18): 6245-50, 2005 May 03.

Artículo en Inglés | MEDLINE | ID: mdl-15843460

RESUMEN

Various biological processes exhibit characteristics that vary dramatically in response to different input conditions or changes in the history of the process itself. One of the examples studied here, the Ras-PKC-mitogen-activated protein kinase (MAPK) bistable pathway, follows two distinct dynamics (modes) depending on duration and strength of EGF stimulus. Similar examples are found in the behavior of the cell cycle and the immune system. A classification methodology, based on time-frequency analysis, was developed and tested on these systems to understand global behavior of biological processes. Contrary to most traditionally used statistical and spectral methods, our approach captures complex functional relations between parts of the systems in a simple way. The resulting algorithms are capable of analyzing and classifying sets of time-series data obtained from in vivo or in vitro experiments, or in silico simulation of biological processes. The method was found to be considerably stable under stochastic noise perturbation and, therefore, suitable for the analysis of real experimental data.

Asunto(s)

Algoritmos , Ciclo Celular/fisiología , Inmunidad/fisiología , Modelos Teóricos , Transducción de Señal/fisiología , Biología de Sistemas , Proteínas Quinasas Activadas por Mitógenos/metabolismo , Proteína Quinasa C/metabolismo , Factores de Tiempo , Proteínas ras/metabolismo

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA