Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
1.
Nucleic Acids Res ; 49(D1): D831-D847, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33037820

RESUMO

Bgee is a database to retrieve and compare gene expression patterns in multiple animal species, produced by integrating multiple data types (RNA-Seq, Affymetrix, in situ hybridization, and EST data). It is based exclusively on curated healthy wild-type expression data (e.g., no gene knock-out, no treatment, no disease), to provide a comparable reference of normal gene expression. Curation includes very large datasets such as GTEx (re-annotation of samples as 'healthy' or not) as well as many small ones. Data are integrated and made comparable between species thanks to consistent data annotation and processing, and to calls of presence/absence of expression, along with expression scores. As a result, Bgee is capable of detecting the conditions of expression of any single gene, accommodating any data type and species. Bgee provides several tools for analyses, allowing, e.g., automated comparisons of gene expression patterns within and between species, retrieval of the prefered conditions of expression of any gene, or enrichment analyses of conditions with expression of sets of genes. Bgee release 14.1 includes 29 animal species, and is available at https://bgee.org/ and through its Bioconductor R package BgeeDB.


Assuntos
Curadoria de Dados , Bases de Dados Genéticas , Transcriptoma/genética , Animais , Regulação da Expressão Gênica , Anotação de Sequência Molecular , Interface Usuário-Computador
2.
Bioinformatics ; 36(24): 5712-5718, 2021 04 05.
Artigo em Inglês | MEDLINE | ID: mdl-32637990

RESUMO

MOTIVATION: A large variety of molecular interactions occurs between biomolecular components in cells. When a molecular interaction results in a regulatory effect, exerted by one component onto a downstream component, a so-called 'causal interaction' takes place. Causal interactions constitute the building blocks in our understanding of larger regulatory networks in cells. These causal interactions and the biological processes they enable (e.g. gene regulation) need to be described with a careful appreciation of the underlying molecular reactions. A proper description of this information enables archiving, sharing and reuse by humans and for automated computational processing. Various representations of causal relationships between biological components are currently used in a variety of resources. RESULTS: Here, we propose a checklist that accommodates current representations, called the Minimum Information about a Molecular Interaction CAusal STatement (MI2CAST). This checklist defines both the required core information, as well as a comprehensive set of other contextual details valuable to the end user and relevant for reusing and reproducing causal molecular interaction information. The MI2CAST checklist can be used as reporting guidelines when annotating and curating causal statements, while fostering uniformity and interoperability of the data across resources. AVAILABILITY AND IMPLEMENTATION: The checklist together with examples is accessible at https://github.com/MI2CAST/MI2CAST. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Causalidade , Humanos
3.
Nucleic Acids Res ; 47(D1): D596-D600, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30272209

RESUMO

Rhea (http://www.rhea-db.org) is a comprehensive and non-redundant resource of over 11 000 expert-curated biochemical reactions that uses chemical entities from the ChEBI ontology to represent reaction participants. Originally designed as an annotation vocabulary for the UniProt Knowledgebase (UniProtKB), Rhea also provides reaction data for a range of other core knowledgebases and data repositories including ChEBI and MetaboLights. Here we describe recent developments in Rhea, focusing on a new resource description framework representation of Rhea reaction data and an SPARQL endpoint (https://sparql.rhea-db.org/sparql) that provides access to it. We demonstrate how federated queries that combine the Rhea SPARQL endpoint and other SPARQL endpoints such as that of UniProt can provide improved metabolite annotation and support integrative analyses that link the metabolome through the proteome to the transcriptome and genome. These developments will significantly boost the utility of Rhea as a means to link chemistry and biology for a more holistic understanding of biological systems and their function in health and disease.


Assuntos
Bases de Dados de Compostos Químicos , Bases de Dados de Proteínas , Metabolômica/métodos , Software/normas , Humanos , Bases de Conhecimento , Biologia de Sistemas/métodos
4.
Nucleic Acids Res ; 45(D1): D415-D418, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27789701

RESUMO

Rhea (http://www.rhea-db.org) is a comprehensive and non-redundant resource of expert-curated biochemical reactions designed for the functional annotation of enzymes and the description of metabolic networks. Rhea describes enzyme-catalyzed reactions covering the IUBMB Enzyme Nomenclature list as well as additional reactions, including spontaneously occurring reactions, using entities from the ChEBI (Chemical Entities of Biological Interest) ontology of small molecules. Here we describe developments in Rhea since our last report in the database issue of Nucleic Acids Research. These include the first implementation of a simple hierarchical classification of reactions, improved coverage of the IUBMB Enzyme Nomenclature list and additional reactions through continuing expert curation, and the development of a new website to serve this improved dataset.

5.
Nucleic Acids Res ; 43(Database issue): D459-64, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25332395

RESUMO

Rhea (http://www.ebi.ac.uk/rhea) is a comprehensive and non-redundant resource of expert-curated biochemical reactions described using species from the ChEBI (Chemical Entities of Biological Interest) ontology of small molecules. Rhea has been designed for the functional annotation of enzymes and the description of genome-scale metabolic networks, providing stoichiometrically balanced enzyme-catalyzed reactions (covering the IUBMB Enzyme Nomenclature list and additional reactions), transport reactions and spontaneously occurring reactions. Rhea reactions are extensively curated with links to source literature and are mapped to other publicly available enzyme and pathway databases such as Reactome, BioCyc, KEGG and UniPathway, through manual curation and computational methods. Here we describe developments in Rhea since our last report in the 2012 database issue of Nucleic Acids Research. These include significant growth in the number of Rhea reactions and the inclusion of reactions involving complex macromolecules such as proteins, nucleic acids and other polymers that lie outside the scope of ChEBI. Together these developments will significantly increase the utility of Rhea as a tool for the description, analysis and reconciliation of genome-scale metabolic models.


Assuntos
Bases de Dados de Compostos Químicos , Enzimas/metabolismo , Redes e Vias Metabólicas , Fenômenos Bioquímicos , Biopolímeros/metabolismo , Genômica , Internet , Redes e Vias Metabólicas/genética
6.
BMC Bioinformatics ; 17(1): 410, 2016 Oct 06.
Artigo em Inglês | MEDLINE | ID: mdl-27716031

RESUMO

BACKGROUND: Prior knowledge networks (PKNs) provide a framework for the development of computational biological models, including Boolean models of regulatory networks which are the focus of this work. PKNs are created by a painstaking process of literature curation, and generally describe all relevant regulatory interactions identified using a variety of experimental conditions and systems, such as specific cell types or tissues. Certain of these regulatory interactions may not occur in all biological contexts of interest, and their presence may dramatically change the dynamical behaviour of the resulting computational model, hindering the elucidation of the underlying mechanisms and reducing the usefulness of model predictions. Methods are therefore required to generate optimized contextual network models from generic PKNs. RESULTS: We developed a new approach to generate and optimize Boolean networks, based on a given PKN. Using a genetic algorithm, a model network is built as a sub-network of the PKN and trained against experimental data to reproduce the experimentally observed behaviour in terms of attractors and the transitions that occur between them under specific perturbations. The resulting model network is therefore contextualized to the experimental conditions and constitutes a dynamical Boolean model closer to the observed biological process used to train the model than the original PKN. Such a model can then be interrogated to simulate response under perturbation, to detect stable states and their properties, to get insights into the underlying mechanisms and to generate new testable hypotheses. CONCLUSIONS: Generic PKNs attempt to synthesize knowledge of all interactions occurring in a biological process of interest, irrespective of the specific biological context. This limits their usefulness as a basis for the development of context-specific, predictive dynamical Boolean models. The optimization method presented in this article produces specific, contextualized models from generic PKNs. These contextualized models have improved utility for hypothesis generation and experimental design. The general applicability of this methodological approach makes it suitable for a variety of biological systems and of general interest for biological and medical research. Our method was implemented in the software optimusqual, available online at http://www.vital-it.ch/software/optimusqual/ .


Assuntos
Algoritmos , Simulação por Computador , Redes Reguladoras de Genes , Bases de Conhecimento , Modelos Genéticos , Humanos , Modelos Biológicos , Publicações , Software
7.
Bioinformatics ; 31(17): 2860-6, 2015 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-25943471

RESUMO

MOTIVATION: Lipids are a large and diverse group of biological molecules with roles in membrane formation, energy storage and signaling. Cellular lipidomes may contain tens of thousands of structures, a staggering degree of complexity whose significance is not yet fully understood. High-throughput mass spectrometry-based platforms provide a means to study this complexity, but the interpretation of lipidomic data and its integration with prior knowledge of lipid biology suffers from a lack of appropriate tools to manage the data and extract knowledge from it. RESULTS: To facilitate the description and exploration of lipidomic data and its integration with prior biological knowledge, we have developed a knowledge resource for lipids and their biology-SwissLipids. SwissLipids provides curated knowledge of lipid structures and metabolism which is used to generate an in silico library of feasible lipid structures. These are arranged in a hierarchical classification that links mass spectrometry analytical outputs to all possible lipid structures, metabolic reactions and enzymes. SwissLipids provides a reference namespace for lipidomic data publication, data exploration and hypothesis generation. The current version of SwissLipids includes over 244 000 known and theoretically possible lipid structures, over 800 proteins, and curated links to published knowledge from over 620 peer-reviewed publications. We are continually updating the SwissLipids hierarchy with new lipid categories and new expert curated knowledge. AVAILABILITY: SwissLipids is freely available at http://www.swisslipids.org/. CONTACT: alan.bridge@isb-sib.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Bases de Conhecimento , Metabolismo dos Lipídeos , Lipídeos/química , Lipídeos/fisiologia , Espectrometria de Massas/métodos , Humanos , Lipídeos/análise
9.
Life (Basel) ; 14(2)2024 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-38398771

RESUMO

Obesity is considered by many as a lifestyle choice rather than a chronic progressive disease. The Innovative Medicines Initiative (IMI) SOPHIA (Stratification of Obesity Phenotypes to Optimize Future Obesity Therapy) project is part of a momentum shift aiming to provide better tools for the stratification of people with obesity according to disease risk and treatment response. One of the challenges to achieving these goals is that many clinical cohorts are siloed, limiting the potential of combined data for biomarker discovery. In SOPHIA, we have addressed this challenge by setting up a federated database building on open-source DataSHIELD technology. The database currently federates 16 cohorts that are accessible via a central gateway. The database is multi-modal, including research studies, clinical trials, and routine health data, and is accessed using the R statistical programming environment where statistical and machine learning analyses can be performed at a distance without any disclosure of patient-level data. We demonstrate the use of the database by providing a proof-of-concept analysis, performing a federated linear model of BMI and systolic blood pressure, pooling all data from 16 studies virtually without any analyst seeing individual patient-level data. This analysis provided similar point estimates compared to a meta-analysis of the 16 individual studies. Our approach provides a benchmark for reproducible, safe federated analyses across multiple study types provided by multiple stakeholders.

10.
Bioinformatics ; 28(7): 1017-20, 2012 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-22285560

RESUMO

MOTIVATION: Most anatomical ontologies are species-specific, whereas a framework for comparative studies is needed. We describe the vertebrate Homologous Organs Groups ontology, vHOG, used to compare expression patterns between species. RESULTS: vHOG is a multispecies anatomical ontology for the vertebrate lineage. It is based on the HOGs used in the Bgee database of gene expression evolution. vHOG version 1.4 includes 1184 terms, follows OBO principles and is based on the Common Anatomy Reference Ontology (CARO). vHOG only describes structures with historical homology relations between model vertebrate species. The mapping to species-specific anatomical ontologies is provided as a separate file, so that no homology hypothesis is stated within the ontology itself. Each mapping has been manually reviewed, and we provide support codes and references when available. AVAILABILITY AND IMPLEMENTATION: vHOG is available from the Bgee download site (http://bgee.unil.ch/), as well as from the OBO Foundry and the NCBO Bioportal websites. CONTACT: bgee@isb-sib.ch; frederic.bastian@unil.ch.


Assuntos
Bases de Dados Factuais , Terminologia como Assunto , Vertebrados/anatomia & histologia , Animais , Evolução Biológica , Biologia Computacional/métodos , Expressão Gênica , Vertebrados/classificação , Vocabulário Controlado
11.
Nat Commun ; 14(1): 2533, 2023 05 03.
Artigo em Inglês | MEDLINE | ID: mdl-37137910

RESUMO

We identify biomarkers for disease progression in three type 2 diabetes cohorts encompassing 2,973 individuals across three molecular classes, metabolites, lipids and proteins. Homocitrulline, isoleucine and 2-aminoadipic acid, eight triacylglycerol species, and lowered sphingomyelin 42:2;2 levels are predictive of faster progression towards insulin requirement. Of ~1,300 proteins examined in two cohorts, levels of GDF15/MIC-1, IL-18Ra, CRELD1, NogoR, FAS, and ENPP7 are associated with faster progression, whilst SMAC/DIABLO, SPOCK1 and HEMK2 predict lower progression rates. In an external replication, proteins and lipids are associated with diabetes incidence and prevalence. NogoR/RTN4R injection improved glucose tolerance in high fat-fed male mice but impaired it in male db/db mice. High NogoR levels led to islet cell apoptosis, and IL-18R antagonised inflammatory IL-18 signalling towards nuclear factor kappa-B in vitro. This comprehensive, multi-disciplinary approach thus identifies biomarkers with potential prognostic utility, provides evidence for possible disease mechanisms, and identifies potential therapeutic avenues to slow diabetes progression.


Assuntos
Diabetes Mellitus Tipo 2 , Ilhotas Pancreáticas , Camundongos , Animais , Masculino , Diabetes Mellitus Tipo 2/metabolismo , Glicemia/metabolismo , Ilhotas Pancreáticas/metabolismo , Insulina/metabolismo , Lipídeos , Biomarcadores/metabolismo , Moléculas de Adesão Celular/metabolismo , Proteínas da Matriz Extracelular/metabolismo
12.
J Thromb Haemost ; 18(6): 1425-1434, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32077582

RESUMO

BACKGROUND: On-clopidogrel platelet reactivity (PR) is associated with the risk of thrombotic or bleeding event in selected populations of high-risk patients. PR is a highly heritable phenotype and a few variants of cytochrome genes, essentially CYP2C19, are associated with PR but only explain 5% to 12% of the variability. OBJECTIVE: The aim of this study is to delineate genetic determinants of on-clopidogrel PR using high-throughput sequencing. METHODS: We performed a whole exome sequencing of 96 low- and matched high-PR patients in a discovery cohort. Exomes from genes with variants significantly associated with PR were sequenced in 96 low- and matched high-PR patients from an independent replication cohort. RESULTS: We identified 585 variants in 417 genes with an adjusted P value < .05. In the replication cohort, all top variants including CYP2C8, CYP2C18, and CYP2C19 from the discovery population were found again. An original network analysis identified several candidate genes of potential interest such as a regulator of PI3K, a key actor in the downstream signaling pathway of the P2Y12 receptor. CONCLUSION: This study emphasizes the role of CYP-related genes as major regulators of clopidogrel response, including the poorly investigated CYP2C8 and CYP2C18.


Assuntos
Clopidogrel , Inibidores da Agregação Plaquetária , Clopidogrel/farmacologia , Clopidogrel/uso terapêutico , Citocromo P-450 CYP2C19/genética , Exoma , Genótipo , Humanos , Agregação Plaquetária , Sequenciamento do Exoma
14.
Genome Med ; 8(1): 105, 2016 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-27799067

RESUMO

BACKGROUND: Smith-Magenis syndrome (SMS) is a developmental disability/multiple congenital anomaly disorder resulting from haploinsufficiency of RAI1. It is characterized by distinctive facial features, brachydactyly, sleep disturbances, and stereotypic behaviors. METHODS: We investigated a cohort of 15 individuals with a clinical suspicion of SMS who showed neither deletion in the SMS critical region nor damaging variants in RAI1 using whole exome sequencing. A combination of network analysis (co-expression and biomedical text mining), transcriptomics, and circularized chromatin conformation capture (4C-seq) was applied to verify whether modified genes are part of the same disease network as known SMS-causing genes. RESULTS: Potentially deleterious variants were identified in nine of these individuals using whole-exome sequencing. Eight of these changes affect KMT2D, ZEB2, MAP2K2, GLDC, CASK, MECP2, KDM5C, and POGZ, known to be associated with Kabuki syndrome 1, Mowat-Wilson syndrome, cardiofaciocutaneous syndrome, glycine encephalopathy, mental retardation and microcephaly with pontine and cerebellar hypoplasia, X-linked mental retardation 13, X-linked mental retardation Claes-Jensen type, and White-Sutton syndrome, respectively. The ninth individual carries a de novo variant in JAKMIP1, a regulator of neuronal translation that was recently found deleted in a patient with autism spectrum disorder. Analyses of co-expression and biomedical text mining suggest that these pathologies and SMS are part of the same disease network. Further support for this hypothesis was obtained from transcriptome profiling that showed that the expression levels of both Zeb2 and Map2k2 are perturbed in Rai1 -/- mice. As an orthogonal approach to potentially contributory disease gene variants, we used chromatin conformation capture to reveal chromatin contacts between RAI1 and the loci flanking ZEB2 and GLDC, as well as between RAI1 and human orthologs of the genes that show perturbed expression in our Rai1 -/- mouse model. CONCLUSIONS: These holistic studies of RAI1 and its interactions allow insights into SMS and other disorders associated with intellectual disability and behavioral abnormalities. Our findings support a pan-genomic approach to the molecular diagnosis of a distinctive disorder.


Assuntos
Exoma/genética , Redes Reguladoras de Genes , Genômica/métodos , Mutação/genética , Síndrome de Smith-Magenis/genética , Fatores de Transcrição/fisiologia , Transcriptoma , Animais , Embrião de Mamíferos/citologia , Embrião de Mamíferos/metabolismo , Feminino , Humanos , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Knockout , Fenótipo , RNA Mensageiro/genética , Reação em Cadeia da Polimerase em Tempo Real , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Transativadores
15.
PLoS One ; 10(8): e0134214, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26244885

RESUMO

Cytokinesis in fission yeast is controlled by the Septation Initiation Network (SIN), a protein kinase signaling network using the spindle pole body as scaffold. In order to describe the qualitative behavior of the system and predict unknown mutant behaviors we decided to adopt a Boolean modeling approach. In this paper, we report the construction of an extended, Boolean model of the SIN, comprising most SIN components and regulators as individual, experimentally testable nodes. The model uses CDK activity levels as control nodes for the simulation of SIN related events in different stages of the cell cycle. The model was optimized using single knock-out experiments of known phenotypic effect as a training set, and was able to correctly predict a double knock-out test set. Moreover, the model has made in silico predictions that have been validated in vivo, providing new insights into the regulation and hierarchical organization of the SIN.


Assuntos
Citocinese/genética , Regulação Fúngica da Expressão Gênica , Redes Reguladoras de Genes/genética , Modelos Genéticos , Proteínas de Schizosaccharomyces pombe/genética , Schizosaccharomyces/genética , Mutação , Reprodutibilidade dos Testes , Schizosaccharomyces/metabolismo , Proteínas de Schizosaccharomyces pombe/metabolismo , Transdução de Sinais/genética
16.
Database (Oxford) ; 2015: bav043, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25957950

RESUMO

Biocuration has become a cornerstone for analyses in biology, and to meet needs, the amount of annotations has considerably grown in recent years. However, the reliability of these annotations varies; it has thus become necessary to be able to assess the confidence in annotations. Although several resources already provide confidence information about the annotations that they produce, a standard way of providing such information has yet to be defined. This lack of standardization undermines the propagation of knowledge across resources, as well as the credibility of results from high-throughput analyses. Seeded at a workshop during the Biocuration 2012 conference, a working group has been created to address this problem. We present here the elements that were identified as essential for assessing confidence in annotations, as well as a draft ontology--the Confidence Information Ontology--to illustrate how the problems identified could be addressed. We hope that this effort will provide a home for discussing this major issue among the biocuration community. Tracker URL: https://github.com/BgeeDB/confidence-information-ontology Ontology URL: https://raw.githubusercontent.com/BgeeDB/confidence-information-ontology/master/src/ontology/cio-simple.obo


Assuntos
Ontologias Biológicas , Curadoria de Dados/normas , Congressos como Assunto
17.
J Biomed Semantics ; 5: 21, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25009735

RESUMO

BACKGROUND: Elucidating disease and developmental dysfunction requires understanding variation in phenotype. Single-species model organism anatomy ontologies (ssAOs) have been established to represent this variation. Multi-species anatomy ontologies (msAOs; vertebrate skeletal, vertebrate homologous, teleost, amphibian AOs) have been developed to represent 'natural' phenotypic variation across species. Our aim has been to integrate ssAOs and msAOs for various purposes, including establishing links between phenotypic variation and candidate genes. RESULTS: Previously, msAOs contained a mixture of unique and overlapping content. This hampered integration and coordination due to the need to maintain cross-references or inter-ontology equivalence axioms to the ssAOs, or to perform large-scale obsolescence and modular import. Here we present the unification of anatomy ontologies into Uberon, a single ontology resource that enables interoperability among disparate data and research groups. As a consequence, independent development of TAO, VSAO, AAO, and vHOG has been discontinued. CONCLUSIONS: The newly broadened Uberon ontology is a unified cross-taxon resource for metazoans (animals) that has been substantially expanded to include a broad diversity of vertebrate anatomical structures, permitting reasoning across anatomical variation in extinct and extant taxa. Uberon is a core resource that supports single- and cross-species queries for candidate genes using annotations for phenotypes from the systematics, biodiversity, medical, and model organism communities, while also providing entities for logical definitions in the Cell and Gene Ontologies. THE ONTOLOGY RELEASE FILES ASSOCIATED WITH THE ONTOLOGY MERGE DESCRIBED IN THIS MANUSCRIPT ARE AVAILABLE AT: http://purl.obolibrary.org/obo/uberon/releases/2013-02-21/ CURRENT ONTOLOGY RELEASE FILES ARE AVAILABLE ALWAYS AVAILABLE AT: http://purl.obolibrary.org/obo/uberon/releases/

18.
Database (Oxford) ; 2013: bat010, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23487185

RESUMO

As part of the development of the database Bgee (a dataBase for Gene Expression Evolution), we annotate and analyse expression data from different types and different sources, notably Affymetrix data from GEO and ArrayExpress, and RNA-Seq data from SRA. During our quality control procedure, we have identified duplicated content in GEO and ArrayExpress, affecting ∼14% of our data: fully or partially duplicated experiments from independent data submissions, Affymetrix chips reused in several experiments, or reused within an experiment. We present here the procedure that we have established to filter such duplicates from Affymetrix data, and our procedure to identify future potential duplicates in RNA-Seq data.


Assuntos
Bases de Dados Genéticas , Perfilação da Expressão Gênica , Estatística como Assunto , Regulação da Expressão Gênica , Humanos , Análise de Sequência com Séries de Oligonucleotídeos
19.
J Proteome Res ; 4(1): 167-74, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-15707372

RESUMO

There is growing interest to use mass spectrometry data to search genome sequences directly. Previous work by other authors demonstrated that this approach is able to correct and complement available genome annotations. We discuss the practical difficulty of searching large eukaryotic genomes with peptide ion trap tandem mass spectra of small proteins (<40 kDa). The challenging problem of automatically identifying peptides that span across exon/intron boundaries is explored for the first time by using experimental data. In a human genome search, we find that roughly 30% of the peptides are missed, due to various reasons, compared to a Swiss-Prot search. We show that this percentage is significantly reduced with improved parent mass accuracy. We finally provide several examples of predicted gene structures that could be improved by proteomics data, in particular by peptides spanning across exon/intron boundaries.


Assuntos
Genômica/métodos , Proteômica/métodos , Adulto , Sequência de Aminoácidos , Bases de Dados de Proteínas , Éxons , Feminino , Humanos , Íntrons , Espectrometria de Massas/métodos , Peptídeos , Gravidez
20.
Proteomics ; 4(7): 1977-84, 2004 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-15221758

RESUMO

In a previous paper we introduced a novel model-based approach (OLAV) to the problem of identifying peptides via tandem mass spectrometry, for which early implementations showed promising performance. We recently further improved this performance to a remarkable level (1-2% false positive rate at 95% true positive rate) and characterized key properties of OLAV like robustness and training set size. We present these results in a synthetic and coherent way along with detailed performance comparisons, a new scoring component making use of peptide amino acidic composition, and new developments like automatic parameter learning. Finally, we discuss the impact of OLAV on the automation of proteomics projects.


Assuntos
Peptídeos/química , Proteômica/métodos , Algoritmos , Automação , Reações Falso-Positivas , Humanos , Espectrometria de Massas/métodos , Curva ROC , Espectrometria de Massas por Ionização por Electrospray/métodos , Fatores de Tempo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA