Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 76
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Nucleic Acids Res ; 50(D1): D480-D487, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34850135

RESUMEN

The Database of Intrinsically Disordered Proteins (DisProt, URL: https://disprot.org) is the major repository of manually curated annotations of intrinsically disordered proteins and regions from the literature. We report here recent updates of DisProt version 9, including a restyled web interface, refactored Intrinsically Disordered Proteins Ontology (IDPO), improvements in the curation process and significant content growth of around 30%. Higher quality and consistency of annotations is provided by a newly implemented reviewing process and training of curators. The increased curation capacity is fostered by the integration of DisProt with APICURON, a dedicated resource for the proper attribution and recognition of biocuration efforts. Better interoperability is provided through the adoption of the Minimum Information About Disorder (MIADE) standard, an active collaboration with the Gene Ontology (GO) and Evidence and Conclusion Ontology (ECO) consortia and the support of the ELIXIR infrastructure.


Asunto(s)
Bases de Datos de Proteínas , Proteínas Intrínsecamente Desordenadas/metabolismo , Anotación de Secuencia Molecular , Programas Informáticos , Secuencia de Aminoácidos , ADN/genética , ADN/metabolismo , Conjuntos de Datos como Asunto , Ontología de Genes , Humanos , Internet , Proteínas Intrínsecamente Desordenadas/química , Proteínas Intrínsecamente Desordenadas/genética , Unión Proteica , ARN/genética , ARN/metabolismo
2.
Bioinformatics ; 36(24): 5712-5718, 2021 04 05.
Artículo en Inglés | MEDLINE | ID: mdl-32637990

RESUMEN

MOTIVATION: A large variety of molecular interactions occurs between biomolecular components in cells. When a molecular interaction results in a regulatory effect, exerted by one component onto a downstream component, a so-called 'causal interaction' takes place. Causal interactions constitute the building blocks in our understanding of larger regulatory networks in cells. These causal interactions and the biological processes they enable (e.g. gene regulation) need to be described with a careful appreciation of the underlying molecular reactions. A proper description of this information enables archiving, sharing and reuse by humans and for automated computational processing. Various representations of causal relationships between biological components are currently used in a variety of resources. RESULTS: Here, we propose a checklist that accommodates current representations, called the Minimum Information about a Molecular Interaction CAusal STatement (MI2CAST). This checklist defines both the required core information, as well as a comprehensive set of other contextual details valuable to the end user and relevant for reusing and reproducing causal molecular interaction information. The MI2CAST checklist can be used as reporting guidelines when annotating and curating causal statements, while fostering uniformity and interoperability of the data across resources. AVAILABILITY AND IMPLEMENTATION: The checklist together with examples is accessible at https://github.com/MI2CAST/MI2CAST. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Causalidad , Humanos
3.
Nucleic Acids Res ; 48(D1): D328-D334, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31724716

RESUMEN

The neXtProt knowledgebase (https://www.nextprot.org) is an integrative resource providing both data on human protein and the tools to explore these. In order to provide comprehensive and up-to-date data, we evaluate and add new data sets. We describe the incorporation of three new data sets that provide expression, function, protein-protein binary interaction, post-translational modifications (PTM) and variant information. New SPARQL query examples illustrating uses of the new data were added. neXtProt has continued to develop tools for proteomics. We have improved the peptide uniqueness checker and have implemented a new protein digestion tool. Together, these tools make it possible to determine which proteases can be used to identify trypsin-resistant proteins by mass spectrometry. In terms of usability, we have finished revamping our web interface and completely rewritten our API. Our SPARQL endpoint now supports federated queries. All the neXtProt data are available via our user interface, API, SPARQL endpoint and FTP site, including the new PEFF 1.0 format files. Finally, the data on our FTP site is now CC BY 4.0 to promote its reuse.


Asunto(s)
Bases de Datos de Proteínas , Bases del Conocimiento , Humanos , Internet , Espectrometría de Masas , Péptidos/química , Proteínas Quinasas/química , Proteínas Quinasas/metabolismo , Procesamiento Proteico-Postraduccional , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Análisis de Secuencia de ARN , Programas Informáticos , Tripsina , Interfaz Usuario-Computador
4.
Bioinformatics ; 36(10): 3244-3245, 2020 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-31985787

RESUMEN

SUMMARY: The Feature-Viewer is a lightweight library for the visualization of biological data mapped to a protein or nucleotide sequence. It is designed for ease of use while allowing for a full customization. The library is already used by several biological data resources and allows intuitive visual mapping of a full spectra of sequence features for different usages. AVAILABILITY AND IMPLEMENTATION: The Feature-Viewer is open source, compatible with state-of-the-art development technologies and responsive, also for mobile viewing. Documentation and usage examples are available online.


Asunto(s)
Computadores , Programas Informáticos
5.
Nucleic Acids Res ; 47(D1): D1186-D1194, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30407590

RESUMEN

The Evidence and Conclusion Ontology (ECO) contains terms (classes) that describe types of evidence and assertion methods. ECO terms are used in the process of biocuration to capture the evidence that supports biological assertions (e.g. gene product X has function Y as supported by evidence Z). Capture of this information allows tracking of annotation provenance, establishment of quality control measures and query of evidence. ECO contains over 1500 terms and is in use by many leading biological resources including the Gene Ontology, UniProt and several model organism databases. ECO is continually being expanded and revised based on the needs of the biocuration community. The ontology is freely available for download from GitHub (https://github.com/evidenceontology/) or the project's website (http://evidenceontology.org/). Users can request new terms or changes to existing terms through the project's GitHub site. ECO is released into the public domain under CC0 1.0 Universal.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Ontología de Genes , Proteínas/genética , Animales , Humanos , Almacenamiento y Recuperación de la Información/métodos , Internet , Proteínas/metabolismo , Análisis de Secuencia de Proteína , Interfaz Usuario-Computador
6.
Hum Genomics ; 12(1): 36, 2018 07 11.
Artículo en Inglés | MEDLINE | ID: mdl-29996917

RESUMEN

BACKGROUND: Germline pathogenic variants in the breast cancer type 1 susceptibility gene BRCA1 are associated with a 60% lifetime risk for breast and ovarian cancer. This overall risk estimate is for all BRCA1 variants; obviously, not all variants confer the same risk of developing a disease. In cancer patients, loss of BRCA1 function in tumor tissue has been associated with an increased sensitivity to platinum agents and to poly-(ADP-ribose) polymerase (PARP) inhibitors. For clinical management of both at-risk individuals and cancer patients, it would be important that each identified genetic variant be associated with clinical significance. Unfortunately for the vast majority of variants, the clinical impact is unknown. The availability of results from studies assessing the impact of variants on protein function may provide insight of crucial importance. RESULTS AND CONCLUSION: We have collected, curated, and structured the molecular and cellular phenotypic impact of 3654 distinct BRCA1 variants. The data was modeled in triple format, using the variant as a subject, the studied function as the object, and a predicate describing the relation between the two. Each annotation is supported by a fully traceable evidence. The data was captured using standard ontologies to ensure consistency, and enhance searchability and interoperability. We have assessed the extent to which functional defects at the molecular and cellular levels correlate with the clinical interpretation of variants by ClinVar submitters. Approximately 30% of the ClinVar BRCA1 missense variants have some molecular or cellular assay available in the literature. Pathogenic variants (as assigned by ClinVar) have at least some significant functional defect in 94% of testable cases. For benign variants, 77% of ClinVar benign variants, for which neXtProt Cancer variant portal has data, shows either no or mild experimental functional defects. While this does not provide evidence for clinical interpretation of variants, it may provide some guidance for variants of unknown significance, in the absence of more reliable data. The neXtProt Cancer variant portal ( https://www.nextprot.org/portals/breast-cancer ) contains over 6300 observations at the molecular and/or cellular level for BRCA1 variants.


Asunto(s)
Proteína BRCA1/genética , Neoplasias de la Mama/genética , Predisposición Genética a la Enfermedad , Neoplasias Ováricas/genética , Adulto , Anciano , Proteína BRCA1/química , Neoplasias de la Mama/patología , Biología Computacional , Femenino , Variación Genética , Mutación de Línea Germinal/genética , Humanos , Persona de Mediana Edad , Neoplasias Ováricas/patología , Conformación Proteica
7.
Nucleic Acids Res ; 45(D1): D177-D182, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899619

RESUMEN

The neXtProt human protein knowledgebase (https://www.nextprot.org) continues to add new content and tools, with a focus on proteomics and genetic variation data. neXtProt now has proteomics data for over 85% of the human proteins, as well as new tools tailored to the proteomics community.Moreover, the neXtProt release 2016-08-25 includes over 8000 phenotypic observations for over 4000 variations in a number of genes involved in hereditary cancers and channelopathies. These changes are presented in the current neXtProt update. All of the neXtProt data are available via our user interface and FTP site. We also provide an API access and a SPARQL endpoint for more technical applications.


Asunto(s)
Bases de Datos de Proteínas , Proteómica , Estudios de Asociación Genética , Variación Genética , Humanos , Internet , Fenotipo , Proteómica/métodos , Programas Informáticos , Navegador Web
8.
Hum Mutat ; 38(5): 485-493, 2017 05.
Artículo en Inglés | MEDLINE | ID: mdl-28168870

RESUMEN

Voltage-gated sodium channels are pore-forming transmembrane proteins that selectively allow sodium ions to flow across the plasma membrane according to the electro-chemical gradient thus mediating the rising phase of action potentials in excitable cells and playing key roles in physiological processes such as neurotransmission, skeletal muscle contraction, heart rhythm, and pain sensation. Genetic variations in the nine human genes encoding these channels are known to cause a large range of diseases affecting the nervous and cardiac systems. Understanding the molecular effect of genetic variations is critical for elucidating the pathologic mechanisms of known variations and in predicting the effect of newly discovered ones. To this end, we have created a Web-based tool, the Ion Channels Variants Portal, which compiles all variants characterized functionally in the human sodium channel genes. This portal describes 672 variants each associated with at least one molecular or clinical phenotypic impact, for a total of 4,658 observations extracted from 264 different research articles. These data were captured as structured annotations using standardized vocabularies and ontologies, such as the Gene Ontology and the Ion Channel ElectroPhysiology Ontology. All these data are available to the scientific community via neXtProt at https://www.nextprot.org/portals/navmut.


Asunto(s)
Biología Computacional , Bases de Datos Genéticas , Mutación , Canales de Sodio Activados por Voltaje/genética , Canales de Sodio Activados por Voltaje/metabolismo , Animales , Biología Computacional/métodos , Fenómenos Electrofisiológicos/genética , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Anotación de Secuencia Molecular , Fenotipo , Dominios Proteicos , Índice de Severidad de la Enfermedad , Programas Informáticos , Canales de Sodio Activados por Voltaje/química , Navegador Web
9.
Hum Mol Genet ; 24(18): 5270-84, 2015 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-26123492

RESUMEN

In pancreatic ß-cells, mitochondria play a central role in coupling glucose metabolism to insulin secretion. Chronic exposure of ß-cells to metabolic stresses impairs their function and potentially induces apoptosis. Little is known on mitochondrial adaptation to metabolic stresses, i.e. high glucose, fatty acids or oxidative stress; being all highlighted in the pathogenesis of type 2 diabetes. Here, human islets were exposed for 3 days to 25 mm glucose, 0.4 mm palmitate, 0.4 mm oleate and transiently to H2O2. Culture at physiological 5.6 mm glucose served as no-stress control. Expression of mitochondrion-associated genes was quantified, including the transcriptome of mitochondrial inner membrane carriers. Targets of interest were further evaluated at the protein level. Three days after acute oxidative stress, no significant alteration in ß-cell function or apoptosis was detected in human islets. Palmitate specifically increased expression of the pyruvate carriers MPC1 and MPC2, whereas the glutamate carrier GC1 and the aspartate/glutamate carrier AGC1 were down-regulated by palmitate and oleate, respectively. High glucose decreased mRNA levels of key transcription factors (HNF4A, IPF1, PPARA and TFAM) and energy-sensor SIRT1. High glucose also reduced expression of 11 mtDNA-encoded respiratory chain subunits. Interestingly, transcript levels of the carriers for aspartate/glutamate AGC2, malate DIC and malate/oxaloacetate/aspartate UCP2 were increased by high glucose, a profile suggesting important mitochondrial anaplerotic/cataplerotic activities and NADPH-generating shuttles. Chronic exposure to high glucose impaired glucose-stimulated insulin secretion, decreased insulin content, promoted caspase-3 cleavage and cell death, revealing glucotoxicity. Overall, expression profile of mitochondrion-associated genes was selectively modified by glucose, delineating a glucotoxic-specific signature.


Asunto(s)
Diferenciación Celular/genética , Diabetes Mellitus/genética , Islotes Pancreáticos/citología , Islotes Pancreáticos/metabolismo , Mitocondrias/genética , Transcriptoma , Apoptosis/genética , Línea Celular , Supervivencia Celular/genética , ADN Mitocondrial/genética , Diabetes Mellitus/metabolismo , Transporte de Electrón , Expresión Génica , Glucosa/metabolismo , Humanos , Insulina/metabolismo , Mitocondrias/metabolismo , Bombas de Protones/metabolismo , Superóxidos/metabolismo
10.
Nucleic Acids Res ; 43(Database issue): D764-70, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25593349

RESUMEN

neXtProt (http://www.nextprot.org) is a human protein-centric knowledgebase developed at the SIB Swiss Institute of Bioinformatics. Focused solely on human proteins, neXtProt aims to provide a state of the art resource for the representation of human biology by capturing a wide range of data, precise annotations, fully traceable data provenance and a web interface which enables researchers to find and view information in a comprehensive manner. Since the introductory neXtProt publication, significant advances have been made on three main aspects: the representation of proteomics data, an extended representation of human variants and the development of an advanced search capability built around semantic technologies. These changes are presented in the current neXtProt update.


Asunto(s)
Bases de Datos de Proteínas , Variación Genética , Proteínas/genética , Proteómica , Línea Celular , Enfermedad/genética , Humanos , Internet , Proteoma
11.
PLoS Comput Biol ; 11(4): e1004143, 2015 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-25856076

RESUMEN

In recent years, high-throughput technologies have brought big data to the life sciences. The march of progress has been rapid, leaving in its wake a demand for courses in data analysis, data stewardship, computing fundamentals, etc., a need that universities have not yet been able to satisfy--paradoxically, many are actually closing "niche" bioinformatics courses at a time of critical need. The impact of this is being felt across continents, as many students and early-stage researchers are being left without appropriate skills to manage, analyse, and interpret their data with confidence. This situation has galvanised a group of scientists to address the problems on an international scale. For the first time, bioinformatics educators and trainers across the globe have come together to address common needs, rising above institutional and international boundaries to cooperate in sharing bioinformatics training expertise, experience, and resources, aiming to put ad hoc training practices on a more professional footing for the benefit of all.


Asunto(s)
Biología Computacional/educación , Biología Computacional/organización & administración , Curriculum , Relaciones Interinstitucionales , Internacionalidad , Enseñanza/organización & administración
12.
Drug Discov Today Technol ; 14: 11-6, 2015 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-26194582

RESUMEN

Protein databases are a gold mine of potential new drug targets. The ready access to a complete overview of all aspects of protein biology provides the most benefit at the outset of drug discovery pipelines. Ideally, curation strategies used to move from the raw data to the validated knowledge should contain the checks and balances necessary for accuracy. The neXtProt human protein knowledgebase is used here as an example to give insight into these methods.


Asunto(s)
Bases de Datos de Proteínas , Descubrimiento de Drogas , Minería de Datos , Humanos , Conformación Proteica
13.
J Proteome Res ; 13(1): 15-20, 2014 Jan 03.
Artículo en Inglés | MEDLINE | ID: mdl-24364385

RESUMEN

One year ago the Human Proteome Project (HPP) leadership designated the baseline metrics for the Human Proteome Project to be based on neXtProt with a total of 13,664 proteins validated at protein evidence level 1 (PE1) by mass spectrometry, antibody-capture, Edman sequencing, or 3D structures. Corresponding chromosome-specific data were provided from PeptideAtlas, GPMdb, and Human Protein Atlas. This year, the neXtProt total is 15,646 and the other resources, which are inputs to neXtProt, have high-quality identifications and additional annotations for 14,012 in PeptideAtlas, 14,869 in GPMdb, and 10,976 in HPA. We propose to remove 638 genes from the denominator that are "uncertain" or "dubious" in Ensembl, UniProt/SwissProt, and neXtProt. That leaves 3844 "missing proteins", currently having no or inadequate documentation, to be found from a new denominator of 19,490 protein-coding genes. We present those tabulations and web links and discuss current strategies to find the missing proteins.


Asunto(s)
Proteoma , Cromosomas Humanos , Humanos , Espectrometría de Masas
14.
Nucleic Acids Res ; 40(Database issue): D76-83, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22139911

RESUMEN

neXtProt (http://www.nextprot.org/) is a new human protein-centric knowledge platform. Developed at the Swiss Institute of Bioinformatics (SIB), it aims to help researchers answer questions relevant to human proteins. To achieve this goal, neXtProt is built on a corpus containing both curated knowledge originating from the UniProtKB/Swiss-Prot knowledgebase and carefully selected and filtered high-throughput data pertinent to human proteins. This article presents an overview of the database and the data integration process. We also lay out the key future directions of neXtProt that we consider the necessary steps to make neXtProt the one-stop-shop for all research projects focusing on human proteins.


Asunto(s)
Bases de Datos de Proteínas , Humanos , Bases del Conocimiento , Proteínas/genética , Proteínas/metabolismo , Interfaz Usuario-Computador
15.
J Proteome Res ; 12(1): 293-8, 2013 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-23205526

RESUMEN

About 5000 (25%) of the ~20400 human protein-coding genes currently lack any experimental evidence at the protein level. For many others, there is only little information relative to their abundance, distribution, subcellular localization, interactions, or cellular functions. The aim of the HUPO Human Proteome Project (HPP, www.thehpp.org ) is to collect this information for every human protein. HPP is based on three major pillars: mass spectrometry (MS), antibody/affinity capture reagents (Ab), and bioinformatics-driven knowledge base (KB). To meet this objective, the Chromosome-Centric Human Proteome Project (C-HPP) proposes to build this catalog chromosome-by-chromosome ( www.c-hpp.org ) by focusing primarily on proteins that currently lack MS evidence or Ab detection. These are termed "missing proteins" by the HPP consortium. The lack of observation of a protein can be due to various factors including incorrect and incomplete gene annotation, low or restricted expression, or instability. neXtProt ( www.nextprot.org ) is a new web-based knowledge platform specific for human proteins that aims to complement UniProtKB/Swiss-Prot ( www.uniprot.org ) with detailed information obtained from carefully selected high-throughput experiments on genomic variation, post-translational modifications, as well as protein expression in tissues and cells. This article describes how neXtProt contributes to prioritize C-HPP efforts and integrates C-HPP results with other research efforts to create a complete human proteome catalog.


Asunto(s)
Bases de Datos de Proteínas , Proteínas , Proteoma , Cromosomas Humanos , Biología Computacional , Genoma Humano , Humanos , Internet , Bases del Conocimiento , Espectrometría de Masas , Procesamiento Proteico-Postraduccional , Proteínas/genética , Proteínas/metabolismo
16.
J Proteome Res ; 12(1): 45-57, 2013 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-23259914

RESUMEN

We report progress assembling the parts list for chromosome 17 and illustrate the various processes that we have developed to integrate available data from diverse genomic and proteomic knowledge bases. As primary resources, we have used GPMDB, neXtProt, PeptideAtlas, Human Protein Atlas (HPA), and GeneCards. All sites share the common resource of Ensembl for the genome modeling information. We have defined the chromosome 17 parts list with the following information: 1169 protein-coding genes, the numbers of proteins confidently identified by various experimental approaches as documented in GPMDB, neXtProt, PeptideAtlas, and HPA, examples of typical data sets obtained by RNASeq and proteomic studies of epithelial derived tumor cell lines (disease proteome) and a normal proteome (peripheral mononuclear cells), reported evidence of post-translational modifications, and examples of alternative splice variants (ASVs). We have constructed a list of the 59 "missing" proteins as well as 201 proteins that have inconclusive mass spectrometric (MS) identifications. In this report we have defined a process to establish a baseline for the incorporation of new evidence on protein identification and characterization as well as related information from transcriptome analyses. This initial list of "missing" proteins that will guide the selection of appropriate samples for discovery studies as well as antibody reagents. Also we have illustrated the significant diversity of protein variants (including post-translational modifications, PTMs) using regions on chromosome 17 that contain important oncogenes. We emphasize the need for mandated deposition of proteomics data in public databases, the further development of improved PTM, ASV, and single nucleotide variant (SNV) databases, and the construction of Web sites that can integrate and regularly update such information. In addition, we describe the distribution of both clustered and scattered sets of protein families on the chromosome. Since chromosome 17 is rich in cancer-associated genes, we have focused the clustering of cancer-associated genes in such genomic regions and have used the ERBB2 amplicon as an example of the value of a proteogenomic approach in which one integrates transcriptomic with proteomic information and captures evidence of coexpression through coordinated regulation.


Asunto(s)
Cromosomas Humanos Par 17 , Genoma Humano , Proteínas , Proteómica , Secuencia de Aminoácidos , Cromosomas Humanos Par 17/genética , Cromosomas Humanos Par 17/metabolismo , Bases de Datos de Proteínas , Expresión Génica , Proyecto Genoma Humano , Humanos , Proteínas/clasificación , Proteínas/genética , Proteínas/metabolismo
17.
Brief Bioinform ; 12(5): 449-62, 2011 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-21873635

RESUMEN

The goal of the Gene Ontology (GO) project is to provide a uniform way to describe the functions of gene products from organisms across all kingdoms of life and thereby enable analysis of genomic data. Protein annotations are either based on experiments or predicted from protein sequences. Since most sequences have not been experimentally characterized, most available annotations need to be based on predictions. To make as accurate inferences as possible, the GO Consortium's Reference Genome Project is using an explicit evolutionary framework to infer annotations of proteins from a broad set of genomes from experimental annotations in a semi-automated manner. Most components in the pipeline, such as selection of sequences, building multiple sequence alignments and phylogenetic trees, retrieving experimental annotations and depositing inferred annotations, are fully automated. However, the most crucial step in our pipeline relies on software-assisted curation by an expert biologist. This curation tool, Phylogenetic Annotation and INference Tool (PAINT) helps curators to infer annotations among members of a protein family. PAINT allows curators to make precise assertions as to when functions were gained and lost during evolution and record the evidence (e.g. experimentally supported GO annotations and phylogenetic information including orthology) for those assertions. In this article, we describe how we use PAINT to infer protein function in a phylogenetic context with emphasis on its strengths, limitations and guidelines. We also discuss specific examples showing how PAINT annotations compare with those generated by other highly used homology-based methods.


Asunto(s)
Genómica/métodos , Anotación de Secuencia Molecular/métodos , Filogenia , Proteínas/química , Bases de Datos Genéticas , Genoma , Proteínas/genética
18.
Nucleic Acids Res ; 39(Database issue): D620-4, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21087999

RESUMEN

dictyBase (http://www.dictybase.org), the model organism database for Dictyostelium, aims to provide the broad biomedical research community with well integrated, high quality data and tools for Dictyostelium discoideum and related species. dictyBase houses the complete genome sequence, ESTs, and the entire body of literature relevant to Dictyostelium. This information is curated to provide accurate gene models and functional annotations, with the goal of fully annotating the genome to provide a 'reference genome' in the Amoebozoa clade. We highlight several new features in the present update: (i) new annotations; (ii) improved interface with web 2.0 functionality; (iii) the initial steps towards a genome portal for the Amoebozoa; (iv) ortholog display; and (v) the complete integration of the Dicty Stock Center with dictyBase.


Asunto(s)
Bases de Datos Genéticas , Dictyostelium/genética , Amebozoos/genética , Genoma de Protozoos , Internet , Anotación de Secuencia Molecular , Proteínas Protozoarias/química , Proteínas Protozoarias/genética , Integración de Sistemas
19.
Nucleic Acids Res ; 39(Database issue): D7-10, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21097465

RESUMEN

The present article proposes the adoption of a community-defined, uniform, generic description of the core attributes of biological databases, BioDBCore. The goals of these attributes are to provide a general overview of the database landscape, to encourage consistency and interoperability between resources and to promote the use of semantic and syntactic standards. BioDBCore will make it easier for users to evaluate the scope and relevance of available resources. This new resource will increase the collective impact of the information present in biological databases.


Asunto(s)
Bases de Datos Factuales/normas , Difusión de la Información
20.
Genetics ; 224(1)2023 05 04.
Artículo en Inglés | MEDLINE | ID: mdl-36866529

RESUMEN

The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project.


Asunto(s)
Bases de Datos Genéticas , Proteínas , Ontología de Genes , Proteínas/genética , Anotación de Secuencia Molecular , Biología Computacional
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA