1.
Nucleic Acids Res ; 52(D1): D255-D264, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37971353

ABSTRACT

RegulonDB is a database that contains the most comprehensive corpus of knowledge of the regulation of transcription initiation of Escherichia coli K-12, including data from both classical molecular biology and high-throughput methodologies. Here, we describe biological advances since our last NAR paper of 2019. We explain the changes to satisfy FAIR requirements. We also present a full reconstruction of the RegulonDB computational infrastructure, which has significantly improved data storage, retrieval and accessibility and thus supports a more intuitive and user-friendly experience. The integration of graphical tools provides clear visual representations of genetic regulation data, facilitating data interpretation and knowledge integration. RegulonDB version 12.0 can be accessed at https://regulondb.ccg.unam.mx.


Subjects
Databases, Genetic; Escherichia coli K12; Gene Expression Regulation, Bacterial; Computational Biology/methods; Escherichia coli K12/genetics; Internet; Transcription, Genetic
2.
Nucleic Acids Res ; 47(D1): D212-D220, 2019 Jan 08.
Article in English | MEDLINE | ID: mdl-30395280

ABSTRACT

RegulonDB, first published 20 years ago, is a comprehensive electronic resource about regulation of transcription initiation of Escherichia coli K-12 with decades of knowledge from classic molecular biology experiments, and recently also from high-throughput genomic methodologies. We curated the literature to keep RegulonDB up to date, and initiated curation of ChIP and gSELEX experiments. We estimate that current knowledge describes between 10% and 30% of the expected total number of transcription factor-gene regulatory interactions in E. coli. RegulonDB provides datasets for interactions for which there is no evidence that they affect expression, as well as expression datasets. We developed a proof-of-concept pipeline to merge binding and expression evidence to identify regulatory interactions. These datasets can be visualized in the RegulonDB JBrowse. We developed the Microbial Conditions Ontology with a controlled vocabulary for the minimal properties to reproduce an experiment, which helps to integrate data from high-throughput and classic literature. At a higher level of integration, we report Genetic Sensory-Response Units for 200 transcription factors, including their regulation at the metabolic level, and include summaries for 70 of them. Finally, we summarize our research with natural language processing strategies to enhance our biocuration work.


Subjects
Computational Biology/methods; Escherichia coli K12/genetics; Gene Expression Regulation, Bacterial; Genomics; Gene Ontology; Gene Regulatory Networks; Genomics/methods; High-Throughput Nucleotide Sequencing
3.
Nucleic Acids Res ; 45(D1): D543-D550, 2017 Jan 04.
Article in English | MEDLINE | ID: mdl-27899573

ABSTRACT

EcoCyc (EcoCyc.org) is a freely accessible, comprehensive database that collects and summarizes experimental data for Escherichia coli K-12, the best-studied bacterial model organism. New experimental discoveries about gene products, their function and regulation, new metabolic pathways, enzymes and cofactors are regularly added to EcoCyc. New SmartTable tools allow users to browse collections of related EcoCyc content. SmartTables can also serve as repositories for user- or curator-generated lists. EcoCyc now supports running and modifying E. coli metabolic models directly on the EcoCyc website.


Subjects
Computational Biology/methods; Databases, Genetic; Escherichia coli K12/genetics; Escherichia coli K12/metabolism; Energy Metabolism; Escherichia coli Proteins/genetics; Escherichia coli Proteins/metabolism; Gene Expression Regulation, Bacterial; Metabolic Networks and Pathways; Signal Transduction; Software; Transcription Factors/metabolism; Web Browser
4.
BMC Biol ; 16(1): 91, 2018 Aug 16.
Article in English | MEDLINE | ID: mdl-30115066

ABSTRACT

BACKGROUND: Our understanding of the regulation of gene expression has benefited from the availability of high-throughput technologies that interrogate the whole genome for the binding of specific transcription factors and gene expression profiles. In the case of widely used model organisms, such as Escherichia coli K-12, the new knowledge gained from these approaches needs to be integrated with the legacy of accumulated knowledge from genetic and molecular biology experiments conducted in the pre-genomic era in order to attain the deepest level of understanding possible based on the available data. RESULTS: In this paper, we describe an expansion of RegulonDB, the database containing the rich legacy of decades of classic molecular biology experiments supporting what we know about gene regulation and operon organization in E. coli K-12, to include the genome-wide dataset collections from 32 ChIP and 19 gSELEX publications, in addition to around 60 genome-wide expression profiles relevant to the functional significance of these datasets and used in their curation. Three essential features for the integration of this information coming from different methodological approaches are: first, a controlled vocabulary within an ontology for precisely defining growth conditions; second, the criteria to separate elements with enough evidence to consider them involved in gene regulation from isolated transcription factor binding sites without such support; and third, an expanded computational model supporting this knowledge. Altogether, this constitutes the basis for adequately gathering and enabling the comparisons and integration needed to manage and access such wealth of knowledge. CONCLUSIONS: This version 10.0 of RegulonDB is a first step toward what should become the unifying access point for current and future knowledge on gene regulation in E. coli K-12. Furthermore, this model platform and associated methodologies and criteria can be emulated for gathering knowledge on other microbial organisms.


Subjects
Databases as Topic; Escherichia coli K12/genetics; Gene Expression Regulation, Bacterial; Transcription, Genetic
5.
Nucleic Acids Res ; 44(D1): D133-43, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26527724

ABSTRACT

RegulonDB (http://regulondb.ccg.unam.mx) is one of the most useful and important resources on bacterial gene regulation, as it integrates the scattered scientific knowledge of the best-characterized organism, Escherichia coli K-12, in a database that organizes large amounts of data. Its electronic format enables researchers to compare their results with the legacy of previous knowledge and supports bioinformatics tools and model building. Here, we summarize our progress with RegulonDB since our last Nucleic Acids Research publication describing RegulonDB, in 2013. In addition to maintaining curation up-to-date, we report a collection of 232 interactions with small RNAs affecting 192 genes, and the complete repertoire of 189 Elementary Genetic Sensory-Response units (GENSOR units), integrating the signal, regulatory interactions, and metabolic pathways they govern. These additions represent major progress to a higher level of understanding of regulated processes. We have updated the computationally predicted transcription factors, which total 304 (184 with experimental evidence and 120 from computational predictions); we updated our position-weight matrices and have included tools for clustering them in evolutionary families. We describe our semiautomatic strategy to accelerate curation, including datasets from high-throughput experiments, a novel coexpression distance to search for 'neighborhood' genes to known operons and regulons, and computational developments.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Gene Expression Regulation, Bacterial; Regulon; Cluster Analysis; Escherichia coli K12/metabolism; Gene Regulatory Networks; Operon; Position-Specific Scoring Matrices; RNA, Small Untranslated/metabolism; Transcription Factors/classification
6.
Nucleic Acids Res ; 41(Database issue): D605-12, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23143106

ABSTRACT

EcoCyc (http://EcoCyc.org) is a model organism database built on the genome sequence of Escherichia coli K-12 MG1655. Expert manual curation of the functions of individual E. coli gene products in EcoCyc has been based on information found in the experimental literature for E. coli K-12-derived strains. Updates to EcoCyc content continue to improve the comprehensive picture of E. coli biology. The utility of EcoCyc is enhanced by new tools available on the EcoCyc web site, and the development of EcoCyc as a teaching tool is increasing the impact of the knowledge collected in EcoCyc.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Binding Sites; Escherichia coli K12/metabolism; Escherichia coli Proteins/classification; Escherichia coli Proteins/metabolism; Gene Expression Regulation, Bacterial; Internet; Membrane Transport Proteins/classification; Membrane Transport Proteins/metabolism; Models, Genetic; Molecular Sequence Annotation; Phenotype; Position-Specific Scoring Matrices; Promoter Regions, Genetic; Systems Biology; Transcription Factors/metabolism; Transcription, Genetic
7.
Nucleic Acids Res ; 41(Database issue): D203-13, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23203884

ABSTRACT

This article summarizes our progress with RegulonDB (http://regulondb.ccg.unam.mx/) during the past 2 years. We have kept up-to-date the knowledge from the published literature regarding transcriptional regulation in Escherichia coli K-12. We have maintained and expanded our curation efforts to improve the breadth and quality of the encoded experimental knowledge, and we have implemented criteria for the quality of our computational predictions. Regulatory phrases now provide high-level descriptions of regulatory regions. We expanded the assignment of quality to various sources of evidence, particularly for knowledge generated through high-throughput (HT) technology. Based on our analysis of most relevant methods, we defined rules for determining the quality of evidence when multiple independent sources support an entry. With this latest release of RegulonDB, we present a new highly reliable larger collection of transcription start sites, a result of our experimental HT genome-wide efforts. These improvements, together with several novel enhancements (the tracks display, uploading format and curational guidelines), address the challenges of incorporating HT-generated knowledge into RegulonDB. Information on the evolutionary conservation of regulatory elements is also available now. Altogether, RegulonDB version 8.0 is a much better home for integrating knowledge on gene regulation from the sources of information currently available.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Gene Expression Regulation, Bacterial; Regulatory Elements, Transcriptional; Transcription, Genetic; Bacterial Proteins/metabolism; Databases, Genetic/standards; Evolution, Molecular; Genomics; Internet; Promoter Regions, Genetic; Regulon; Repressor Proteins/metabolism; Sequence Analysis, RNA; Transcription Factors/metabolism; Transcription Initiation Site
8.
Nucleic Acids Res ; 39(Database issue): D583-90, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21097882

ABSTRACT

EcoCyc (http://EcoCyc.org) is a comprehensive model organism database for Escherichia coli K-12 MG1655. From the scientific literature, EcoCyc captures the functions of individual E. coli gene products; their regulation at the transcriptional, post-transcriptional and protein level; and their organization into operons, complexes and pathways. EcoCyc users can search and browse the information in multiple ways. Recent improvements to the EcoCyc Web interface include combined gene/protein pages and a Regulation Summary Diagram displaying a graphical overview of all known regulatory inputs to gene expression and protein activity. The graphical representation of signal transduction pathways has been updated, and the cellular and regulatory overviews were enhanced with new functionality. A specialized undergraduate teaching resource using EcoCyc is being developed.


Subjects
Databases, Genetic; Escherichia coli K12/physiology; Binding Sites; Escherichia coli K12/genetics; Escherichia coli K12/metabolism; Escherichia coli Proteins/chemistry; Escherichia coli Proteins/metabolism; Gene Expression Regulation, Bacterial; Signal Transduction; Software; Transcription Factors/metabolism; Transcription, Genetic; User-Computer Interface
9.
Nucleic Acids Res ; 39(Database issue): D98-105, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21051347

ABSTRACT

RegulonDB (http://regulondb.ccg.unam.mx/) is the primary reference database of the best-known regulatory network of any free-living organism, that of Escherichia coli K-12. The major conceptual change since 3 years ago is an expanded biological context so that transcriptional regulation is now part of a unit that initiates with the signal and continues with the signal transduction to the core of regulation, modifying expression of the affected target genes responsible for the response. We call these genetic sensory response units, or Gensor Units. We have initiated their high-level curation, with graphic maps and superreactions with links to other databases. Additional connectivity uses expandable submaps. RegulonDB has summaries for every transcription factor (TF) and TF-binding sites with internal symmetry. Several DNA-binding motifs and their sizes have been redefined and relocated. In addition to data from the literature, we have incorporated our own information on transcription start sites (TSSs) and transcriptional units (TUs), obtained by using high-throughput whole-genome sequencing technologies. A new portable drawing tool for genomic features is also now available, as well as new ways to download the data, including web services, files for several relational database manager systems and text files including BioPAX format.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Gene Expression Regulation, Bacterial; Gene Regulatory Networks; Transcription Factors/metabolism; Binding Sites; Escherichia coli K12/metabolism; Signal Transduction; Systems Integration; Transcription Initiation Site; Transcription, Genetic
10.
EcoSal Plus ; 11(1): eesp00022023, 2023 Dec 12.
Article in English | MEDLINE | ID: mdl-37220074

ABSTRACT

EcoCyc is a bioinformatics database available online at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on the regulation of gene expression, E. coli gene essentiality, and nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for the analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed online. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. Data generated from a whole-cell model that is parameterized from the latest data on EcoCyc are also available. This review outlines the data content of EcoCyc and of the procedures by which this content is generated.


Subjects
Escherichia coli K12; Escherichia coli Proteins; Escherichia coli/genetics; Escherichia coli/metabolism; Escherichia coli K12/genetics; Databases, Genetic; Software; Computational Biology; Escherichia coli Proteins/metabolism
11.
Microbiology (Reading) ; 157(Pt 5): 1393-1401, 2011 May.
Article in English | MEDLINE | ID: mdl-21310789

ABSTRACT

The stationary-phase response mediated by the RpoS sigma factor (σ(S), σ(38)) has been widely studied as a general mechanism of activation of highly diverse genes that maintain cell viability. In bacteria, genes for diverse functions have been associated with this response, showing that bacteria use a large number of functions to contend with adverse conditions in their environment. However, little is known about how the genes have been functionally recruited in diverse organisms. In this work, we address the analysis of genes regulated by σ(S), based on a comparative genomic-scale analysis considering four versatile bacterial species that represent different lifestyles and taxonomic groups, Escherichia coli K-12, Geobacter sulfurreducens, Borrelia burgdorferi and Bacillus subtilis, as well as the extent of conservation in bacterial genomes, as a means of assessing the evolution of this sigmulon across all organisms completely sequenced. The analysis presented here shows that genes associated with the σ(S) response have been recruited from diverse regulons to achieve a global response. In addition, and based on the distribution of orthologues, we show a group of genes that is highly conserved among all organisms, mainly associated with glycerol metabolism, as well as diverse functional genes recruited in a lineage-specific manner.


Subjects
Bacteria/genetics; Bacterial Proteins/genetics; Evolution, Molecular; Genetic Variation; Genome, Bacterial; Sigma Factor/genetics; Bacteria/classification; Bacteria/metabolism; Bacterial Proteins/metabolism; Gene Expression Regulation, Bacterial; Sigma Factor/metabolism
12.
Nucleic Acids Res ; 37(Database issue): D464-70, 2009 Jan.
Article in English | MEDLINE | ID: mdl-18974181

ABSTRACT

EcoCyc (http://EcoCyc.org) provides a comprehensive encyclopedia of Escherichia coli biology. EcoCyc integrates information about the genome, genes and gene products; the metabolic network; and the regulatory network of E. coli. Recent EcoCyc developments include a new initiative to represent and curate all types of E. coli regulatory processes such as attenuation and regulation by small RNAs. EcoCyc has started to curate Gene Ontology (GO) terms for E. coli and has made a dataset of E. coli GO terms available through the GO Web site. The curation and visualization of electron transfer processes has been significantly improved. Other software and Web site enhancements include the addition of tracks to the EcoCyc genome browser, in particular a type of track designed for the display of ChIP-chip datasets, and the development of a comparative genome browser. A new Genome Omics Viewer enables users to paint omics datasets onto the full E. coli genome for analysis. A new advanced query page guides users in interactively constructing complex database queries against EcoCyc. A Macintosh version of EcoCyc is now available. A series of Webinars is available to instruct users in the use of EcoCyc.


Subjects
Databases, Genetic; Escherichia coli/genetics; Escherichia coli/metabolism; Cell Membrane/enzymology; Electron Transport; Escherichia coli/enzymology; Gene Expression Regulation, Bacterial; Genes, Bacterial; Genome, Bacterial; Genomics; Internet; Software; Transcription, Genetic
13.
Front Microbiol ; 12: 711077, 2021.
Article in English | MEDLINE | ID: mdl-34394059

ABSTRACT

The EcoCyc model-organism database collects and summarizes experimental data for Escherichia coli K-12. EcoCyc is regularly updated by the manual curation of individual database entries, such as genes, proteins, and metabolic pathways, and by the programmatic addition of results from select high-throughput analyses. Updates to the Pathway Tools software that supports EcoCyc and to the web interface that enables user access have continuously improved its usability and expanded its functionality. This article highlights recent improvements to the curated data in the areas of metabolism, transport, DNA repair, and regulation of gene expression. New and revised data analysis and visualization tools include an interactive metabolic network explorer, a circular genome viewer, and various improvements to the speed and usability of existing tools.

14.
Nucleic Acids Res ; 36(Database issue): D120-4, 2008 Jan.
Article in English | MEDLINE | ID: mdl-18158297

ABSTRACT

RegulonDB (http://regulondb.ccg.unam.mx/) is the primary reference database offering curated knowledge of the transcriptional regulatory network of Escherichia coli K12, currently the best-known electronically encoded database of the genetic regulatory network of any free-living organism. This paper summarizes the improvements, new biology and new features available in version 6.0. Curation of original literature is, from now on, up to date for every new release. All the objects are supported by their corresponding evidences, now classified as strong or weak. Transcription factors are classified by origin of their effectors and by gene ontology class. We have now computational predictions for sigma(54) and five different promoter types of the sigma(70) family, as well as their corresponding -10 and -35 boxes. In addition to those curated from the literature, we added about 300 experimentally mapped promoters coming from our own high-throughput mapping efforts. RegulonDB v.6.0 now expands beyond transcription initiation, including RNA regulatory elements, specifically riboswitches, attenuators and small RNAs, with their known associated targets. The data can be accessed through overviews of correlations about gene regulation. RegulonDB associated original literature, together with more than 4000 curation notes, can now be searched with the Textpresso text mining engine.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Gene Expression Regulation, Bacterial; Gene Regulatory Networks; Computational Biology; Internet; Models, Genetic; Promoter Regions, Genetic; Regulatory Sequences, Ribonucleic Acid; Regulon; Sigma Factor/metabolism; Software; Transcription Factors/metabolism; Transcription Initiation Site; Transcription, Genetic
15.
Nucleic Acids Res ; 35(22): 7577-90, 2007.
Article in English | MEDLINE | ID: mdl-17940092

ABSTRACT

The annotation of the Escherichia coli K-12 genome in the EcoCyc database is one of the most accurate, complete and multidimensional genome annotations. Of the 4460 E. coli genes, EcoCyc assigns biochemical functions to 76%, and 66% of all genes had their functions determined experimentally. EcoCyc assigns E. coli genes to Gene Ontology and to MultiFun. Seventy-five percent of gene products contain reviews authored by the EcoCyc project that summarize the experimental literature about the gene product. EcoCyc information was derived from 15 000 publications. The database contains extensive descriptions of E. coli cellular networks, describing its metabolic, transport and transcriptional regulatory processes. A comparison to genome annotations for other model organisms shows that the E. coli genome contains the most experimentally determined gene functions in both relative and absolute terms: 2941 (66%) for E. coli, 2319 (37%) for Saccharomyces cerevisiae, 1816 (5%) for Arabidopsis thaliana, 1456 (4%) for Mus musculus and 614 (4%) for Drosophila melanogaster. Database queries to EcoCyc survey the global properties of E. coli cellular networks and illuminate the extent of information gaps for E. coli, such as dead-end metabolites. EcoCyc provides a genome browser with novel properties, and a novel interactive display of transcriptional regulatory networks.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Genome, Bacterial; Computational Biology; Escherichia coli K12/metabolism; Escherichia coli Proteins/genetics; Escherichia coli Proteins/physiology; Gene Regulatory Networks; Genes, Bacterial; Software
16.
J Biomed Semantics ; 10(1): 8, 2019 May 22.
Article in English | MEDLINE | ID: mdl-31118102

ABSTRACT

BACKGROUND: The ability to express the same meaning in different ways is a well-known property of natural language. This amazing property is the source of major difficulties in natural language processing. Given the constant increase in published literature, its curation and information extraction would strongly benefit from efficient automatic processes, for which corpora of sentences evaluated by experts are a valuable resource. RESULTS: Given our interest in applying such approaches to the benefit of curation of the biomedical literature, specifically that about gene regulation in microbial organisms, we decided to build a corpus with graded textual similarity evaluated by curators and designed specifically for our purposes. Based on the predefined statistical power of future analyses, we defined features of the design, including sampling, selection criteria, balance, and size, among others. A non-fully crossed study design was applied. Each pair of sentences was evaluated by 3 annotators from a total of 7; the scale used in the semantic similarity assessment task within the Semantic Evaluation workshop (SEMEVAL) was adapted to our goals in four successive iterative sessions with clear improvements in the agreed guidelines and interrater reliability results. Alternatives for such a corpus evaluation have been widely discussed. CONCLUSIONS: To the best of our knowledge, this is the first similarity corpus (a dataset of pairs of sentences for which human experts rate the semantic similarity of each pair) in this domain of knowledge. We have initiated its incorporation in our research towards high-throughput curation strategies based on natural language processing.


Subjects
Gene Expression Regulation; Microbiology; Natural Language Processing; Transcription, Genetic/genetics
17.
Nucleic Acids Res ; 34(Database issue): D394-7, 2006 Jan 01.
Article in English | MEDLINE | ID: mdl-16381895

ABSTRACT

RegulonDB is the internationally recognized reference database of Escherichia coli K-12 offering curated knowledge of the regulatory network and operon organization. It is currently the largest electronically-encoded database of the regulatory network of any free-living organism. We present here the recently launched RegulonDB version 5.0 radically different in content, interface design and capabilities. Continuous curation of original scientific literature provides the evidence behind every single object and feature. This knowledge is complemented with comprehensive computational predictions across the complete genome. Literature-based and predicted data are clearly distinguished in the database. Starting with this version, RegulonDB public releases are synchronized with those of EcoCyc since our curation supports both databases. The complex biology of regulation is simplified in a navigation scheme based on three major streams: genes, operons and regulons. Regulatory knowledge is directly available in every navigation step. Displays combine graphic and textual information and are organized allowing different levels of detail and biological context. This knowledge is the backbone of an integrated system for the graphic display of the network, graphic and tabular microarray comparisons with curated and predicted objects, as well as predictions across bacterial genomes, and predicted networks of functionally related gene products. Access RegulonDB at http://regulondb.ccg.unam.mx.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Gene Expression Regulation, Bacterial; Operon; Regulon; Escherichia coli K12/growth & development; Genome, Bacterial; Internet; Software; Transcription, Genetic; User-Computer Interface
18.
EcoSal Plus ; 8(1), 2018 Nov.
Article in English | MEDLINE | ID: mdl-30406744

ABSTRACT

EcoCyc is a bioinformatics database available at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on E. coli gene essentiality and on nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed via EcoCyc.org. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. This review outlines the data content of EcoCyc and of the procedures by which this content is generated.


Subjects
Databases, Genetic; Escherichia coli K12/genetics; Genome, Bacterial; Software; Computational Biology; Escherichia coli Proteins/genetics; Escherichia coli Proteins/metabolism; Gene Expression Regulation, Bacterial; Internet; Metabolic Flux Analysis; Metabolic Networks and Pathways/genetics; User-Computer Interface
19.
BMC Bioinformatics ; 7: 5, 2006 Jan 06.
Article in English | MEDLINE | ID: mdl-16398937

ABSTRACT

BACKGROUND: Escherichia coli is the model organism for which our knowledge of its regulatory network is the most extensive. Over the last few years, our project has been collecting and curating the literature concerning E. coli transcription initiation and operons, providing in both the RegulonDB and EcoCyc databases the largest electronically encoded network available. A paper published recently by Ma et al. (2004) showed several differences in the versions of the network present in these two databases. Discrepancies have been corrected and annotations from this and other groups (Shen-Orr et al., 2002) have been added, making the RegulonDB and EcoCyc databases the largest comprehensive and constantly curated regulatory network of E. coli K-12. RESULTS: Several groups have been using these curated data as part of their bioinformatics and systems biology projects, in combination with external data obtained from other sources, thus enlarging the dataset initially obtained from either RegulonDB or EcoCyc of the E. coli K-12 regulatory network. We kindly obtained from the groups of Uri Alon and Hong-Wu Ma the interactions they have added to enrich their public versions of the E. coli regulatory network. These were used to search for original references and curate them with the same standards we use regularly, adding in several cases the original references (instead of reviews or missing references), as well as adding the corresponding experimental evidence codes. We also corrected all discrepancies in the two databases available as explained below. CONCLUSION: One hundred and fifty new interactions have been added to our databases as a result of this specific curation effort, in addition to those added as a result of our continuous curation work. RegulonDB gene names are now based on those of EcoCyc to avoid confusion due to gene names and synonyms, and the public releases of RegulonDB and EcoCyc are henceforth synchronized to avoid confusion due to different versions. Public flat files are available providing direct access to the regulatory network interactions, thus avoiding errors due to differences in database modelling and representation. The regulatory network available in RegulonDB and EcoCyc is the most comprehensive and regularly updated electronically-encoded regulatory network of E. coli K-12.


Subjects
Databases, Protein; Escherichia coli K12/metabolism; Escherichia coli Proteins/metabolism; Gene Expression Regulation, Bacterial/physiology; Models, Biological; Protein Interaction Mapping/methods; Signal Transduction/physiology; Computer Simulation
20.
Nucleic Acids Res ; 32(Database issue): D303-6, 2004 Jan 01.
Article in English | MEDLINE | ID: mdl-14681419

ABSTRACT

RegulonDB is the primary database of the major international maintained curation of original literature with experimental knowledge about the elements and interactions of the network of transcriptional regulation in Escherichia coli K-12. This includes mechanistic information about operon organization and their decomposition into transcription units (TUs), promoters and their sigma type, binding sites of specific transcriptional regulators (TRs), their organization into 'regulatory phrases', active and inactive conformations of TRs, as well as terminators and ribosome binding sites. The database is complemented with clearly marked computational predictions of TUs, promoters and binding sites of TRs. The current version has been expanded to include information beyond specific mechanisms aimed at gathering different growth conditions and the associated induced and/or repressed genes. RegulonDB is now linked with Swiss-Prot, with microarray databases, and with a suite of programs to analyze and visualize microarray experiments. We provide a summary of the biological knowledge contained in RegulonDB and describe the major changes in the design of the database. RegulonDB can be accessed on the web at the URL: http://www.cifn.unam.mx/Computational_Biology/regulondb/.


Subjects
Databases, Factual; Escherichia coli/growth & development; Escherichia coli/genetics; Gene Expression Regulation, Bacterial; Operon/genetics; Regulon/genetics; Transcription, Genetic; Databases, Genetic; Environment; Information Storage and Retrieval; Internet; User-Computer Interface