Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 38
Filtrar
1.
Nucleic Acids Res ; 52(D1): D255-D264, 2024 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-37971353

RESUMEN

RegulonDB is a database that contains the most comprehensive corpus of knowledge of the regulation of transcription initiation of Escherichia coli K-12, including data from both classical molecular biology and high-throughput methodologies. Here, we describe biological advances since our last NAR paper of 2019. We explain the changes to satisfy FAIR requirements. We also present a full reconstruction of the RegulonDB computational infrastructure, which has significantly improved data storage, retrieval and accessibility and thus supports a more intuitive and user-friendly experience. The integration of graphical tools provides clear visual representations of genetic regulation data, facilitating data interpretation and knowledge integration. RegulonDB version 12.0 can be accessed at https://regulondb.ccg.unam.mx.


Asunto(s)
Bases de Datos Genéticas , Escherichia coli K12 , Regulación Bacteriana de la Expresión Génica , Biología Computacional/métodos , Escherichia coli K12/genética , Internet , Transcripción Genética
2.
Nucleic Acids Res ; 47(D1): D212-D220, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30395280

RESUMEN

RegulonDB, first published 20 years ago, is a comprehensive electronic resource about regulation of transcription initiation of Escherichia coli K-12 with decades of knowledge from classic molecular biology experiments, and recently also from high-throughput genomic methodologies. We curated the literature to keep RegulonDB up to date, and initiated curation of ChIP and gSELEX experiments. We estimate that current knowledge describes between 10% and 30% of the expected total number of transcription factor- gene regulatory interactions in E. coli. RegulonDB provides datasets for interactions for which there is no evidence that they affect expression, as well as expression datasets. We developed a proof of concept pipeline to merge binding and expression evidence to identify regulatory interactions. These datasets can be visualized in the RegulonDB JBrowse. We developed the Microbial Conditions Ontology with a controlled vocabulary for the minimal properties to reproduce an experiment, which contributes to integrate data from high throughput and classic literature. At a higher level of integration, we report Genetic Sensory-Response Units for 200 transcription factors, including their regulation at the metabolic level, and include summaries for 70 of them. Finally, we summarize our research with Natural language processing strategies to enhance our biocuration work.


Asunto(s)
Biología Computacional/métodos , Escherichia coli K12/genética , Regulación Bacteriana de la Expresión Génica , Genómica , Ontología de Genes , Redes Reguladoras de Genes , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento
3.
Nucleic Acids Res ; 45(D1): D543-D550, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899573

RESUMEN

EcoCyc (EcoCyc.org) is a freely accessible, comprehensive database that collects and summarizes experimental data for Escherichia coli K-12, the best-studied bacterial model organism. New experimental discoveries about gene products, their function and regulation, new metabolic pathways, enzymes and cofactors are regularly added to EcoCyc. New SmartTable tools allow users to browse collections of related EcoCyc content. SmartTables can also serve as repositories for user- or curator-generated lists. EcoCyc now supports running and modifying E. coli metabolic models directly on the EcoCyc website.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Metabolismo Energético , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Regulación Bacteriana de la Expresión Génica , Redes y Vías Metabólicas , Transducción de Señal , Programas Informáticos , Factores de Transcripción/metabolismo , Navegador Web
4.
BMC Biol ; 16(1): 91, 2018 08 16.
Artículo en Inglés | MEDLINE | ID: mdl-30115066

RESUMEN

BACKGROUND: Our understanding of the regulation of gene expression has benefited from the availability of high-throughput technologies that interrogate the whole genome for the binding of specific transcription factors and gene expression profiles. In the case of widely used model organisms, such as Escherichia coli K-12, the new knowledge gained from these approaches needs to be integrated with the legacy of accumulated knowledge from genetic and molecular biology experiments conducted in the pre-genomic era in order to attain the deepest level of understanding possible based on the available data. RESULTS: In this paper, we describe an expansion of RegulonDB, the database containing the rich legacy of decades of classic molecular biology experiments supporting what we know about gene regulation and operon organization in E. coli K-12, to include the genome-wide dataset collections from 32 ChIP and 19 gSELEX publications, in addition to around 60 genome-wide expression profiles relevant to the functional significance of these datasets and used in their curation. Three essential features for the integration of this information coming from different methodological approaches are: first, a controlled vocabulary within an ontology for precisely defining growth conditions; second, the criteria to separate elements with enough evidence to consider them involved in gene regulation from isolated transcription factor binding sites without such support; and third, an expanded computational model supporting this knowledge. Altogether, this constitutes the basis for adequately gathering and enabling the comparisons and integration needed to manage and access such wealth of knowledge. CONCLUSIONS: This version 10.0 of RegulonDB is a first step toward what should become the unifying access point for current and future knowledge on gene regulation in E. coli K-12. Furthermore, this model platform and associated methodologies and criteria can be emulated for gathering knowledge on other microbial organisms.


Asunto(s)
Bases de Datos como Asunto , Escherichia coli K12/genética , Regulación Bacteriana de la Expresión Génica , Transcripción Genética
5.
Nucleic Acids Res ; 44(D1): D620-3, 2016 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-26586805

RESUMEN

COLOMBOS is a database that integrates publicly available transcriptomics data for several prokaryotic model organisms. Compared to the previous version it has more than doubled in size, both in terms of species and data available. The manually curated condition annotation has been overhauled as well, giving more complete information about samples' experimental conditions and their differences. Functionality-wise cross-species analyses now enable users to analyse expression data for all species simultaneously, and identify candidate genes with evolutionary conserved expression behaviour. All the expression-based query tools have undergone a substantial improvement, overcoming the limit of enforced co-expression data retrieval and instead enabling the return of more complex patterns of expression behaviour. COLOMBOS is freely available through a web application at http://colombos.net/. The complete database is also accessible via REST API or downloadable as tab-delimited text files.


Asunto(s)
Bases de Datos Genéticas , Perfilación de la Expresión Génica , Archaea/genética , Archaea/metabolismo , Bacterias/genética , Bacterias/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos , Análisis de Secuencia de ARN , Programas Informáticos
6.
Nucleic Acids Res ; 44(D1): D133-43, 2016 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-26527724

RESUMEN

RegulonDB (http://regulondb.ccg.unam.mx) is one of the most useful and important resources on bacterial gene regulation,as it integrates the scattered scientific knowledge of the best-characterized organism, Escherichia coli K-12, in a database that organizes large amounts of data. Its electronic format enables researchers to compare their results with the legacy of previous knowledge and supports bioinformatics tools and model building. Here, we summarize our progress with RegulonDB since our last Nucleic Acids Research publication describing RegulonDB, in 2013. In addition to maintaining curation up-to-date, we report a collection of 232 interactions with small RNAs affecting 192 genes, and the complete repertoire of 189 Elementary Genetic Sensory-Response units (GENSOR units), integrating the signal, regulatory interactions, and metabolic pathways they govern. These additions represent major progress to a higher level of understanding of regulated processes. We have updated the computationally predicted transcription factors, which total 304 (184 with experimental evidence and 120 from computational predictions); we updated our position-weight matrices and have included tools for clustering them in evolutionary families. We describe our semiautomatic strategy to accelerate curation, including datasets from high-throughput experiments, a novel coexpression distance to search for 'neighborhood' genes to known operons and regulons, and computational developments.


Asunto(s)
Bases de Datos Genéticas , Escherichia coli K12/genética , Regulación Bacteriana de la Expresión Génica , Regulón , Análisis por Conglomerados , Escherichia coli K12/metabolismo , Redes Reguladoras de Genes , Operón , Posición Específica de Matrices de Puntuación , ARN Pequeño no Traducido/metabolismo , Factores de Transcripción/clasificación
7.
Nucleic Acids Res ; 42(Database issue): D649-53, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24214998

RESUMEN

The COLOMBOS database (http://www.colombos.net) features comprehensive organism-specific cross-platform gene expression compendia of several bacterial model organisms and is supported by a fully interactive web portal and an extensive web API. COLOMBOS was originally published in PLoS One, and COLOMBOS v2.0 includes both an update of the expression data, by expanding the previously available compendia and by adding compendia for several new species, and an update of the surrounding functionality, with improved search and visualization options and novel tools for programmatic access to the database. The scope of the database has also been extended to incorporate RNA-seq data in our compendia by a dedicated analysis pipeline. We demonstrate the validity and robustness of this approach by comparing the same RNA samples measured in parallel using both microarrays and RNA-seq. As far as we know, COLOMBOS currently hosts the largest homogenized gene expression compendia available for seven bacterial model organisms.


Asunto(s)
Bacterias/genética , Bases de Datos Genéticas , Expresión Génica , Bacterias/metabolismo , Perfilación de la Expresión Génica , Internet , Análisis de Secuencia por Matrices de Oligonucleótidos , Análisis de Secuencia de ARN
8.
Nucleic Acids Res ; 41(Database issue): D605-12, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23143106

RESUMEN

EcoCyc (http://EcoCyc.org) is a model organism database built on the genome sequence of Escherichia coli K-12 MG1655. Expert manual curation of the functions of individual E. coli gene products in EcoCyc has been based on information found in the experimental literature for E. coli K-12-derived strains. Updates to EcoCyc content continue to improve the comprehensive picture of E. coli biology. The utility of EcoCyc is enhanced by new tools available on the EcoCyc web site, and the development of EcoCyc as a teaching tool is increasing the impact of the knowledge collected in EcoCyc.


Asunto(s)
Bases de Datos Genéticas , Escherichia coli K12/genética , Sitios de Unión , Escherichia coli K12/metabolismo , Proteínas de Escherichia coli/clasificación , Proteínas de Escherichia coli/metabolismo , Regulación Bacteriana de la Expresión Génica , Internet , Proteínas de Transporte de Membrana/clasificación , Proteínas de Transporte de Membrana/metabolismo , Modelos Genéticos , Anotación de Secuencia Molecular , Fenotipo , Posición Específica de Matrices de Puntuación , Regiones Promotoras Genéticas , Biología de Sistemas , Factores de Transcripción/metabolismo , Transcripción Genética
9.
Nucleic Acids Res ; 41(Database issue): D203-13, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23203884

RESUMEN

This article summarizes our progress with RegulonDB (http://regulondb.ccg.unam.mx/) during the past 2 years. We have kept up-to-date the knowledge from the published literature regarding transcriptional regulation in Escherichia coli K-12. We have maintained and expanded our curation efforts to improve the breadth and quality of the encoded experimental knowledge, and we have implemented criteria for the quality of our computational predictions. Regulatory phrases now provide high-level descriptions of regulatory regions. We expanded the assignment of quality to various sources of evidence, particularly for knowledge generated through high-throughput (HT) technology. Based on our analysis of most relevant methods, we defined rules for determining the quality of evidence when multiple independent sources support an entry. With this latest release of RegulonDB, we present a new highly reliable larger collection of transcription start sites, a result of our experimental HT genome-wide efforts. These improvements, together with several novel enhancements (the tracks display, uploading format and curational guidelines), address the challenges of incorporating HT-generated knowledge into RegulonDB. Information on the evolutionary conservation of regulatory elements is also available now. Altogether, RegulonDB version 8.0 is a much better home for integrating knowledge on gene regulation from the sources of information currently available.


Asunto(s)
Bases de Datos Genéticas , Escherichia coli K12/genética , Regulación Bacteriana de la Expresión Génica , Elementos Reguladores de la Transcripción , Transcripción Genética , Proteínas Bacterianas/metabolismo , Bases de Datos Genéticas/normas , Evolución Molecular , Genómica , Internet , Regiones Promotoras Genéticas , Regulón , Proteínas Represoras/metabolismo , Análisis de Secuencia de ARN , Factores de Transcripción/metabolismo , Sitio de Iniciación de la Transcripción
10.
Front Genet ; 15: 1353553, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38505828

RESUMEN

Post-genomic implementations have expanded the experimental strategies to identify elements involved in the regulation of transcription initiation. Here, we present for the first time a detailed analysis of the sources of knowledge supporting the collection of transcriptional regulatory interactions (RIs) of Escherichia coli K-12. An RI groups the transcription factor, its effect (positive or negative) and the regulated target, a promoter, a gene or transcription unit. We improved the evidence codes so that specific methods are incorporated and classified into independent groups. On this basis we updated the computation of confidence levels, weak, strong, or confirmed, for the collection of RIs. These updates enabled us to map the RI set to the current collection of HT TF-binding datasets from ChIP-seq, ChIP-exo, gSELEX and DAP-seq in RegulonDB, enriching in this way the evidence of close to one-quarter (1329) of RIs from the current total 5446 RIs. Based on the new computational capabilities of our improved annotation of evidence sources, we can now analyze the internal architecture of evidence, their categories (experimental, classical, HT, computational), and confidence levels. This is how we know that the joint contribution of HT and computational methods increase the overall fraction of reliable RIs (the sum of confirmed and strong evidence) from 49% to 71%. Thus, the current collection has 3912 reliable RIs, with 2718 or 70% of them with classical evidence which can be used to benchmark novel HT methods. Users can selectively exclude the method they want to benchmark, or keep for instance only the confirmed interactions. The recovery of regulatory sites in RegulonDB by the different HT methods ranges between 33% by ChIP-exo to 76% by ChIP-seq although as discussed, many potential confounding factors limit their interpretation. The collection of improvements reported here provides a solid foundation to incorporate new methods and data, and to further integrate the diverse sources of knowledge of the different components of the transcriptional regulatory network. There is no other genomic database that offers this comprehensive high-quality architecture of knowledge supporting a corpus of transcriptional regulatory interactions.

11.
Nucleic Acids Res ; 39(Database issue): D583-90, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21097882

RESUMEN

EcoCyc (http://EcoCyc.org) is a comprehensive model organism database for Escherichia coli K-12 MG1655. From the scientific literature, EcoCyc captures the functions of individual E. coli gene products; their regulation at the transcriptional, post-transcriptional and protein level; and their organization into operons, complexes and pathways. EcoCyc users can search and browse the information in multiple ways. Recent improvements to the EcoCyc Web interface include combined gene/protein pages and a Regulation Summary Diagram displaying a graphical overview of all known regulatory inputs to gene expression and protein activity. The graphical representation of signal transduction pathways has been updated, and the cellular and regulatory overviews were enhanced with new functionality. A specialized undergraduate teaching resource using EcoCyc is being developed.


Asunto(s)
Bases de Datos Genéticas , Escherichia coli K12/fisiología , Sitios de Unión , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Regulación Bacteriana de la Expresión Génica , Transducción de Señal , Programas Informáticos , Factores de Transcripción/metabolismo , Transcripción Genética , Interfaz Usuario-Computador
12.
Nucleic Acids Res ; 39(Database issue): D98-105, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21051347

RESUMEN

RegulonDB (http://regulondb.ccg.unam.mx/) is the primary reference database of the best-known regulatory network of any free-living organism, that of Escherichia coli K-12. The major conceptual change since 3 years ago is an expanded biological context so that transcriptional regulation is now part of a unit that initiates with the signal and continues with the signal transduction to the core of regulation, modifying expression of the affected target genes responsible for the response. We call these genetic sensory response units, or Gensor Units. We have initiated their high-level curation, with graphic maps and superreactions with links to other databases. Additional connectivity uses expandable submaps. RegulonDB has summaries for every transcription factor (TF) and TF-binding sites with internal symmetry. Several DNA-binding motifs and their sizes have been redefined and relocated. In addition to data from the literature, we have incorporated our own information on transcription start sites (TSSs) and transcriptional units (TUs), obtained by using high-throughput whole-genome sequencing technologies. A new portable drawing tool for genomic features is also now available, as well as new ways to download the data, including web services, files for several relational database manager systems and text files including BioPAX format.


Asunto(s)
Bases de Datos Genéticas , Escherichia coli K12/genética , Regulación Bacteriana de la Expresión Génica , Redes Reguladoras de Genes , Factores de Transcripción/metabolismo , Sitios de Unión , Escherichia coli K12/metabolismo , Transducción de Señal , Integración de Sistemas , Sitio de Iniciación de la Transcripción , Transcripción Genética
13.
bioRxiv ; 2023 Dec 11.
Artículo en Inglés | MEDLINE | ID: mdl-37163020

RESUMEN

Post-genomic implementations have expanded the experimental strategies to identify elements involved in the regulation of transcription initiation. As new methodologies emerge, a natural step is to compare their results with those from established methodologies, such as the classic methods of molecular biology used to characterize transcription factor binding sites, promoters, or transcription units. In the case of Escherichia coli K-12, the best-studied microorganism, for the last 30 years we have continuously gathered such knowledge from original scientific publications, and have organized it in two databases, RegulonDB and EcoCyc. Furthermore, since RegulonDB version 11.0 (1), we offer comprehensive datasets of binding sites from chromatin immunoprecipitation combined with sequencing (ChIP-seq), ChIP combined with exonuclease digestion and next-generation sequencing (ChIP-exo), genomic SELEX screening (gSELEX), and DNA affinity purification sequencing (DAP-seq) HT technologies, as well as additional datasets for transcription start sites, transcription units and RNA sequencing (RNA-seq) expression profiles. Here, we present for the first time an analysis of the sources of knowledge supporting the collection of transcriptional regulatory interactions (RIs) of E. coli K-12. An RI is formed by the transcription factor, its positive or negative effect on a promoter, a gene or transcription unit. We improved the evidence codes so that the specific methods are described, and we classified them into seven independent groups. This is the basis for our updated computation of confidence levels, weak, strong, or confirmed, for the collection of RIs. We compare the confidence levels of the RI collection before and after adding HT evidence illustrating how knowledge will change as more HT data and methods appear in the future. Users can generate subsets filtering out the method they want to benchmark and avoid circularity, or keep for instance only the confirmed interactions. The comparison of different HT methods with the available datasets indicate that ChIP-seq recovers the highest fraction (>70%) of binding sites present in RegulonDB followed by gSELEX, DAP-seq and ChIP-exo. There is no other genomic database that offers this comprehensive high-quality anatomy of evidence supporting a corpus of transcriptional regulatory interactions.

14.
EcoSal Plus ; 11(1): eesp00022023, 2023 Dec 12.
Artículo en Inglés | MEDLINE | ID: mdl-37220074

RESUMEN

EcoCyc is a bioinformatics database available online at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on the regulation of gene expression, E. coli gene essentiality, and nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for the analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed online. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. Data generated from a whole-cell model that is parameterized from the latest data on EcoCyc are also available. This review outlines the data content of EcoCyc and of the procedures by which this content is generated.


Asunto(s)
Escherichia coli K12 , Proteínas de Escherichia coli , Escherichia coli/genética , Escherichia coli/metabolismo , Escherichia coli K12/genética , Bases de Datos Genéticas , Programas Informáticos , Biología Computacional , Proteínas de Escherichia coli/metabolismo
15.
Front Bioeng Biotechnol ; 10: 823240, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35237580

RESUMEN

In free-living bacteria, the ability to regulate gene expression is at the core of adapting and interacting with the environment. For these systems to have a logic, a signal must trigger a genetic change that helps the cell to deal with what implies its presence in the environment; briefly, the response is expected to include a feedback to the signal. Thus, it makes sense to think of genetic sensory mechanisms of gene regulation. Escherichia coli K-12 is the bacterium model for which the largest number of regulatory systems and its sensing capabilities have been studied in detail at the molecular level. In this special issue focused on biomolecular sensing systems, we offer an overview of the transcriptional regulatory corpus of knowledge for E. coli that has been gathered in our database, RegulonDB, from the perspective of sensing regulatory systems. Thus, we start with the beginning of the information flux, which is the signal's chemical or physical elements detected by the cell as changes in the environment; these signals are internally transduced to transcription factors and alter their conformation. Signals transduced to effectors bind allosterically to transcription factors, and this defines the dominant sensing mechanism in E. coli. We offer an updated list of the repertoire of known allosteric effectors, as well as a list of the currently known different mechanisms of this sensing capability. Our previous definition of elementary genetic sensory-response units, GENSOR units for short, that integrate signals, transport, gene regulation, and the biochemical response of the regulated gene products of a given transcriptional factor fit perfectly with the purpose of this overview. We summarize the functional heterogeneity of their response, based on our updated collection of GENSORs, and we use them to identify the expected feedback as part of their response. Finally, we address the question of multiple sensing in the regulatory network of E. coli. This overview introduces the architecture of sensing and regulation of native components in E.coli K-12, which might be a source of inspiration to bioengineering applications.

16.
Microb Genom ; 8(5)2022 05.
Artículo en Inglés | MEDLINE | ID: mdl-35584008

RESUMEN

Genomics has set the basis for a variety of methodologies that produce high-throughput datasets identifying the different players that define gene regulation, particularly regulation of transcription initiation and operon organization. These datasets are available in public repositories, such as the Gene Expression Omnibus, or ArrayExpress. However, accessing and navigating such a wealth of data is not straightforward. No resource currently exists that offers all available high and low-throughput data on transcriptional regulation in Escherichia coli K-12 to easily use both as whole datasets, or as individual interactions and regulatory elements. RegulonDB (https://regulondb.ccg.unam.mx) began gathering high-throughput dataset collections in 2009, starting with transcription start sites, then adding ChIP-seq and gSELEX in 2012, with up to 99 different experimental high-throughput datasets available in 2019. In this paper we present a radical upgrade to more than 2000 high-throughput datasets, processed to facilitate their comparison, introducing up-to-date collections of transcription termination sites, transcription units, as well as transcription factor binding interactions derived from ChIP-seq, ChIP-exo, gSELEX and DAP-seq experiments, besides expression profiles derived from RNA-seq experiments. For ChIP-seq experiments we offer both the data as presented by the authors, as well as data uniformly processed in-house, enhancing their comparability, as well as the traceability of the methods and reproducibility of the results. Furthermore, we have expanded the tools available for browsing and visualization across and within datasets. We include comparisons against previously existing knowledge in RegulonDB from classic experiments, a nucleotide-resolution genome viewer, and an interface that enables users to browse datasets by querying their metadata. A particular effort was made to automatically extract detailed experimental growth conditions by implementing an assisted curation strategy applying Natural language processing and machine learning. We provide summaries with the total number of interactions found in each experiment, as well as tools to identify common results among different experiments. This is a long-awaited resource to make use of such wealth of knowledge and advance our understanding of the biology of the model bacterium E. coli K-12.


Asunto(s)
Escherichia coli K12 , Escherichia coli , Escherichia coli/genética , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Regulación Bacteriana de la Expresión Génica , Operón/genética , Reproducibilidad de los Resultados
17.
Microbiology (Reading) ; 157(Pt 5): 1393-1401, 2011 May.
Artículo en Inglés | MEDLINE | ID: mdl-21310789

RESUMEN

The stationary-phase response mediated by the RpoS sigma factor (σ(S), σ³8) has been widely studied as a general mechanism of activation of highly diverse genes that maintain cell viability. In bacteria, genes for diverse functions have been associated with this response, showing that bacteria use a large number of functions to contend with adverse conditions in their environment. However, little is known about how the genes have been functionally recruited in diverse organisms. In this work, we address the analysis of genes regulated by σ(S), based on a comparative genomic-scale analysis considering four versatile bacterial species that represent different lifestyles and taxonomic groups, Escherichia coli K-12, Geobacter sulfurreducens, Borrelia burgdorferi and Bacillus subtilis, as well as the extent of conservation in bacterial genomes, as a means of assessing the evolution of this sigmulon across all organisms completely sequenced. The analysis presented here shows that genes associated with the σ(S) response have been recruited from diverse regulons to achieve a global response. In addition, and based on the distribution of orthologues, we show a group of genes that is highly conserved among all organisms, mainly associated with glycerol metabolism, as well as diverse functional genes recruited in a lineage-specific manner.


Asunto(s)
Bacterias/genética , Proteínas Bacterianas/genética , Evolución Molecular , Variación Genética , Genoma Bacteriano , Factor sigma/genética , Bacterias/clasificación , Bacterias/metabolismo , Proteínas Bacterianas/metabolismo , Regulación Bacteriana de la Expresión Génica , Factor sigma/metabolismo
18.
Nucleic Acids Res ; 37(Database issue): D464-70, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18974181

RESUMEN

EcoCyc (http://EcoCyc.org) provides a comprehensive encyclopedia of Escherichia coli biology. EcoCyc integrates information about the genome, genes and gene products; the metabolic network; and the regulatory network of E. coli. Recent EcoCyc developments include a new initiative to represent and curate all types of E. coli regulatory processes such as attenuation and regulation by small RNAs. EcoCyc has started to curate Gene Ontology (GO) terms for E. coli and has made a dataset of E. coli GO terms available through the GO Web site. The curation and visualization of electron transfer processes has been significantly improved. Other software and Web site enhancements include the addition of tracks to the EcoCyc genome browser, in particular a type of track designed for the display of ChIP-chip datasets, and the development of a comparative genome browser. A new Genome Omics Viewer enables users to paint omics datasets onto the full E. coli genome for analysis. A new advanced query page guides users in interactively constructing complex database queries against EcoCyc. A Macintosh version of EcoCyc is now available. A series of Webinars is available to instruct users in the use of EcoCyc.


Asunto(s)
Bases de Datos Genéticas , Escherichia coli/genética , Escherichia coli/metabolismo , Membrana Celular/enzimología , Transporte de Electrón , Escherichia coli/enzimología , Regulación Bacteriana de la Expresión Génica , Genes Bacterianos , Genoma Bacteriano , Genómica , Internet , Programas Informáticos , Transcripción Genética
19.
Biochim Biophys Acta Gene Regul Mech ; 1864(11-12): 194753, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34461312

RESUMEN

The number of published papers in biomedical research makes it rather impossible for a researcher to keep up to date. This is where manually curated databases contribute facilitating the access to knowledge. However, the structure required by databases strongly limits the type of valuable information that can be incorporated. Here, we present Lisen&Curate, a curation system that facilitates linking sentences or part of sentences (both considered sources) in articles with their corresponding curated objects, so that rich additional information of these objects is easily available to users. These sources are going to be offered both within RegulonDB and a new database, L-Regulon. To show the relevance of our work, two senior curators performed a curation of 31 articles on the regulation of transcription initiation of E. coli using Lisen&Curate. As a result, 194 objects were curated and 781 sources were recorded. We also found that these sources are useful to develop automatic approaches to detect objects in articles by observing word frequency patterns and by carrying out an open information extraction task. Sources may help to elaborate a controlled vocabulary of experimental methods. Finally, we discuss our ecosystem of interconnected applications, RegulonDB, L-Regulon, and Lisen&Curate, to facilitate the access to knowledge on regulation of transcription initiation in bacteria. We see our proposal as the starting point to change the way experimentalists connect a piece of knowledge with its evidence using RegulonDB.


Asunto(s)
Curaduría de Datos/métodos , Bases de Datos Genéticas , Regulación Bacteriana de la Expresión Génica , Iniciación de la Transcripción Genética , Escherichia coli/genética
20.
Front Microbiol ; 12: 711077, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34394059

RESUMEN

The EcoCyc model-organism database collects and summarizes experimental data for Escherichia coli K-12. EcoCyc is regularly updated by the manual curation of individual database entries, such as genes, proteins, and metabolic pathways, and by the programmatic addition of results from select high-throughput analyses. Updates to the Pathway Tools software that supports EcoCyc and to the web interface that enables user access have continuously improved its usability and expanded its functionality. This article highlights recent improvements to the curated data in the areas of metabolism, transport, DNA repair, and regulation of gene expression. New and revised data analysis and visualization tools include an interactive metabolic network explorer, a circular genome viewer, and various improvements to the speed and usability of existing tools.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA