RESUMEN
Transcription factors (TFs) and their specific interactions with targets are crucial for specifying gene-expression programs. To gain insights into the transcriptional regulatory networks in embryonic stem (ES) cells, we use chromatin immunoprecipitation coupled with ultra-high-throughput DNA sequencing (ChIP-seq) to map the locations of 13 sequence-specific TFs (Nanog, Oct4, STAT3, Smad1, Sox2, Zfx, c-Myc, n-Myc, Klf4, Esrrb, Tcfcp2l1, E2f1, and CTCF) and 2 transcription regulators (p300 and Suz12). These factors are known to play different roles in ES-cell biology as components of the LIF and BMP signaling pathways, self-renewal regulators, and key reprogramming factors. Our study provides insights into the integration of the signaling pathways into the ES-cell-specific transcription circuitries. Intriguingly, we find specific genomic regions extensively targeted by different TFs. Collectively, the comprehensive mapping of TF-binding sites identifies important features of the transcriptional regulatory networks that define ES-cell identity.
Asunto(s)
Células Madre Embrionarias/metabolismo , Redes Reguladoras de Genes , Transducción de Señal , Animales , Secuencia de Bases , Sitios de Unión , Inmunoprecipitación de Cromatina , Genoma , Factor 4 Similar a Kruppel , Ratones , Complejos Multiproteicos , Factores de Transcripción/metabolismoRESUMEN
BACKGROUND: Small bioinformatics databases, unlike institutionally funded large databases, are vulnerable to discontinuation and many reported in publications are no longer accessible. This leads to irreproducible scientific work and redundant effort, impeding the pace of scientific progress. RESULTS: We describe a Web-accessible system, available online at http://biodb100.apbionet.org, for archival and future on demand re-instantiation of small databases within minutes. Depositors can rebuild their databases by downloading a Linux live operating system (http://www.bioslax.com), preinstalled with bioinformatics and UNIX tools. The database and its dependencies can be compressed into an ".lzm" file for deposition. End-users can search for archived databases and activate them on dynamically re-instantiated BioSlax instances, run as virtual machines over the two popular full virtualization standard cloud-computing platforms, Xen Hypervisor or vSphere. The system is adaptable to increasing demand for disk storage or computational load and allows database developers to use the re-instantiated databases for integration and development of new databases. CONCLUSIONS: Herein, we demonstrate that a relatively inexpensive solution can be implemented for archival of bioinformatics databases and their rapid re-instantiation should the live databases disappear.
Asunto(s)
Biología Computacional/métodos , Bases de Datos Factuales , Internet , Archivos , Programas Informáticos , Interfaz Usuario-ComputadorRESUMEN
BACKGROUND: Effectors of Type III Secretion System (T3SS) play a pivotal role in establishing and maintaining pathogenicity in the host and therefore the identification of these effectors is important in understanding virulence. However, the effectors display high level of sequence diversity, therefore making the identification a difficult process. There is a need to collate and annotate existing effector sequences in public databases to enable systematic analyses of these sequences for development of models for screening and selection of putative novel effectors from bacterial genomes that can be validated by a smaller number of key experiments. RESULTS: Herein, we present T3SEdb http://effectors.bic.nus.edu.sg/T3SEdb, a specialized database of annotated T3SS effector (T3SE) sequences containing 1089 records from 46 bacterial species compiled from the literature and public protein databases. Procedures have been defined for i) comprehensive annotation of experimental status of effectors, ii) submission and curation review of records by users of the database, and iii) the regular update of T3SEdb existing and new records. Keyword fielded and sequence searches (BLAST, regular expression) are supported for both experimentally verified and hypothetical T3SEs. More than 171 clusters of T3SEs were detected based on sequence identity comparisons (intra-cluster difference up to ~60%). Owing to this high level of sequence diversity of T3SEs, the T3SEdb provides a large number of experimentally known effector sequences with wide species representation for creation of effector predictors. We created a reliable effector prediction tool, integrated into the database, to demonstrate the application of the database for such endeavours. CONCLUSIONS: T3SEdb is the first specialised database reported for T3SS effectors, enriched with manual annotations that facilitated systematic construction of a reliable prediction model for identification of novel effectors. The T3SEdb represents a platform for inclusion of additional annotations of metadata for future developments of sophisticated effector prediction models for screening and selection of putative novel effectors from bacterial genomes/proteomes that can be validated by a small number of key experiments.
Asunto(s)
Bacterias/genética , Bacterias/patogenicidad , Sistemas de Secreción Bacterianos/genética , Bases de Datos de Proteínas , Factores de Virulencia/genética , Algoritmos , Variación GenéticaRESUMEN
BACKGROUND: Notch signaling is an evolutionarily conserved developmental pathway. Zebrafish mind bomb (mib) mutants carry mutations on mib gene, which encodes a RING E3 ligase required for Notch activation via Delta/Jagged ubiquitylation and internalization. METHODOLOGY/PRINCIPAL FINDINGS: We examined the mib mutants for defects in pancreas development using in situ hybridization and GFP expression analysis of pancreas-specific GFP lines, carried out the global gene expression profile analysis of three different mib mutant alleles and validated the microarray data using real-time PCR and fluorescent double in situ hybridization. Our study showed that the mib mutants have diminished exocrine pancreas and this defect was most severe in mib(ta52b) followed by mib(m132) and then mib(tfi91), which is consistent with the compromised Notch activity found in corresponding mib mutant alleles. Global expression profile analysis of mib mutants showed that there is a significant difference in gene expression profile of wt and three mib mutant alleles. There are 91 differentially expressed genes that are common to all three mib alleles. Through detailed analysis of microarray data, we have identified several previously characterized genes and some putative Notch-responsive genes involved in pancreas development. Moreover, results from real-time PCR and fluorescent double in situ hybridization were largely consistent with microarray data. CONCLUSIONS/SIGNIFICANCE: This study provides, for the first time, a global gene expression profile in mib mutants generating useful genomic resources and providing an opportunity to identify the function of novel genes involved in Notch signaling and Notch-regulated developmental processes.
Asunto(s)
Alelos , Páncreas/anomalías , Receptores Notch/fisiología , Ubiquitina-Proteína Ligasas/genética , Proteínas de Pez Cebra/genética , Pez Cebra/genética , Animales , Perfilación de la Expresión Génica , Hibridación Fluorescente in Situ , Mutación , Reacción en Cadena de la PolimerasaRESUMEN
SUMMARY: Binding of short antigenic peptides to Major histocompatibility complex (MHC) proteins is the first step in T-cell mediated immune response. To understand the structural principles governing MHC-specific peptide recognition and binding, we have developed the MHC-Peptide Interaction Database (MPID), containing sequence-structure-function information. MPID (version 1.2) contains curated x-ray crystallographic data on 86 MHC peptide complexes, with precomputed interaction parameters (solvent accessibility, hydrogen bonds, gap volume and gap index). A user-friendly web interface and query tools will facilitate the development of predictive algorithms for MHC-peptide binding from a structural viewpoint. AVAILABILITY: Freely accessible from http://surya.bic.nus.edu.sg/mpid.