Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 20
Filter
Add more filters










Publication year range
1.
EMBO J ; 41(24): e111132, 2022 12 15.
Article in English | MEDLINE | ID: mdl-36345783

ABSTRACT

The cerebral cortex contains billions of neurons, and their disorganization or misspecification leads to neurodevelopmental disorders. Understanding how the plethora of projection neuron subtypes are generated by cortical neural stem cells (NSCs) is a major challenge. Here, we focused on elucidating the transcriptional landscape of murine embryonic NSCs, basal progenitors (BPs), and newborn neurons (NBNs) throughout cortical development. We uncover dynamic shifts in transcriptional space over time and heterogeneity within each progenitor population. We identified signature hallmarks of NSC, BP, and NBN clusters and predict active transcriptional nodes and networks that contribute to neural fate specification. We find that the expression of receptors, ligands, and downstream pathway components is highly dynamic over time and throughout the lineage implying differential responsiveness to signals. Thus, we provide an expansive compendium of gene expression during cortical development that will be an invaluable resource for studying neural developmental processes and neurodevelopmental disorders.


Subject(s)
Neural Stem Cells , Neurons , Animals , Mice , Cell Differentiation , Cell Lineage/genetics , Cerebral Cortex , Embryonic Stem Cells , Neurogenesis/genetics , Neurons/metabolism
2.
Nat Genet ; 54(7): 1037-1050, 2022 07.
Article in English | MEDLINE | ID: mdl-35789323

ABSTRACT

Zebrafish, a popular organism for studying embryonic development and for modeling human diseases, has so far lacked a systematic functional annotation program akin to those in other animal models. To address this, we formed the international DANIO-CODE consortium and created a central repository to store and process zebrafish developmental functional genomic data. Our data coordination center ( https://danio-code.zfin.org ) combines a total of 1,802 sets of unpublished and re-analyzed published genomic data, which we used to improve existing annotations and show its utility in experimental design. We identified over 140,000 cis-regulatory elements throughout development, including classes with distinct features dependent on their activity in time and space. We delineated the distinct distance topology and chromatin features between regulatory elements active during zygotic genome activation and those active during organogenesis. Finally, we matched regulatory elements and epigenomic landscapes between zebrafish and mouse and predicted functional relationships between them beyond sequence similarity, thus extending the utility of zebrafish developmental genomics to mammals.


Subject(s)
Databases, Genetic , Gene Expression Regulation, Developmental , Genome , Genomics , Regulatory Sequences, Nucleic Acid , Zebrafish Proteins , Zebrafish , Animals , Chromatin/genetics , Genome/genetics , Humans , Mice , Molecular Sequence Annotation , Organogenesis/genetics , Regulatory Sequences, Nucleic Acid/genetics , Zebrafish/embryology , Zebrafish/genetics , Zebrafish Proteins/genetics
3.
Genome Res ; 29(7): 1164-1177, 2019 07.
Article in English | MEDLINE | ID: mdl-31138617

ABSTRACT

Although ChIP-seq has become a routine experimental approach for quantitatively characterizing the genome-wide binding of transcription factors (TFs), computational analysis procedures remain far from standardized, making it difficult to compare ChIP-seq results across experiments. In addition, although genome-wide binding patterns must ultimately be determined by local constellations of DNA-binding sites, current analysis is typically limited to identifying enriched motifs in ChIP-seq peaks. Here we present Crunch, a completely automated computational method that performs all ChIP-seq analysis from quality control through read mapping and peak detecting and that integrates comprehensive modeling of the ChIP signal in terms of known and novel binding motifs, quantifying the contribution of each motif and annotating which combinations of motifs explain each binding peak. By applying Crunch to 128 data sets from the ENCODE Project, we show that Crunch outperforms current peak finders and find that TFs naturally separate into "solitary TFs," for which a single motif explains the ChIP-peaks, and "cobinding TFs," for which multiple motifs co-occur within peaks. Moreover, for most data sets, the motifs that Crunch identified de novo outperform known motifs, and both the set of cobinding motifs and the top motif of solitary TFs are consistent across experiments and cell lines. Crunch is implemented as a web server, enabling standardized analysis of any collection of ChIP-seq data sets by simply uploading raw sequencing data. Results are provided both in a graphical web interface and as downloadable files.


Subject(s)
Chromatin Immunoprecipitation Sequencing , Computational Biology/methods , Transcription Factors/metabolism , Amino Acid Motifs , Animals , Binding Sites , Datasets as Topic , Humans , Nucleotide Motifs , Quality Control , Regulatory Sequences, Nucleic Acid
4.
PLoS Comput Biol ; 13(7): e1005176, 2017 Jul.
Article in English | MEDLINE | ID: mdl-28753602

ABSTRACT

Gene regulatory networks are ultimately encoded by the sequence-specific binding of (TFs) to short DNA segments. Although it is customary to represent the binding specificity of a TF by a position-specific weight matrix (PSWM), which assumes each position within a site contributes independently to the overall binding affinity, evidence has been accumulating that there can be significant dependencies between positions. Unfortunately, methodological challenges have so far hindered the development of a practical and generally-accepted extension of the PSWM model. On the one hand, simple models that only consider dependencies between nearest-neighbor positions are easy to use in practice, but fail to account for the distal dependencies that are observed in the data. On the other hand, models that allow for arbitrary dependencies are prone to overfitting, requiring regularization schemes that are difficult to use in practice for non-experts. Here we present a new regulatory motif model, called dinucleotide weight tensor (DWT), that incorporates arbitrary pairwise dependencies between positions in binding sites, rigorously from first principles, and free from tunable parameters. We demonstrate the power of the method on a large set of ChIP-seq data-sets, showing that DWTs outperform both PSWMs and motif models that only incorporate nearest-neighbor dependencies. We also demonstrate that DWTs outperform two previously proposed methods. Finally, we show that DWTs inferred from ChIP-seq data also outperform PSWMs on HT-SELEX data for the same TF, suggesting that DWTs capture inherent biophysical properties of the interactions between the DNA binding domains of TFs and their binding sites. We make a suite of DWT tools available at dwt.unibas.ch, that allow users to automatically perform 'motif finding', i.e. the inference of DWT motifs from a set of sequences, binding site prediction with DWTs, and visualization of DWT 'dilogo' motifs.


Subject(s)
Binding Sites/genetics , Computational Biology/methods , DNA , Nucleotide Motifs/genetics , Transcription Factors , DNA/chemistry , DNA/genetics , DNA/metabolism , Models, Statistical , RNA/chemistry , RNA/genetics , RNA/metabolism , Sequence Analysis, DNA , Transcription Factors/chemistry , Transcription Factors/genetics , Transcription Factors/metabolism
5.
F1000Res ; 52016.
Article in English | MEDLINE | ID: mdl-28232860

ABSTRACT

ISMARA ( ismara.unibas.ch) automatically infers the key regulators and regulatory interactions from high-throughput gene expression or chromatin state data. However, given the large sizes of current next generation sequencing (NGS) datasets, data uploading times are a major bottleneck. Additionally, for proprietary data, users may be uncomfortable with uploading entire raw datasets to an external server. Both these problems could be alleviated by providing a means by which users could pre-process their raw data locally, transferring only a small summary file to the ISMARA server. We developed a stand-alone client application that pre-processes large input files (RNA-seq or ChIP-seq data) on the user's computer for performing ISMARA analysis in a completely automated manner, including uploading of small processed summary files to the ISMARA server. This reduces file sizes by up to a factor of 1000, and upload times from many hours to mere seconds. The client application is available from ismara.unibas.ch/ISMARA/client.

6.
Methods ; 85: 62-74, 2015 Sep 01.
Article in English | MEDLINE | ID: mdl-26164700

ABSTRACT

Analysis of gene expression data remains one of the most promising avenues toward reconstructing genome-wide gene regulatory networks. However, the large dimensionality of the problem prohibits the fitting of explicit dynamical models of gene regulatory networks, whereas machine learning methods for dimensionality reduction such as clustering or principal component analysis typically fail to provide mechanistic interpretations of the reduced descriptions. To address this, we recently developed a general methodology called motif activity response analysis (MARA) that, by modeling gene expression patterns in terms of the activities of concrete regulators, accomplishes dramatic dimensionality reduction while retaining mechanistic biological interpretations of its predictions (Balwierz, 2014). Here we extend MARA by presenting ARMADA, which models the activity dynamics of regulators across a time course, and infers the causal interactions between the regulators that drive the dynamics of their activities across time. We have implemented ARMADA as part of our ISMARA webserver, ismara.unibas.ch, allowing any researcher to automatically apply it to any gene expression time course. To illustrate the method, we apply ARMADA to a time course of human umbilical vein endothelial cells treated with TNF. Remarkably, ARMADA is able to reproduce the complex observed motif activity dynamics using a relatively small set of interactions between the key regulators in this system. In addition, we show that ARMADA successfully infers many of the key regulatory interactions known to drive this inflammatory response and discuss several novel interactions that ARMADA predicts. In combination with ISMARA, ARMADA provides a powerful approach to generating plausible hypotheses for the key interactions between regulators that control gene expression in any system for which time course measurements are available.


Subject(s)
Gene Expression Profiling/methods , Gene Regulatory Networks/genetics , Systems Analysis , Algorithms , Amino Acid Motifs/genetics , Animals , Computational Biology/methods , Humans , Mice
7.
Nucleic Acids Res ; 42(14): 9313-26, 2014 Aug.
Article in English | MEDLINE | ID: mdl-25030899

ABSTRACT

The findings that microRNAs (miRNAs) are essential for early development in many species and that embryonic miRNAs can reprogram somatic cells into induced pluripotent stem cells suggest that these miRNAs act directly on transcriptional and chromatin regulators of pluripotency. To elucidate the transcription regulatory networks immediately downstream of embryonic miRNAs, we extended the motif activity response analysis approach that infers the regulatory impact of both transcription factors (TFs) and miRNAs from genome-wide expression states. Applying this approach to multiple experimental data sets generated from mouse embryonic stem cells (ESCs) that did or did not express miRNAs of the ESC-specific miR-290-295 cluster, we identified multiple TFs that are direct miRNA targets, some of which are known to be active during cell differentiation. Our results provide new insights into the transcription regulatory network downstream of ESC-specific miRNAs, indicating that these miRNAs act on cell cycle and chromatin regulators at several levels and downregulate TFs that are involved in the innate immune response.


Subject(s)
Embryonic Stem Cells/metabolism , Gene Regulatory Networks , MicroRNAs/metabolism , Animals , Cell Cycle/genetics , Cell Differentiation/genetics , Epigenesis, Genetic , Interferon Regulatory Factor-2/metabolism , Mice , Pluripotent Stem Cells/metabolism , Transcription Factor RelA/metabolism
8.
Mol Biol Evol ; 31(5): 1077-88, 2014 May.
Article in English | MEDLINE | ID: mdl-24600054

ABSTRACT

Studies of microbial evolutionary dynamics are being transformed by the availability of affordable high-throughput sequencing technologies, which allow whole-genome sequencing of hundreds of related taxa in a single study. Reconstructing a phylogenetic tree of these taxa is generally a crucial step in any evolutionary analysis. Instead of constructing genome assemblies for all taxa, annotating these assemblies, and aligning orthologous genes, many recent studies 1) directly map raw sequencing reads to a single reference sequence, 2) extract single nucleotide polymorphisms (SNPs), and 3) infer the phylogenetic tree using maximum likelihood methods from the aligned SNP positions. However, here we show that, when using such methods to reconstruct phylogenies from sets of simulated sequences, both the exclusion of nonpolymorphic positions and the alignment to a single reference genome, introduce systematic biases and errors in phylogeny reconstruction. To address these problems, we developed a new method that combines alignments from mappings to multiple reference sequences and show that this successfully removes biases from the reconstructed phylogenies. We implemented this method as a web server named REALPHY (Reference sequence Alignment-based Phylogeny builder), which fully automates phylogenetic reconstruction from raw sequencing reads.


Subject(s)
Genomics/methods , Phylogeny , Algorithms , Computer Simulation , Escherichia coli/genetics , Evolution, Molecular , Genome, Bacterial , High-Throughput Nucleotide Sequencing , Likelihood Functions , Models, Genetic , Polymorphism, Single Nucleotide , Pseudomonas syringae/genetics , Reproducibility of Results , Sequence Alignment , Sinorhizobium meliloti/genetics
9.
Genome Res ; 24(5): 869-84, 2014 May.
Article in English | MEDLINE | ID: mdl-24515121

ABSTRACT

Accurate reconstruction of the regulatory networks that control gene expression is one of the key current challenges in molecular biology. Although gene expression and chromatin state dynamics are ultimately encoded by constellations of binding sites recognized by regulators such as transcriptions factors (TFs) and microRNAs (miRNAs), our understanding of this regulatory code and its context-dependent read-out remains very limited. Given that there are thousands of potential regulators in mammals, it is not practical to use direct experimentation to identify which of these play a key role for a particular system of interest. We developed a methodology that models gene expression or chromatin modifications in terms of genome-wide predictions of regulatory sites and completely automated it into a web-based tool called ISMARA (Integrated System for Motif Activity Response Analysis). Given only gene expression or chromatin state data across a set of samples as input, ISMARA identifies the key TFs and miRNAs driving expression/chromatin changes and makes detailed predictions regarding their regulatory roles. These include predicted activities of the regulators across the samples, their genome-wide targets, enriched gene categories among the targets, and direct interactions between the regulators. Applying ISMARA to data sets from well-studied systems, we show that it consistently identifies known key regulators ab initio. We also present a number of novel predictions including regulatory interactions in innate immunity, a master regulator of mucociliary differentiation, TFs consistently disregulated in cancer, and TFs that mediate specific chromatin modifications.


Subject(s)
Genome, Human , Models, Genetic , Nucleotide Motifs , Regulatory Sequences, Nucleic Acid , Sequence Analysis, DNA/methods , Algorithms , Animals , Chromatin Assembly and Disassembly , Humans , Mice
10.
Cancer Cell ; 23(6): 768-83, 2013 Jun 10.
Article in English | MEDLINE | ID: mdl-23764001

ABSTRACT

Gene expression profiling has uncovered the transcription factor Sox4 with upregulated activity during TGF-ß-induced epithelial-mesenchymal transition (EMT) in normal and cancerous breast epithelial cells. Sox4 is indispensable for EMT and cell survival in vitro and for primary tumor growth and metastasis in vivo. Among several EMT-relevant genes, Sox4 directly regulates the expression of Ezh2, encoding the Polycomb group histone methyltransferase that trimethylates histone 3 lysine 27 (H3K27me3) for gene repression. Ablation of Ezh2 expression prevents EMT, whereas forced expression of Ezh2 restores EMT in Sox4-deficient cells. Ezh2-mediated H3K27me3 marks associate with key EMT genes, representing an epigenetic EMT signature that predicts patient survival. Our results identify Sox4 as a master regulator of EMT by governing the expression of the epigenetic modifier Ezh2.


Subject(s)
Epigenesis, Genetic , Epithelial-Mesenchymal Transition , Gene Expression Regulation, Neoplastic , Mammary Neoplasms, Experimental/genetics , Polycomb Repressive Complex 2/genetics , SOXC Transcription Factors/physiology , Animals , Cell Line , Cell Movement/genetics , Cell Survival/genetics , Enhancer of Zeste Homolog 2 Protein , Female , Histones/metabolism , Humans , MCF-7 Cells , Mammary Neoplasms, Experimental/metabolism , Mammary Neoplasms, Experimental/pathology , Methylation , Mice , Neoplasm Metastasis/genetics , Polycomb Repressive Complex 2/metabolism , Polycomb Repressive Complex 2/physiology , Promoter Regions, Genetic , SOXC Transcription Factors/genetics , SOXC Transcription Factors/metabolism , Transcription, Genetic
11.
PLoS One ; 8(2): e57329, 2013.
Article in English | MEDLINE | ID: mdl-23451207

ABSTRACT

We have identified the zinc-finger transcription factor Kruppel-like factor 4 (Klf4) among the transcription factors that are significantly downregulated in their expression during epithelial-mesenchymal transition (EMT) in mammary epithelial cells and in breast cancer cells. Loss and gain of function experiments demonstrate that the down-regulation of Klf4 expression is required for the induction of EMT in vitro and for metastasis in vivo. In addition, reduced Klf4 expression correlates with shorter disease-free survival of subsets of breast cancer patients. Yet, reduced expression of Klf4 also induces apoptosis in cells undergoing TGFß-induced EMT. Chromatin immunoprecipitation/deep-sequencing in combination with gene expression profiling reveals direct Klf4 target genes, including E-cadherin (Cdh1), N-cadherin (Cdh2), vimentin (Vim), ß-catenin (Ctnnb1), VEGF-A (Vegfa), endothelin-1 (Edn1) and Jnk1 (Mapk8). Thereby, Klf4 acts as a transcriptional activator of epithelial genes and as a repressor of mesenchymal genes. Specifically, increased expression of Jnk1 (Mapk8) upon down-regulation of its transcriptional repressor Klf4 is required for EMT cell migration and for the induction of apoptosis. The data demonstrate a central role of Klf4 in the maintenance of epithelial cell differentiation and the prevention of EMT and metastasis.


Subject(s)
Epithelial-Mesenchymal Transition/physiology , Kruppel-Like Transcription Factors/physiology , Mitogen-Activated Protein Kinase 8/physiology , Trans-Activators/physiology , Animals , Cell Differentiation , Kruppel-Like Factor 4 , Mice , Neoplasms, Experimental/pathology
12.
Genome Res ; 23(1): 60-73, 2013 Jan.
Article in English | MEDLINE | ID: mdl-22964890

ABSTRACT

Although changes in chromatin are integral to transcriptional reprogramming during cellular differentiation, it is currently unclear how chromatin modifications are targeted to specific loci. To systematically identify transcription factors (TFs) that can direct chromatin changes during cell fate decisions, we model the relationship between genome-wide dynamics of chromatin marks and the local occurrence of computationally predicted TF binding sites. By applying this computational approach to a time course of Polycomb-mediated H3K27me3 marks during neuronal differentiation of murine stem cells, we identify several motifs that likely regulate the dynamics of this chromatin mark. Among these, the sites bound by REST and by the SNAIL family of TFs are predicted to transiently recruit H3K27me3 in neuronal progenitors. We validate these predictions experimentally and show that absence of REST indeed causes loss of H3K27me3 at target promoters in trans, specifically at the neuronal progenitor state. Moreover, using targeted transgenic insertion, we show that promoter fragments containing REST or SNAIL binding sites are sufficient to recruit H3K27me3 in cis, while deletion of these sites results in loss of H3K27me3. These findings illustrate that the occurrence of TF binding sites can determine chromatin dynamics. Local determination of Polycomb activity by REST and SNAIL motifs exemplifies such TF based regulation of chromatin. Furthermore, our results show that key TFs can be identified ab initio through computational modeling of epigenome data sets using a modeling approach that we make readily accessible.


Subject(s)
Chromatin Assembly and Disassembly , Epigenesis, Genetic , Models, Genetic , Polycomb-Group Proteins/metabolism , Transcription Factors/metabolism , Animals , Binding Sites , Cattle , Cell Differentiation , Chromatin/metabolism , Dogs , Genome , Histones/metabolism , Horses , Humans , Macaca , Mice , Neurons/cytology , Opossums , Promoter Regions, Genetic , Snail Family Transcription Factors , Stem Cells/cytology , Transgenes
13.
Nucleic Acids Res ; 41(Database issue): D214-20, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23180783

ABSTRACT

Identification of genomic regulatory elements is essential for understanding the dynamics of cellular processes. This task has been substantially facilitated by the availability of genome sequences for many species and high-throughput data of transcripts and transcription factor (TF) binding. However, rigorous computational methods are necessary to derive accurate genome-wide annotations of regulatory sites from such data. SwissRegulon (http://swissregulon.unibas.ch) is a database containing genome-wide annotations of regulatory motifs, promoters and TF binding sites (TFBSs) in promoter regions across model organisms. Its binding site predictions were obtained with rigorous Bayesian probabilistic methods that operate on orthologous regions from related genomes, and use explicit evolutionary models to assess the evidence of purifying selection on each site. New in the current version of SwissRegulon is a curated collection of 190 mammalian regulatory motifs associated with ∼340 TFs, and TFBS annotations across a curated set of ∼35 000 promoters in both human and mouse. Predictions of TFBSs for Saccharomyces cerevisiae have also been significantly extended and now cover 158 of yeast's ∼180 TFs. All data are accessible through both an easily navigable genome browser with search functions, and as flat files that can be downloaded for further analysis.


Subject(s)
Databases, Genetic , Molecular Sequence Annotation , Regulatory Elements, Transcriptional , Transcription Factors/metabolism , Algorithms , Animals , Binding Sites , Genomics , Humans , Internet , Mice , Promoter Regions, Genetic , Regulon , Saccharomyces cerevisiae/genetics , Transcription Initiation Site , User-Computer Interface
14.
Diabetes ; 61(8): 1986-93, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22688341

ABSTRACT

In obesity, white adipose tissue (WAT) inflammation is linked to insulin resistance. Increased adipocyte chemokine (C-C motif) ligand 2 (CCL2) secretion may initiate adipose inflammation by attracting the migration of inflammatory cells into the tissue. Using an unbiased approach, we identified adipose microRNAs (miRNAs) that are dysregulated in human obesity and assessed their possible role in controlling CCL2 production. In subcutaneous WAT obtained from 56 subjects, 11 miRNAs were present in all subjects and downregulated in obesity. Of these, 10 affected adipocyte CCL2 secretion in vitro and for 2 miRNAs (miR-126 and miR-193b), regulatory circuits were defined. While miR-126 bound directly to the 3'-untranslated region of CCL2 mRNA, miR-193b regulated CCL2 production indirectly through a network of transcription factors, many of which have been identified in other inflammatory conditions. In addition, overexpression of miR-193b and miR-126 in a human monocyte/macrophage cell line attenuated CCL2 production. The levels of the two miRNAs in subcutaneous WAT were significantly associated with CCL2 secretion (miR-193b) and expression of integrin, α-X, an inflammatory macrophage marker (miR-193b and miR-126). Taken together, our data suggest that miRNAs may be important regulators of adipose inflammation through their effects on CCL2 release from human adipocytes and macrophages.


Subject(s)
Adipose Tissue, White/metabolism , Chemokine CCL2/biosynthesis , MicroRNAs/metabolism , Obesity/metabolism , Adipocytes/metabolism , Cell Line , Female , Humans , Inflammation/metabolism , Insulin Resistance/physiology , Macrophages/metabolism
15.
Nat Med ; 18(4): 529-37, 2012 Mar 04.
Article in English | MEDLINE | ID: mdl-22388088

ABSTRACT

New cancer therapies are likely to arise from an in-depth understanding of the signaling networks influencing tumor initiation, progression and metastasis. We show a fundamental role for Src-homology 2 domain-containing phosphatase 2 (SHP2) in these processes in human epidermal growth factor receptor 2 (HER2)-positive and triple-negative breast cancers. Knockdown of SHP2 eradicated breast tumor-initiating cells in xenograft models, and SHP2 depletion also prevented invasion in three-dimensional cultures and in a transductal invasion assay in vivo. Notably, SHP2 knockdown in established breast tumors blocked their growth and reduced metastasis. Mechanistically, SHP2 activated stemness-associated transcription factors, including v-myc myelocytomatosis viral oncogene homolog (c-Myc) and zinc finger E-box binding homeobox 1 (ZEB1), which resulted in the repression of let-7 microRNA and the expression of a set of 'SHP2 signature' genes. We found these genes to be simultaneously activated in a large subset of human primary breast tumors that are associated with invasive behavior and poor prognosis. These results provide new insights into the signaling cascades influencing tumor-initiating cells as well as a rationale for targeting SHP2 in breast cancer.


Subject(s)
Breast Neoplasms/pathology , Cell Transformation, Neoplastic/pathology , Gene Expression Regulation, Neoplastic/physiology , Protein Tyrosine Phosphatase, Non-Receptor Type 11/metabolism , Signal Transduction/physiology , Transcription Factors/metabolism , Animals , Autoantigens/metabolism , Caspase 3/metabolism , Cell Adhesion Molecules/metabolism , Cell Polarity/physiology , Cell Proliferation , Computational Biology , Disease Progression , Doxycycline/pharmacology , Female , Flow Cytometry , Gene Expression Profiling , Gene Expression Regulation, Neoplastic/drug effects , Homeodomain Proteins/genetics , Homeodomain Proteins/metabolism , Humans , Ki-67 Antigen/metabolism , Luminescent Proteins/genetics , Luminescent Proteins/metabolism , Membrane Proteins/metabolism , Mice , Mice, SCID , Mitogen-Activated Protein Kinases/metabolism , Oligonucleotide Array Sequence Analysis , Platelet Endothelial Cell Adhesion Molecule-1/metabolism , Protein Tyrosine Phosphatase, Non-Receptor Type 11/genetics , Proto-Oncogene Proteins c-myc/genetics , Proto-Oncogene Proteins c-myc/metabolism , RNA, Small Interfering/metabolism , Receptor, ErbB-2/metabolism , Time Factors , Transcription Factors/genetics , Tumor Cells, Cultured , Zinc Finger E-box-Binding Homeobox 1 , src Homology Domains/physiology , Kalinin
16.
Bioinformatics ; 28(4): 487-94, 2012 Feb 15.
Article in English | MEDLINE | ID: mdl-22334039

ABSTRACT

MOTIVATION: Probabilistic approaches for inferring transcription factor binding sites (TFBSs) and regulatory motifs from DNA sequences have been developed for over two decades. Previous work has shown that prediction accuracy can be significantly improved by incorporating features such as the competition of multiple transcription factors (TFs) for binding to nearby sites, the tendency of TFBSs for co-regulated TFs to cluster and form cis-regulatory modules and explicit evolutionary modeling of conservation of TFBSs across orthologous sequences. However, currently available tools only incorporate some of these features, and significant methodological hurdles hampered their synthesis into a single consistent probabilistic framework. RESULTS: We present MotEvo, a integrated suite of Bayesian probabilistic methods for the prediction of TFBSs and inference of regulatory motifs from multiple alignments of phylogenetically related DNA sequences, which incorporates all features just mentioned. In addition, MotEvo incorporates a novel model for detecting unknown functional elements that are under evolutionary constraint, and a new robust model for treating gain and loss of TFBSs along a phylogeny. Rigorous benchmarking tests on ChIP-seq datasets show that MotEvo's novel features significantly improve the accuracy of TFBS prediction, motif inference and enhancer prediction. AVAILABILITY: Source code, a user manual and files with several example applications are available at www.swissregulon.unibas.ch.


Subject(s)
Bayes Theorem , Sequence Alignment/methods , Transcription Factors/metabolism , Animals , Base Sequence , Binding Sites , Enhancer Elements, Genetic , Humans , Phylogeny , Protein Binding
17.
J Immunol ; 182(1): 433-45, 2009 Jan 01.
Article in English | MEDLINE | ID: mdl-19109175

ABSTRACT

MicroRNAs (miRNAs) constitute a large family of small noncoding RNAs that have emerged as key posttranscriptional regulators in a wide variety of organisms. Because any one miRNA can potentially regulate expression of a distinct set of genes, differential miRNA expression can shape the repertoire of proteins that are actually expressed during development and differentiation or disease. Here, we have used mast cells as a model to investigate the role of miRNAs in differentiated innate immune cells and found that miR-221-222 are significantly up-regulated upon mast cell activation. Using both bioinformatics and experimental approaches, we identified some signaling pathways, transcription factors, and potential cis-regulatory regions that control miR-221-222 transcription. Overexpression of miR-221-222 in a model mast cell line perturbed cell morphology and cell cycle regulation without altering viability. While in stimulated cells miR-221-222 partially counteracted expression of the cell-cycle inhibitor p27(kip1), we found that in the mouse alternative splicing results in two p27(kip1) mRNA isoforms that differ in their 3' untranslated region, only one of which is subject to miR-221-222 regulation. Additionally, transgenic expression of miR-221-222 from bacterial artificial chromosome clones in embryonic stem cells dramatically reduced cell proliferation and severely impaired their accumulation. Our study provides further insights on miR-221-222 transcriptional regulation as well as evidences that miR-221-222 regulate cell cycle checkpoints in mast cells in response to acute activation stimuli.


Subject(s)
Cell Cycle/genetics , Cell Cycle/immunology , Gene Expression Regulation/immunology , Mast Cells/immunology , MicroRNAs/physiology , Animals , Bone Marrow Cells/cytology , Bone Marrow Cells/immunology , Bone Marrow Cells/metabolism , Cell Aggregation/genetics , Cell Aggregation/immunology , Cell Line , Cell Proliferation , Cell Survival/genetics , Cell Survival/immunology , Growth Inhibitors/genetics , Growth Inhibitors/physiology , Humans , Lymphocyte Subsets/cytology , Lymphocyte Subsets/immunology , Lymphocyte Subsets/metabolism , Mast Cells/cytology , Mast Cells/metabolism , Mice , Mice, Inbred BALB C , Mice, Inbred C57BL , Mice, Transgenic , MicroRNAs/biosynthesis , MicroRNAs/genetics , Signal Transduction/genetics , Signal Transduction/immunology , Transcriptional Activation/immunology , Up-Regulation/genetics , Up-Regulation/immunology
18.
Gene ; 396(2): 215-25, 2007 Jul 15.
Article in English | MEDLINE | ID: mdl-17467928

ABSTRACT

Elementary modes analysis allows one to reveal whether a set of known enzymes is sufficient to sustain functionality of the cell. Moreover, it is helpful in detecting missing reactions and predicting which enzymes could fill these gaps. Here, we perform a comprehensive elementary modes analysis and a genomic context analysis of Mycoplasma pneumoniae nucleotide metabolism, and search for new enzyme activities. The purine and pyrimidine networks are reconstructed by assembling enzymes annotated in the genome or found experimentally. We show that these reaction sets are sufficient for enabling synthesis of DNA and RNA in M. pneumoniae. Special focus is on the key modes for growth. Moreover, we make an educated guess on the nutritional requirements of this micro-organism. For the case that M. pneumoniae does not require adenine as a substrate, we suggest adenylosuccinate synthetase (EC 6.3.4.4), adenylosuccinate lyase (EC 4.3.2.2) and GMP reductase (EC 1.7.1.7) to be operative. GMP reductase activity is putatively assigned to the NRDI_MYCPN gene on the basis of the genomic context analysis. For the pyrimidine network, we suggest CTP synthase (EC 6.3.4.2) to be active. Further experiments on the nutritional requirements are needed to make a decision. Pyrimidine metabolism appears to be more appropriate as a drug target than purine metabolism since it shows lower plasticity.


Subject(s)
Genomics , Mycoplasma pneumoniae/genetics , Adenine/chemistry , Base Composition , Biochemistry/methods , DNA/chemistry , Genome, Bacterial , Models, Biological , Models, Genetic , Models, Theoretical , Nucleotides/chemistry , Purines/chemistry , Pyrimidines/chemistry , RNA/chemistry , Substrate Specificity
19.
Methods Mol Biol ; 358: 199-226, 2007.
Article in English | MEDLINE | ID: mdl-17035688

ABSTRACT

The theoretical investigation of the structure of metabolic systems has recently attracted increasing interest. In this chapter, the basic concepts of metabolic pathway analysis are described and various applications are outlined. In particular, the concepts of nullspace and elementary flux modes are explained. The presentation is illustrated by a simple example from tyrosine metabolism and a system describing lysine production in Corynebacterium glutamicum. The latter system gives rise to 37 elementary modes, 36 of which produce lysine with different molar yields. The examples illustrate that metabolic pathway analysis is a useful tool for better understanding the complex architecture of intracellular metabolism, for determining the pathways on which the molar conversion yield of a substrate-product pair under study is maximal, and for assigning functions to orphan genes (functional genomics). Moreover, problems emerging in the modeling of large networks are discussed. An outlook on current trends in the field concludes the chapter.


Subject(s)
Corynebacterium glutamicum/growth & development , Corynebacterium glutamicum/metabolism , Metabolic Networks and Pathways , Models, Biological , Animals , Humans , Lysine/metabolism , Tyrosine/metabolism
20.
Nucleic Acids Res ; 35(Database issue): D127-31, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17130146

ABSTRACT

SwissRegulon (http://www.swissregulon.unibas.ch) is a database containing genome-wide annotations of regulatory sites in the intergenic regions of genomes. The regulatory site annotations are produced using a number of recently developed algorithms that operate on multiple alignments of orthologous intergenic regions from related genomes in combination with, whenever available, known sites from the literature, and ChIP-on-chip binding data. Currently SwissRegulon contains annotations for yeast and 17 prokaryotic genomes. The database provides information about the sequence, location, orientation, posterior probability and, whenever available, binding factor of each annotated site. To enable easy viewing of the regulatory site annotations in the context of other features annotated on the genomes, the sites are displayed using the GBrowse genome browser interface and can be queried based on any annotated genomic feature. The database can also be queried for regulons, i.e. sites bound by a common factor.


Subject(s)
Databases, Nucleic Acid , Regulatory Elements, Transcriptional , Regulon , Transcription Factors/metabolism , Algorithms , Bacteria/genetics , Binding Sites , DNA, Intergenic/chemistry , Genomics , Internet , User-Computer Interface , Yeasts/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...