Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
BMC Genomics ; 25(1): 307, 2024 Mar 23.
Artigo em Inglês | MEDLINE | ID: mdl-38521929

RESUMO

BACKGROUND: Transcription factor (TF) proteins are a key component of the gene regulatory networks that control cellular fates and function. TFs bind DNA regulatory elements in a sequence-specific manner and modulate target gene expression through combinatorial interactions with each other, cofactors, and chromatin-modifying proteins. Large-scale studies over the last two decades have helped shed light on the complex network of TFs that regulate development in Drosophila melanogaster. RESULTS: Here, we present a detailed characterization of expression of all known and predicted Drosophila TFs in two well-established embryonic cell lines, Kc167 and S2 cells. Using deep coverage RNA sequencing approaches we investigate the transcriptional profile of all 707 TF coding genes in both cell types. Only 103 TFs have no detectable expression in either cell line and 493 TFs have a read count of 5 or greater in at least one of the cell lines. The 493 TFs belong to 54 different DNA-binding domain families, with significant enrichment of those in the zf-C2H2 family. We identified 123 differentially expressed genes, with 57 expressed at significantly higher levels in Kc167 cells than S2 cells, and 66 expressed at significantly lower levels in Kc167 cells than S2 cells. Network mapping reveals that many of these TFs are crucial components of regulatory networks involved in cell proliferation, cell-cell signaling pathways, and eye development. CONCLUSIONS: We produced a reference TF coding gene expression dataset in the extensively studied Drosophila Kc167 and S2 embryonic cell lines, and gained insight into the TF regulatory networks that control the activity of these cells.


Assuntos
Drosophila , Fatores de Transcrição , Humanos , Animais , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Drosophila/genética , Drosophila melanogaster/metabolismo , Redes Reguladoras de Genes , DNA/metabolismo , Linhagem Celular
2.
Protein Expr Purif ; 158: 9-14, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-30738927

RESUMO

Homeodomain transcription factors (HD TFs) are a large class of evolutionarily conserved DNA binding proteins that contain a basic 60-amino acid region required for binding to specific DNA sites. In Drosophila melanogaster, many of these HD TFs are expressed in the early embryo and control transcription of target genes in development through their interaction with cis-regulatory modules. Previous studies where some of the Drosophila HD TFs were purified required the use of strong denaturants (i.e. 6 M urea) and multiple chromatography columns, making the downstream biochemical examination of the isolated protein difficult. To circumvent these obstacles, we have developed a streamlined expression and purification protocol to produce large yields of Drosophila HD TFs. Using the HD TFs FUSHI-TARAZU (FTZ), ANTENNAPEDIA (ANTP), ABDOMINAL-A (ABD-A), ABDOMINAL-B (ABD-B), and ULTRABITHORAX (UBX) as examples, we demonstrate that our 3-day protocol involving the overexpression of His6-SUMO fusion constructs in E. coli followed by a Ni2+-IMAC, SUMO-tag cleavage with the SUMO protease Ulp1, and a heparin column purification produces pure, soluble protein in biological buffers around pH 7 in the absence of denaturants. Electrophoretic mobility shift assays (EMSA) confirm that the purified HD proteins are functional and nuclear magnetic resonance (NMR) spectra confirm that the purified HDs are well-folded. These purified HD TFs can be used in future biophysical experiments to structurally and biochemically characterize how and why these HD TFs bind to different DNA sequences and further probe how nucleotide differences contribute to TF-DNA specificity in the HD family.


Assuntos
Proteínas de Drosophila , Proteínas de Homeodomínio , Proteínas Recombinantes de Fusão , Animais , Proteínas de Drosophila/química , Proteínas de Drosophila/genética , Proteínas de Drosophila/isolamento & purificação , Drosophila melanogaster , Proteínas de Homeodomínio/química , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/isolamento & purificação , Proteínas Recombinantes de Fusão/química , Proteínas Recombinantes de Fusão/genética , Proteínas Recombinantes de Fusão/isolamento & purificação
3.
BMC Bioinformatics ; 16: 30, 2015 Jan 31.
Artigo em Inglês | MEDLINE | ID: mdl-25637281

RESUMO

BACKGROUND: A key challenge in understanding the molecular mechanisms that control gene regulation is the characterization of the specificity with which transcription factor proteins bind to specific DNA sequences. A number of computational approaches have been developed to examine these interactions, including simple mononucleotide and dinucleotide position weight matrix models. RESULTS: Here we develop a novel, unbiased computational algorithm, MARZ, that systematically analyzes all possible gapped matrices across a fixed number of nucleotides. In addition, to evaluate the ability of these matrix models to predict in vivo binding sites, we utilize a new scoring system and, in combination with established scoring methods and statistical analysis, test the performance of 32 different gapped matrices on the well characterized HUNCHBACK transcription factor in Drosophila. CONCLUSIONS: Our results indicate that in many cases gapped matrix models can outperform traditional models, but that the relative strength of the binding sites considered in the analysis can profoundly influence the predictive ability of specific models.


Assuntos
Algoritmos , Biologia Computacional/métodos , Proteínas de Ligação a DNA/química , Proteínas de Drosophila/química , Drosophila/metabolismo , Nucleotídeos/genética , Fatores de Transcrição/química , Animais , Sítios de Ligação , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Drosophila/genética , Proteínas de Drosophila/genética , Proteínas de Drosophila/metabolismo , Matrizes de Pontuação de Posição Específica , Ligação Proteica , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
4.
Biochim Biophys Acta ; 1829(9): 946-53, 2013 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-23643643

RESUMO

A rigorous analysis of transcriptional regulation at the DNA level is crucial to the understanding of many biological systems. Mathematical modeling has offered researchers a new approach to understanding this central process. In particular, thermodynamic-based modeling represents the most biophysically informed approach aimed at connecting DNA level regulatory sequences to the expression of specific genes. The goal of this review is to give biologists a thorough description of the steps involved in building, analyzing, and implementing a thermodynamic-based model of transcriptional regulation. The data requirements for this modeling approach are described, the derivation for a specific regulatory region is shown, and the challenges and future directions for the quantitative modeling of gene regulation are discussed.


Assuntos
Regulação da Expressão Gênica , Modelos Genéticos , Termodinâmica , Transcrição Gênica
5.
Methods ; 62(1): 99-108, 2013 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-23726942

RESUMO

Deciphering the mechanisms involved in gene regulation holds the key to understanding the control of central biological processes, including human disease, population variation, and the evolution of morphological innovations. New experimental techniques including whole genome sequencing and transcriptome analysis have enabled comprehensive modeling approaches to study gene regulation. In many cases, it is useful to be able to assign biological significance to the inferred model parameters, but such interpretation should take into account features that affect these parameters, including model construction and sensitivity, the type of fitness calculation, and the effectiveness of parameter estimation. This last point is often neglected, as estimation methods are often selected for historical reasons or for computational ease. Here, we compare the performance of two parameter estimation techniques broadly representative of local and global approaches, namely, a quasi-Newton/Nelder-Mead simplex (QN/NMS) method and a covariance matrix adaptation-evolutionary strategy (CMA-ES) method. The estimation methods were applied to a set of thermodynamic models of gene transcription applied to regulatory elements active in the Drosophila embryo. Measuring overall fit, the global CMA-ES method performed significantly better than the local QN/NMS method on high quality data sets, but this difference was negligible on lower quality data sets with increased noise or on data sets simplified by stringent thresholding. Our results suggest that the choice of parameter estimation technique for evaluation of gene expression models depends both on quality of data, the nature of the models [again, remains to be established] and the aims of the modeling effort.


Assuntos
Algoritmos , Drosophila melanogaster/genética , Embrião não Mamífero/metabolismo , Regulação da Expressão Gênica no Desenvolvimento , Modelos Genéticos , Biologia de Sistemas/métodos , Animais , Drosophila melanogaster/embriologia , Drosophila melanogaster/metabolismo , Embrião não Mamífero/citologia , Perfilação da Expressão Gênica , Humanos , Termodinâmica , Transcrição Gênica
6.
BMC Bioinformatics ; 14: 298, 2013 Oct 04.
Artigo em Inglês | MEDLINE | ID: mdl-24093548

RESUMO

BACKGROUND: Gene expression in the Drosophila embryo is controlled by functional interactions between a large network of protein transcription factors (TFs) and specific sequences in DNA cis-regulatory modules (CRMs). The binding site sequences for any TF can be experimentally determined and represented in a position weight matrix (PWM). PWMs can then be used to predict the location of TF binding sites in other regions of the genome, although there are limitations to this approach as currently implemented. RESULTS: In this proof-of-principle study, we analyze 127 CRMs and focus on four TFs that control transcription of target genes along the anterio-posterior axis of the embryo early in development. For all four of these TFs, there is some degree of conserved flanking sequence that extends beyond the predicted binding regions. A potential role for these conserved flanking sequences may be to enhance the specificity of TF binding, as the abundance of these sequences is greatly diminished when we examine only predicted high-affinity binding sites. CONCLUSIONS: Expanding PWMs to include sequence context-dependence will increase the information content in PWMs and facilitate a more efficient functional identification and dissection of CRMs.


Assuntos
Sítios de Ligação/genética , Drosophila melanogaster/genética , Ligação Proteica/genética , Elementos Reguladores de Transcrição/genética , Fatores de Transcrição , Animais , Drosophila melanogaster/embriologia , Drosophila melanogaster/metabolismo , Regulação da Expressão Gênica/genética , Genoma , Genômica , Matrizes de Pontuação de Posição Específica , Análise de Sequência de DNA/métodos , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
7.
G3 (Bethesda) ; 13(5)2023 05 02.
Artigo em Inglês | MEDLINE | ID: mdl-36869676

RESUMO

Drosophila melanogaster cell lines are an important resource for a range of studies spanning genomics, molecular genetics, and cell biology. Amongst these valuable lines are Kc167 (Kc) and Schneider 2 (S2) cells, which were originally isolated in the late 1960s from embryonic sources and have been used extensively to investigate a broad spectrum of biological activities including cell-cell signaling and immune system function. Whole-genome tiling microarray analysis of total RNA from these two cell types was performed as part of the modENCODE project over a decade ago and revealed that they share a number of gene expression features. Here, we expand on these earlier studies by using deep-coverage RNA-sequencing approaches to investigate the transcriptional profile in Kc and S2 cells in detail. Comparison of the transcriptomes reveals that ∼75% of the 13,919 annotated genes are expressed at a detectable level in at least one of the cell lines, with the majority of these genes expressed at high levels in both cell lines. Despite the overall similarity of the transcriptional landscape in the two cell types, 2,588 differentially expressed genes are identified. Many of the genes with the largest fold change are known only by their "CG" designations, indicating that the molecular control of Kc and S2 cell identity may be regulated in part by a cohort of relatively uncharacterized genes. Our data also indicate that both cell lines have distinct hemocyte-like identities, but share active signaling pathways and express a number of genes in the network responsible for dorsal-ventral patterning of the early embryo.


Assuntos
Drosophila , Transcriptoma , Animais , Drosophila/genética , Drosophila melanogaster/genética , RNA , Linhagem Celular
8.
PeerJ ; 11: e15597, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37366427

RESUMO

The core promoter elements are important DNA sequences for the regulation of RNA polymerase II transcription in eukaryotic cells. Despite the broad evolutionary conservation of these elements, there is extensive variation in the nucleotide composition of the actual sequences. In this study, we aim to improve our understanding of the complexity of this sequence variation in the TATA box and initiator core promoter elements in Drosophila melanogaster. Using computational approaches, including an enhanced version of our previously developed MARZ algorithm that utilizes gapped nucleotide matrices, several sequence landscape features are uncovered, including an interdependency between the nucleotides in position 2 and 5 in the initiator. Incorporating this information in an expanded MARZ algorithm improves predictive performance for the identification of the initiator element. Overall our results demonstrate the need to carefully consider detailed sequence composition features in core promoter elements in order to make more robust and accurate bioinformatic predictions.


Assuntos
Drosophila melanogaster , Drosophila , Animais , Drosophila/genética , Drosophila melanogaster/genética , Sequência de Bases , Algoritmos , Nucleotídeos
9.
NAR Genom Bioinform ; 5(2): lqad035, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37081864

RESUMO

DNA methylation, the addition of a methyl (CH3) group to a cytosine residue, is an evolutionarily conserved epigenetic mark involved in a number of different biological functions in eukaryotes, including transcriptional regulation, chromatin structural organization, cellular differentiation and development. In the social amoeba Dictyostelium, previous studies have shown the existence of a DNA methyltransferase (DNMA) belonging to the DNMT2 family, but the extent and function of 5-methylcytosine in the genome are unclear. Here, we present the whole genome DNA methylation profile of Dictyostelium discoideum using deep coverage replicate sequencing of bisulfite-converted gDNA extracted from post-starvation cells. We find an overall very low number of sites with any detectable level of DNA methylation, occurring at significant levels in only 303-3432 cytosines out of the ∼7.5 million total cytosines in the genome depending on the replicate. Furthermore, a knockout of the DNMA enzyme leads to no overall decrease in DNA methylation. Of the identified sites, significant methylation is only detected at 11 sites in all four of the methylomes analyzed. Targeted bisulfite PCR sequencing and computational analysis demonstrate that the methylation profile does not change during development and that these 11 cytosines are most likely false positives generated by protection from bisulfite conversion due to their location in hairpin-forming palindromic DNA sequences. Our data therefore provide evidence that there is no significant DNA methylation in Dictyostelium before fruiting body formation and identify a reproducible experimental artifact from bisulfite sequencing.

10.
Math Biosci ; 342: 108716, 2021 12.
Artigo em Inglês | MEDLINE | ID: mdl-34687735

RESUMO

A detailed comprehension of transcriptional regulation is critical to understanding the genetic control of development and disease across many different organisms. To more fully investigate the complex molecular interactions controlling the precise expression of genes, many groups have constructed mathematical models to complement their experimental approaches. A critical step in such studies is choosing the most appropriate parameter estimation algorithm to enable detailed analysis of the parameters that contribute to the models. In this study, we develop a novel set of evolutionary algorithms that use a pseudo-random Sobol Set to construct the initial population and incorporate parameter sensitivities into the adaptation of mutation rates, using local, global, and hybrid strategies. Comparison of the performance of these new algorithms to a number of current state-of-the-art global parameter estimation algorithms on a range of continuous test functions, as well as synthetic biological data representing models of gene regulatory systems, reveals improved performance of the new algorithms in terms of runtime, error and reproducibility. In addition, by analyzing the ability of these algorithms to fit datasets of varying quality, we provide the experimentalist with a guide to how the algorithms perform across a range of noisy data. These results demonstrate the improved performance of the new set of parameter estimation algorithms and facilitate meaningful integration of model parameters and predictions in our understanding of the molecular mechanisms of gene regulation.


Assuntos
Algoritmos , Modelos Biológicos , Evolução Biológica , Reprodutibilidade dos Testes , Termodinâmica
11.
Math Biosci ; 316: 108239, 2019 10.
Artigo em Inglês | MEDLINE | ID: mdl-31454629

RESUMO

In all complex organisms, the precise levels and timing of gene expression controls vital biological processes. In higher eukaryotes, including the fruit fly Drosophila melanogaster, the complex molecular control of transcription (the synthesis of RNA from DNA) and translation (the synthesis of proteins from RNA) events driving this gene expression are not fully understood. In particular, for Drosophila melanogaster, there is a plethora of experimental data, including quantitative measurements of both RNA and protein concentrations, but the precise mechanisms that control the dynamics of gene expression during early development and the processes which lead to steady-state levels of certain proteins remain elusive. This study analyzes a current mathematical modeling approach in an attempt to better understand the long-term behavior of gene regulation. The model is a modified reaction-diffusion equation which has been previously employed in predicting gene expression levels and studying the relative contributions of transcription and translation events to protein abundance [10,11,24]. Here, we use Matrix Algebra and Analysis techniques to study the stability of the gene expression system and analyze equilibria, using very general assumptions regarding the parameter values incorporated into the model. We prove that, given realistic biological parameter values, the system will result in a unique, stable equilibrium solution. Additionally, we give an example of this long-term behavior using the model alongside actual experimental data obtained from Drosophila embryos.


Assuntos
Drosophila , Regulação da Expressão Gênica no Desenvolvimento , Expressão Gênica , Modelos Biológicos , Animais , Drosophila/embriologia , Drosophila/genética
12.
PLoS One ; 12(10): e0185570, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28982128

RESUMO

Understanding the molecular machinery involved in transcriptional regulation is central to improving our knowledge of an organism's development, disease, and evolution. The building blocks of this complex molecular machinery are an organism's genomic DNA sequence and transcription factor proteins. Despite the vast amount of sequence data now available for many model organisms, predicting where transcription factors bind, often referred to as 'motif detection' is still incredibly challenging. In this study, we develop a novel bioinformatic approach to binding site prediction. We do this by extending pre-existing SVM approaches in an unbiased way to include all possible gapped k-mers, representing different combinations of complex nucleotide dependencies within binding sites. We show the advantages of this new approach when compared to existing SVM approaches, through a rigorous set of cross-validation experiments. We also demonstrate the effectiveness of our new approach by reporting on its improved performance on a set of 127 genomic regions known to regulate gene expression along the anterio-posterior axis in early Drosophila embryos.


Assuntos
Aprendizado de Máquina , Nucleotídeos/metabolismo , Fatores de Transcrição/metabolismo , Sítios de Ligação , Máquina de Vetores de Suporte
13.
Mech Dev ; 141: 51-61, 2016 08.
Artigo em Inglês | MEDLINE | ID: mdl-27264535

RESUMO

In the development of the Drosophila embryo, gene expression is directed by the sequence-specific interactions of a large network of protein transcription factors (TFs) and DNA cis-regulatory binding sites. Once the identity of the typically 8-10bp binding sites for any given TF has been determined by one of several experimental procedures, the sequences can be represented in a position weight matrix (PWM) and used to predict the location of additional TF binding sites elsewhere in the genome. Often, alignments of large (>200bp) genomic fragments that have been experimentally determined to bind the TF of interest in Chromatin Immunoprecipitation (ChIP) studies are trimmed under the assumption that the majority of the binding sites are located near the center of all the aligned fragments. In this study, ChIP/chip datasets are analyzed using the corresponding PWMs for the well-studied TFs; CAUDAL, HUNCHBACK, KNIRPS and KRUPPEL, to determine the distribution of predicted binding sites. All four TFs are critical regulators of gene expression along the anterio-posterior axis in early Drosophila development. For all four TFs, the ChIP peaks contain multiple binding sites that are broadly distributed across the genomic region represented by the peak, regardless of the prediction stringency criteria used. This result suggests that ChIP peak trimming may exclude functional binding sites from subsequent analyses.


Assuntos
Proteínas de Ligação a DNA/genética , Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Proteínas de Homeodomínio/genética , Fatores de Transcrição Kruppel-Like/genética , Proteínas Repressoras/genética , Fatores de Transcrição/genética , Animais , Sítios de Ligação , Imunoprecipitação da Cromatina , Biologia Computacional , Drosophila melanogaster/crescimento & desenvolvimento , Regulação da Expressão Gênica no Desenvolvimento , Genoma de Inseto/genética , Análise de Sequência com Séries de Oligonucleotídeos , Ligação Proteica
14.
Gene Regul Syst Bio ; 10: 21-33, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27330274

RESUMO

A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development.

15.
Elife ; 52016 05 06.
Artigo em Inglês | MEDLINE | ID: mdl-27152947

RESUMO

Enhancers constitute one of the major components of regulatory machinery of metazoans. Although several genome-wide studies have focused on finding and locating enhancers in the genomes, the fundamental principles governing their internal architecture and cis-regulatory grammar remain elusive. Here, we describe an extensive, quantitative perturbation analysis targeting the dorsal-ventral patterning gene regulatory network (GRN) controlled by Drosophila NF-κB homolog Dorsal. To understand transcription factor interactions on enhancers, we employed an ensemble of mathematical models, testing effects of cooperativity, repression, and factor potency. Models trained on the dataset correctly predict activity of evolutionarily divergent regulatory regions, providing insights into spatial relationships between repressor and activator binding sites. Importantly, the collective predictions of sets of models were effective at novel enhancer identification and characterization. Our study demonstrates how experimental dataset and modeling can be effectively combined to provide quantitative insights into cis-regulatory information on a genome-wide scale.


Assuntos
Padronização Corporal/genética , Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Elementos Facilitadores Genéticos , Proteínas Nucleares/genética , Fosfoproteínas/genética , Fatores de Transcrição/genética , Animais , Drosophila melanogaster/crescimento & desenvolvimento , Regulação da Expressão Gênica no Desenvolvimento , Redes Reguladoras de Genes/genética , Genoma de Inseto , Modelos Teóricos
16.
Cell Syst ; 1(6): 379-80, 2015 Dec 23.
Artigo em Inglês | MEDLINE | ID: mdl-27136351

RESUMO

Genomic information includes not just a "parts list" of encoded proteins and RNAs, but also the information on regulation and function. To understand this more complex, deeper layer of biological information, recent efforts have turned to mathematical models as discovery engines of the cis regulatory code.

17.
PeerJ ; 3: e1022, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26157608

RESUMO

It is well known that gene regulation is a tightly controlled process in early organismal development. However, the roles of key processes involved in this regulation, such as transcription and translation, are less well understood, and mathematical modeling approaches in this field are still in their infancy. In recent studies, biologists have taken precise measurements of protein and mRNA abundance to determine the relative contributions of key factors involved in regulating protein levels in mammalian cells. We now approach this question from a mathematical modeling perspective. In this study, we use a simple dynamic mathematical model that incorporates terms representing transcription, translation, mRNA and protein decay, and diffusion in an early Drosophila embryo. We perform global sensitivity analyses on this model using various different initial conditions and spatial and temporal outputs. Our results indicate that transcription and translation are often the key parameters to determine protein abundance. This observation is in close agreement with the experimental results from mammalian cells for various initial conditions at particular time points, suggesting that a simple dynamic model can capture the qualitative behavior of a gene. Additionally, we find that parameter sensitivites are temporally dynamic, illustrating the importance of conducting a thorough global sensitivity analysis across multiple time points when analyzing mathematical models of gene regulation.

18.
Mech Dev ; 131: 68-77, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24514265

RESUMO

In Drosophila, the 330 kb bithorax complex regulates cellular differentiation along the anterior­posterior axis during development in the thorax and abdomen and is comprised of three homeotic genes: Ultrabithorax, abdominal-A, and Abdominal-B. The expression of each of these genes is in turn controlled through interactions between transcription factors and a number of cis-regulatory modules in the neighboring intergenic regions. In this study, we examine how the sequence architecture of transcription factor binding sites mediates the functional activity of one of these cis-regulatory modules. Using computational, mathematical modeling and experimental molecular genetic approaches we investigate the IAB7b enhancer, which regulates Abdominal-B expression specifically in the presumptive seventh and ninth abdominal segments of the early embryo. A cross-species comparison of the IAB7b enhancer reveals an evolutionarily conserved signature motif containing two FUSHI-TARAZU activator transcription factor binding sites. We find that the transcriptional repressors KNIRPS, KRUPPEL and GIANT are able to restrict reporter gene expression to the posterior abdominal segments, using different molecular mechanisms including short-range repression and competitive binding. Additionally, we show the functional importance of the spacing between the two FUSHI-TARAZU binding sites and discuss the potential importance of cooperativity for transcriptional activation. Our results demonstrate that the transcriptional output of the IAB7b cis-regulatory module relies on a complex set of combinatorial inputs mediated by specific transcription factor binding and that the sequence architecture at this enhancer is critical to maintain robust regulatory function.


Assuntos
Diferenciação Celular/genética , Proteínas de Drosophila/genética , Elementos Facilitadores Genéticos/genética , Proteínas de Homeodomínio/genética , Animais , Animais Geneticamente Modificados , Sítios de Ligação , Drosophila/genética , Drosophila/crescimento & desenvolvimento , Proteínas de Drosophila/metabolismo , Fatores de Transcrição Fushi Tarazu/genética , Regulação da Expressão Gênica no Desenvolvimento , Genes Homeobox , Proteínas de Homeodomínio/metabolismo , Regiões Promotoras Genéticas , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
19.
SIAM J Appl Math ; 73(2): 804-826, 2013 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-25328249

RESUMO

High-throughput genome sequencing and transcriptome analysis have provided researchers with a quantitative basis for detailed modeling of gene expression using a wide variety of mathematical models. Two of the most commonly employed approaches used to model eukaryotic gene regulation are systems of differential equations, which describe time-dependent interactions of gene networks, and thermodynamic equilibrium approaches that can explore DNA-level transcriptional regulation. To combine the strengths of these approaches, we have constructed a new two-layer mathematical model that provides a dynamical description of gene regulatory systems, using detailed DNA-based information, as well as spatial and temporal transcription factor concentration data. We also developed a semi-implicit numerical algorithm for solving the model equations and demonstrate here the efficiency of this algorithm through stability and convergence analyses. To test the model, we used it together with the semi-implicit algorithm to simulate a Drosophila gene regulatory circuit that drives development in the dorsal-ventral axis of the blastoderm-stage embryo, involving three genes. For model validation, we have done both mathematical and statistical comparisons between the experimental data and the model's simulated data. Where protein and cis-regulatory information is available, our two-layer model provides a method for recapitulating and predicting dynamic aspects of eukaryotic transcriptional systems that will greatly improve our understanding of gene regulation at a global level.

20.
BMC Syst Biol ; 4: 142, 2010 Oct 24.
Artigo em Inglês | MEDLINE | ID: mdl-20969803

RESUMO

BACKGROUND: Quantitative models of gene expression generate parameter values that can shed light on biological features such as transcription factor activity, cooperativity, and local effects of repressors. An important element in such investigations is sensitivity analysis, which determines how strongly a model's output reacts to variations in parameter values. Parameters of low sensitivity may not be accurately estimated, leading to unwarranted conclusions. Low sensitivity may reflect the nature of the biological data, or it may be a result of the model structure. Here, we focus on the analysis of thermodynamic models, which have been used extensively to analyze gene transcription. Extracted parameter values have been interpreted biologically, but until now little attention has been given to parameter sensitivity in this context. RESULTS: We apply local and global sensitivity analyses to two recent transcriptional models to determine the sensitivity of individual parameters. We show that in one case, values for repressor efficiencies are very sensitive, while values for protein cooperativities are not, and provide insights on why these differential sensitivities stem from both biological effects and the structure of the applied models. In a second case, we demonstrate that parameters that were thought to prove the system's dependence on activator-activator cooperativity are relatively insensitive. We show that there are numerous parameter sets that do not satisfy the relationships proferred as the optimal solutions, indicating that structural differences between the two types of transcriptional enhancers analyzed may not be as simple as altered activator cooperativity. CONCLUSIONS: Our results emphasize the need for sensitivity analysis to examine model construction and forms of biological data used for modeling transcriptional processes, in order to determine the significance of estimated parameter values for thermodynamic models. Knowledge of parameter sensitivities can provide the necessary context to determine how modeling results should be interpreted in biological systems.


Assuntos
Modelos Genéticos , Transcrição Gênica , Algoritmos , Termodinâmica , Ativação Transcricional
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa