Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
PLoS Comput Biol ; 15(11): e1007337, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31682597

RESUMO

Gene expression governs cell fate, and is regulated via a complex interplay of transcription factors and molecules that change chromatin structure. Advances in sequencing-based assays have enabled investigation of these processes genome-wide, leading to large datasets that combine information on the dynamics of gene expression, transcription factor binding and chromatin structure as cells differentiate. While numerous studies focus on the effects of these features on broader gene regulation, less work has been done on the mechanisms of gene-specific transcriptional control. In this study, we have focussed on the latter by integrating gene expression data for the in vitro differentiation of murine ES cells to macrophages and cardiomyocytes, with dynamic data on chromatin structure, epigenetics and transcription factor binding. Combining a novel strategy to identify communities of related control elements with a penalized regression approach, we developed individual models to identify the potential control elements predictive of the expression of each gene. Our models were compared to an existing method and evaluated using the existing literature and new experimental data from embryonic stem cell differentiation reporter assays. Our method is able to identify transcriptional control elements in a gene specific manner that reflect known regulatory relationships and to generate useful hypotheses for further testing.


Assuntos
Diferenciação Celular/genética , Ensaios de Triagem em Larga Escala/métodos , Elementos Reguladores de Transcrição/genética , Animais , Diferenciação Celular/fisiologia , Cromatina/metabolismo , Bases de Dados Genéticas , Epigênese Genética , Epigenômica , Regulação da Expressão Gênica/genética , Genoma , Macrófagos/metabolismo , Camundongos , Células-Tronco Embrionárias Murinas/metabolismo , Miócitos Cardíacos/metabolismo , Regiões Promotoras Genéticas , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/metabolismo
2.
Dev Cell ; 36(5): 572-87, 2016 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-26923725

RESUMO

Metazoan development involves the successive activation and silencing of specific gene expression programs and is driven by tissue-specific transcription factors programming the chromatin landscape. To understand how this process executes an entire developmental pathway, we generated global gene expression, chromatin accessibility, histone modification, and transcription factor binding data from purified embryonic stem cell-derived cells representing six sequential stages of hematopoietic specification and differentiation. Our data reveal the nature of regulatory elements driving differential gene expression and inform how transcription factor binding impacts on promoter activity. We present a dynamic core regulatory network model for hematopoietic specification and demonstrate its utility for the design of reprogramming experiments. Functional studies motivated by our genome-wide data uncovered a stage-specific role for TEAD/YAP factors in mammalian hematopoietic specification. Our study presents a powerful resource for studying hematopoiesis and demonstrates how such data advance our understanding of mammalian development.


Assuntos
Diferenciação Celular/genética , Células-Tronco Embrionárias/citologia , Regulação da Expressão Gênica no Desenvolvimento/genética , Redes Reguladoras de Genes/genética , Hematopoese/fisiologia , Células-Tronco Hematopoéticas/citologia , Animais , Linhagem da Célula/fisiologia , Proteínas de Homeodomínio/metabolismo , Camundongos , Ligação Proteica/genética , Fatores de Transcrição/metabolismo
3.
Blood ; 127(13): e12-23, 2016 Mar 31.
Artigo em Inglês | MEDLINE | ID: mdl-26809507

RESUMO

Comprehensive study of transcriptional control processes will be required to enhance our understanding of both normal and malignant hematopoiesis. Modern sequencing technologies have revolutionized our ability to generate genome-scale expression and histone modification profiles, transcription factor (TF)-binding maps, and also comprehensive chromatin-looping information. Many of these technologies, however, require large numbers of cells, and therefore cannot be applied to rare hematopoietic stem/progenitor cell (HSPC) populations. The stem cell factor-dependent multipotent progenitor cell line HPC-7 represents a well-recognized cell line model for HSPCs. Here we report genome-wide maps for 17 TFs, 3 histone modifications, DNase I hypersensitive sites, and high-resolution promoter-enhancer interactomes in HPC-7 cells. Integrated analysis of these complementary data sets revealed TF occupancy patterns of genomic regions involved in promoter-anchored loops. Moreover, preferential associations between pairs of TFs bound at either ends of chromatin loops led to the identification of 4 previously unrecognized protein-protein interactions between key blood stem cell regulators. All HPC-7 data sets are freely available both through standard repositories and a user-friendly Web interface. Together with previously generated genome-wide data sets, this study integrates HPC-7 data into a genomic resource on par with ENCODE tier 1 cell lines and, importantly, is the only current model with comprehensive genome-scale data that is relevant to HSPC biology.


Assuntos
Regulação da Expressão Gênica , Hematopoese/genética , Células-Tronco Hematopoéticas/metabolismo , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação/genética , Células Cultivadas , Imunoprecipitação da Cromatina , Embrião de Mamíferos , Genoma , Células HEK293 , Humanos , Camundongos , Camundongos Transgênicos , Regiões Promotoras Genéticas , Ligação Proteica/genética , Fatores de Transcrição/genética
4.
Open Biol ; 5(11)2015 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-26538567

RESUMO

Morphological evolution is driven both by coding sequence variation and by changes in regulatory sequences. However, how cis-regulatory modules (CRMs) evolve to generate entirely novel expression domains is largely unknown. Here, we reconstruct the evolutionary history of a lens enhancer located within a CRM that not only predates the lens, a vertebrate innovation, but bilaterian animals in general. Alignments of orthologous sequences from different deuterostomes sub-divide the CRM into a deeply conserved core and a more divergent flanking region. We demonstrate that all deuterostome flanking regions, including invertebrate sequences, activate gene expression in the zebrafish lens through the same ancient cluster of activator sites. However, levels of gene expression vary between species due to the presence of repressor motifs in flanking region and core. These repressor motifs are responsible for the relatively weak enhancer activity of tetrapod flanking regions. Ray-finned fish, however, have gained two additional lineage-specific activator motifs which in combination with the ancient cluster of activators and the core constitute a potent lens enhancer. The exploitation and modification of existing regulatory potential in flanking regions but not in the highly conserved core might represent a more general model for the emergence of novel regulatory functions in complex CRMs.


Assuntos
Sequência Conservada , Evolução Molecular , Sequências Reguladoras de Ácido Nucleico , Animais , Sequência de Bases , Galinhas , Humanos , Dados de Sequência Molecular , Motivos de Nucleotídeos , Fatores de Transcrição SOXB2/genética , Fatores de Transcrição SOXB2/metabolismo , Takifugu , Xenopus , Peixe-Zebra
5.
PLoS Genet ; 9(11): e1003904, 2013 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-24282393

RESUMO

Urochordates are the closest relatives of vertebrates and at the larval stage, possess a characteristic bilateral chordate body plan. In vertebrates, the genes that orchestrate embryonic patterning are in part regulated by highly conserved non-coding elements (CNEs), yet these elements have not been identified in urochordate genomes. Consequently the evolution of the cis-regulatory code for urochordate development remains largely uncharacterised. Here, we use genome-wide comparisons between C. intestinalis and C. savignyi to identify putative urochordate cis-regulatory sequences. Ciona conserved non-coding elements (ciCNEs) are associated with largely the same key regulatory genes as vertebrate CNEs. Furthermore, some of the tested ciCNEs are able to activate reporter gene expression in both zebrafish and Ciona embryos, in a pattern that at least partially overlaps that of the gene they associate with, despite the absence of sequence identity. We also show that the ability of a ciCNE to up-regulate gene expression in vertebrate embryos can in some cases be localised to short sub-sequences, suggesting that functional cross-talk may be defined by small regions of ancestral regulatory logic, although functional sub-sequences may also be dispersed across the whole element. We conclude that the structure and organisation of cis-regulatory modules is very different between vertebrates and urochordates, reflecting their separate evolutionary histories. However, functional cross-talk still exists because the same repertoire of transcription factors has likely guided their parallel evolution, exploiting similar sets of binding sites but in different combinations.


Assuntos
Ciona intestinalis/genética , Evolução Molecular , Redes Reguladoras de Genes , Peixe-Zebra/genética , Animais , Sequência Conservada , Regulação da Expressão Gênica , Urocordados , Vertebrados/genética
6.
Biol Open ; 2(11): 1229-38, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24244860

RESUMO

Comprehensive analysis of cis-regulatory elements is key to understanding the dynamic gene regulatory networks that control embryonic development. While transgenic animals represent the gold standard assay, their generation is costly, entails significant animal usage, and in utero development complicates time-course studies. As an alternative, embryonic stem (ES) cells can readily be differentiated in a process that correlates well with developing embryos. Here, we describe a highly effective platform for enhancer assays using an Hsp68/Venus reporter cassette that targets to the Hprt locus in mouse ES cells. This platform combines the flexibility of Gateway® cloning, live cell trackability of a fluorescent reporter, low background and the advantages of single copy insertion into a defined genomic locus. We demonstrate the successful recapitulation of tissue-specific enhancer activity for two cardiac and two haematopoietic enhancers. In addition, we used this assay to dissect the functionality of the highly conserved Ets/Ets/Gata motif in the Scl+19 enhancer, which revealed that the Gata motif is not required for initiation of enhancer activity. We further confirmed that Gata2 is not required for endothelial activity of the Scl+19 enhancer using Gata2(-/-) Scl+19 transgenic embryos. We have therefore established a valuable toolbox to study gene regulatory networks with broad applicability.

7.
Brief Funct Genomics ; 12(2): 142-60, 2013 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-23117864

RESUMO

A key finding from early genomics research is the remarkable consistency in the number of protein-coding regions across diverse species. This has led many researchers to look to the cis-regulatory elements of genes as the fundamental influence behind evolving gene function and subsequent species diversification. Historically, since these elements are often located in vast intergenic and intronic regions of the genome, their identification has been recalcitrant. Now, with the deluge of whole-genome data from representatives of numerous eukaryotic lineages, various approaches have enabled us to begin to recognize features that characterize regulatory regions of the genome. Here we endeavour to collate these approaches in order to give an overview of the complexities involved in extrapolating regulatory signatures. The resource provided by the escalating richness of whole-genome datasets enables more sophisticated modelling of these regulatory signatures yet at the same time introduces increasing potential for noise. While we are only at the advent of making these discoveries, the next decade promises to be a very exciting and rewarding time for genome researchers.


Assuntos
Eucariotos/genética , Regulação da Expressão Gênica , Genoma/genética , Sequências Reguladoras de Ácido Nucleico/genética , Animais , Sequência de Bases , Humanos , Fatores de Transcrição/metabolismo
8.
Development ; 138(5): 879-84, 2011 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-21247963

RESUMO

Within the vertebrate lineage, a high proportion of duplicate genes have been retained after whole genome duplication (WGD) events. It has been proposed that many of these duplicate genes became indispensable because the ancestral gene function was divided between them. In addition, novel functions may have evolved, owing to changes in cis-regulatory elements. Functional analysis of the PAX2/5/8 gene subfamily appears to support at least the first part of this hypothesis. The collective role of these genes has been widely retained, but sub-functions have been differentially partitioned between the genes in different vertebrates. Conserved non-coding elements (CNEs) represent an interesting and readily identifiable class of putative cis-regulatory elements that have been conserved from fish to mammals, an evolutionary distance of 450 million years. Within the PAX2/5/8 gene subfamily, PAX2 is associated with the highest number of CNEs. An additional WGD experienced in the teleost lineage led to two copies of pax2, each of which retained a large proportion of these CNEs. Using a reporter gene assay in zebrafish embryos, we have exploited this rich collection of regulatory elements in order to determine whether duplicate CNEs have evolved different functions. Remarkably, we find that even highly conserved sequences exhibit more functional differences than similarities. We also discover that short flanking sequences can have a profound impact on CNE function. Therefore, if CNEs are to be used as candidate enhancers for transgenic studies or for multi-species comparative analyses, it is paramount that the CNEs are accurately delineated.


Assuntos
Sequência Conservada , Elementos Facilitadores Genéticos/fisiologia , Genes Duplicados , Genoma/genética , Animais , Biologia Computacional , Embrião não Mamífero , Genes Reporter , Fator de Transcrição PAX2/genética , Fator de Transcrição PAX2/fisiologia , Fator de Transcrição PAX5 , Fator de Transcrição PAX8 , Fatores de Transcrição Box Pareados , Pesquisa/normas , Peixe-Zebra , Proteínas de Peixe-Zebra
9.
PLoS Genet ; 5(12): e1000762, 2009 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-20011110

RESUMO

Comparisons between diverse vertebrate genomes have uncovered thousands of highly conserved non-coding sequences, an increasing number of which have been shown to function as enhancers during early development. Despite their extreme conservation over 500 million years from humans to cartilaginous fish, these elements appear to be largely absent in invertebrates, and, to date, there has been little understanding of their mode of action or the evolutionary processes that have modelled them. We have now exploited emerging genomic sequence data for the sea lamprey, Petromyzon marinus, to explore the depth of conservation of this type of element in the earliest diverging extant vertebrate lineage, the jawless fish (agnathans). We searched for conserved non-coding elements (CNEs) at 13 human gene loci and identified lamprey elements associated with all but two of these gene regions. Although markedly shorter and less well conserved than within jawed vertebrates, identified lamprey CNEs are able to drive specific patterns of expression in zebrafish embryos, which are almost identical to those driven by the equivalent human elements. These CNEs are therefore a unique and defining characteristic of all vertebrates. Furthermore, alignment of lamprey and other vertebrate CNEs should permit the identification of persistent sequence signatures that are responsible for common patterns of expression and contribute to the elucidation of the regulatory language in CNEs. Identifying the core regulatory code for development, common to all vertebrates, provides a foundation upon which regulatory networks can be constructed and might also illuminate how large conserved regulatory sequence blocks evolve and become fixed in genomic DNA.


Assuntos
Evolução Biológica , Sequências Reguladoras de Ácido Nucleico , Vertebrados/genética , Animais , Humanos , Lampreias/genética
10.
Dev Dyn ; 238(12): 2951-74, 2009 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-19924770

RESUMO

Whole genome duplication events are thought to have substantially contributed to organismal complexity, largely via divergent transcriptional regulation. Members of the vertebrate PAX2, PAX5 and PAX8 gene subfamily derived from an ancient class of paired box genes and arose from such whole genome duplication events. These genes are critical in establishing the midbrain-hindbrain boundary, specifying interneuron populations and for eye, ear and kidney development. Also PAX2 has adopted a unique role in pancreas development, whilst PAX5 is essential for early B-cell differentiation. The contribution of PAX258 genes to their collective role has diverged across paralogues and the animal lineages, resulting in a complex wealth of literature. It is now timely to provide a comprehensive comparative overview of these genes and their ancient and divergent roles. We also discuss their fundamental place within gene regulatory networks and the likely influence of cis-regulatory elements over their differential roles during early animal development.


Assuntos
Fator de Transcrição PAX2/fisiologia , Fator de Transcrição PAX5/fisiologia , Fatores de Transcrição Box Pareados/fisiologia , Animais , Olho/embriologia , Olho/crescimento & desenvolvimento , Olho/metabolismo , Humanos , Modelos Biológicos , Família Multigênica/genética , Família Multigênica/fisiologia , Fator de Transcrição PAX2/genética , Fator de Transcrição PAX2/metabolismo , Fator de Transcrição PAX5/genética , Fator de Transcrição PAX5/metabolismo , Fator de Transcrição PAX8 , Fatores de Transcrição Box Pareados/genética , Fatores de Transcrição Box Pareados/metabolismo , Filogenia
11.
Evol Bioinform Online ; 5: 5-13, 2009 May 18.
Artigo em Inglês | MEDLINE | ID: mdl-19812723

RESUMO

BACKGROUND: In vertebrates the "SONIC HEDGEHOG" signalling pathway has been implicated in cell-fate determination, proliferation and the patterning of many different cell types and organs. As the GLI family members (GLI1, GLI2 and GLI3) are key mediators of hedgehog morphogenetic signals, over the past couple of decades they have been extensively scrutinized by genetic, molecular and biochemical means. Thus, a great deal of information is currently available about the functional aspects of GLI proteins in various vertebrate species. To address the roles of GLI genes in diversifying the repertoire of the Hh signalling and deploying them for the vertebrate specifications, in this study we have examined the evolutionary patterns of vertebrate GLI sequences within and between species. RESULTS: Phylogenetic tree analysis suggests that the vertebrate GLI1, GLI2 and GLI3 genes diverged after the separation of urochordates from vertebrates and before the tetrapods-bony fishes split. Lineage specific duplication events were also detected. Estimation of mode and strength of selection acting on GLI orthologs demonstrated that all members of the GLI gene family experienced more relaxed selection in teleost fish than in the mammalian lineage. Furthermore, the GLI1 gene appeared to have been exposed to different functional constraints in fish and tetrapod lineages, whilst a similar level of functional constraints on GLI2 and GLI3 was suggested by comparable average non-synonymous (Ka) substitutions across the lineages. A relative rate test suggested that the majority of the paralogous copies of the GLI family analyzed evolved with similar evolutionary rates except GLI1 which evolved at a significantly faster rate than its paralogous counterparts in tetrapods. CONCLUSIONS: Our analysis shows that sequence evolutionary patterns of GLI family members are largely correlated with the reported similarities and differences in the functionality of GLI proteins within and between the various vertebrate species. We propose that duplication and divergence of GLI genes has increased in the complexity of vertebrate body plan by recruiting the hedgehog signalling for the novel developmental tasks.

12.
BMC Dev Biol ; 7: 100, 2007 Aug 30.
Artigo em Inglês | MEDLINE | ID: mdl-17760977

RESUMO

BACKGROUND: Comparative genomics is currently one of the most popular approaches to study the regulatory architecture of vertebrate genomes. Fish-mammal genomic comparisons have proved powerful in identifying conserved non-coding elements likely to be distal cis-regulatory modules such as enhancers, silencers or insulators that control the expression of genes involved in the regulation of early development. The scientific community is showing increasing interest in characterizing the function, evolution and language of these sequences. Despite this, there remains little in the way of user-friendly access to a large dataset of such elements in conjunction with the analysis and the visualization tools needed to study them. DESCRIPTION: Here we present CONDOR (COnserved Non-coDing Orthologous Regions) available at: http://condor.fugu.biology.qmul.ac.uk. In an interactive and intuitive way the website displays data on > 6800 non-coding elements associated with over 120 early developmental genes and conserved across vertebrates. The database regularly incorporates results of ongoing in vivo zebrafish enhancer assays of the CNEs carried out in-house, which currently number approximately 100. Included and highlighted within this set are elements derived from duplication events both at the origin of vertebrates and more recently in the teleost lineage, thus providing valuable data for studying the divergence of regulatory roles between paralogs. CONDOR therefore provides a number of tools and facilities to allow scientists to progress in their own studies on the function and evolution of developmental cis-regulation. CONCLUSION: By providing access to data with an approachable graphics interface, the CONDOR database presents a rich resource for further studies into the regulation and evolution of genes involved in early development.


Assuntos
Sequência Conservada , Bases de Dados de Ácidos Nucleicos , Regulação da Expressão Gênica no Desenvolvimento , Genômica , Vertebrados/genética , Animais , Sequência de Bases , Biologia Computacional , Evolução Molecular , Filogenia
13.
Dev Growth Differ ; 49(6): 543-53, 2007 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-17661744

RESUMO

The zinc-finger transcription factor GLI3 acts during vertebrate development in a combinatorial, context-dependent fashion as a primary transducer of sonic hedgehog (SHH) signaling. In humans, mutations affecting this key regulator of development are associated with GLI3-morphopathies, a group of congenital malformations in which forebrain and limb development are preferentially affected. We show that a non-coding element from intron two of GLI3, ultraconserved in mammals and highly conserved in the pufferfish Fugu, is a transcriptional enhancer. In transient transfection assays, it activates reporter gene transcription in human cell cultures expressing endogenous GLI3 but not in GLI3 negative cells. The identified enhancer element is predicted to contain conserved binding sites for transcription factors crucial for developmental steps in which GLI3 is involved. The regulatory potential of this element is conserved and was used to direct tissue-specific expression of a green fluorescent protein reporter gene in zebrafish embryos and of a beta-galactosidase reporter in transgenic mouse embryos. Time, location, and quantity of reporter gene expression are congruent with part of the pattern previously reported for endogenous GLI3 transcription.


Assuntos
Sequência Conservada , Elementos Facilitadores Genéticos , Regulação da Expressão Gênica no Desenvolvimento/fisiologia , Fatores de Transcrição Kruppel-Like/genética , Proteínas do Tecido Nervoso/genética , Animais , Sequência de Bases , Linhagem Celular Tumoral , Feminino , Humanos , Fatores de Transcrição Kruppel-Like/biossíntese , Masculino , Camundongos , Camundongos Endogâmicos CBA , Camundongos Transgênicos , Dados de Sequência Molecular , Proteínas do Tecido Nervoso/biossíntese , Ratos , Takifugu/genética , Peixe-Zebra/genética , Proteína Gli3 com Dedos de Zinco
14.
PLoS One ; 2(4): e366, 2007 Apr 11.
Artigo em Inglês | MEDLINE | ID: mdl-17426814

RESUMO

The zinc-finger transcription factor GLI3 is a key regulator of development, acting as a primary transducer of Sonic hedgehog (SHH) signaling in a combinatorial context dependent fashion controlling multiple patterning steps in different tissues/organs. A tight temporal and spatial control of gene expression is indispensable, however, cis-acting sequence elements regulating GLI3 expression have not yet been reported. We show that 11 ancient genomic DNA signatures, conserved from the pufferfish Takifugu (Fugu) rubripes to man, are distributed throughout the introns of human GLI3. They map within larger conserved non-coding elements (CNEs) that are found in the tetrapod lineage. Full length CNEs transiently transfected into human cell cultures acted as cell type specific enhancers of gene transcription. The regulatory potential of these elements is conserved and was exploited to direct tissue specific expression of a reporter gene in zebrafish embryos. Assays of deletion constructs revealed that the human-Fugu conserved sequences within the GLI3 intronic CNEs were essential but not sufficient for full-scale transcriptional activation. The enhancer activity of the CNEs is determined by a combinatorial effect of a core sequence conserved between human and teleosts (Fugu) and flanking tetrapod-specific sequences, suggesting that successive clustering of sequences with regulatory potential around an ancient, highly conserved nucleus might be a possible mechanism for the evolution of cis-acting regulatory elements.


Assuntos
Sequência Conservada , Elementos Facilitadores Genéticos , Fatores de Transcrição Kruppel-Like/genética , Proteínas do Tecido Nervoso/genética , Animais , Sequência de Bases , Sítios de Ligação , Linhagem Celular Tumoral , DNA , Deleção de Genes , Regulação da Expressão Gênica no Desenvolvimento , Humanos , Fatores de Transcrição Kruppel-Like/metabolismo , Dados de Sequência Molecular , Proteínas do Tecido Nervoso/metabolismo , Filogenia , Homologia de Sequência do Ácido Nucleico , Fatores de Transcrição/metabolismo , Peixe-Zebra , Proteína Gli3 com Dedos de Zinco
15.
Artigo em Inglês | MEDLINE | ID: mdl-20483234

RESUMO

We recently identified approximately 1400 conserved non-coding elements (CNEs) shared by the genomes of fugu (Takifugu rubripes) and human that appear to be associated with developmental regulation in vertebrates [Woolfe, A., Goodson, M., Goode, D.K., Snell, P., McEwen, G.K., Vavouri, T., Smith, S.F., North, P., Callaway, H., Kelly, K., Walter, K., Abnizova, I., Gilks, W., Edwards, Y.J.K., Cooke, J.E., Elgar, G., 2005. Highly conserved non-coding sequences are associated with vertebrate development. PLoS Biol. 3 (1), e7]. This study encompassed a multi-disciplinary approach using bioinformatics, statistical methods and functional assays to identify and characterise the CNEs. Using an in vivo enhancer assay, over 90% of tested CNEs up-regulate tissue-specific GFP expression. Here we review our group's research in the field of characterising non-coding sequences conserved in vertebrates. We take this opportunity to discuss our research in progress and present some results of new and additional analyses. These include a phylogenomics analysis of CNEs, sequence conservation patterns in vertebrate CNEs and the distribution of human SNPs in the CNEs. We highlight the usefulness of the CNE dataset to help correlate genetic variation in health and disease. We also discuss the functional analysis using the enhancer assay and the enrichment of predicted transcription factor binding sites for two CNEs. Public access to the CNEs plus annotation is now possible and is described. The content of this review was presented by Dr. Y.J.K. Edwards at the TODAI International Symposium on Functional Genomics of the Pufferfish, Tokyo, Japan, 3-6 November 2004.

16.
Genomics ; 86(2): 172-81, 2005 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-15939571

RESUMO

Comparative genomic analysis reveals an exceptionally large section of conserved shared synteny between the human 7q36 chromosomal region and the pufferfish (Fugu rubripes) genome. Remarkably, this conservation extends not only to gene order across 16 genes, but also to the position and orientation of a number of prominent conserved noncoding elements (CNEs). A functional assay using zebrafish has shown that most of the CNEs have reproducible and specific enhancer activity. This enhancer activity is often detected in a subset of tissues which reflect the endogenous expression pattern of a proximal gene, though some CNEs may act over a long range. We propose that the distribution of CNEs, and their probable association with a number of genes throughout the region, imposes a critical constraint on genome architecture, resulting in the maintenance of such a large section of conserved synteny across the vertebrate lineage.


Assuntos
Cromossomos Humanos Par 7 , Genoma Humano , Sequências Reguladoras de Ácido Nucleico , Sintenia , Transativadores/genética , Animais , Mapeamento Cromossômico , Sequência Conservada , Primers do DNA/química , Elementos Facilitadores Genéticos , Proteínas de Fluorescência Verde/metabolismo , Proteínas Hedgehog , Humanos , Modelos Genéticos , Dados de Sequência Molecular , Homologia de Sequência do Ácido Nucleico , Distribuição Tecidual , Peixe-Zebra
17.
PLoS Biol ; 3(1): e7, 2005 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-15630479

RESUMO

In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH), in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development, including many transcription factors. These highly conserved non-coding sequences are likely to form part of the genomic circuitry that uniquely defines vertebrate development.


Assuntos
Regulação da Expressão Gênica no Desenvolvimento , Genoma Humano , Sequências Reguladoras de Ácido Nucleico , Takifugu/genética , Animais , Sequência Conservada , Bases de Dados Genéticas , Elementos Facilitadores Genéticos , Proteínas do Olho/metabolismo , Genoma , Proteínas de Fluorescência Verde/metabolismo , Proteínas Hedgehog , Proteínas de Grupo de Alta Mobilidade/metabolismo , Proteínas de Homeodomínio/metabolismo , Humanos , Dados de Sequência Molecular , Família Multigênica , Proteínas de Neoplasias/metabolismo , Fator de Transcrição PAX6 , Fatores de Transcrição Box Pareados/metabolismo , Proteínas Repressoras/metabolismo , Fatores de Transcrição SOXB2 , Análise de Sequência de DNA , Especificidade da Espécie , Transativadores/metabolismo , Fatores de Transcrição/metabolismo
18.
Mamm Genome ; 14(3): 192-201, 2003 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-12647242

RESUMO

The puffer fish Takifugu rubripes (Fugu), with its compact genome, is an ideal model organism for comparative genomics. Sonic hedgehog (Shh) is a key protein in the patterning of differentiating cells during embryonic development. We have sequenced the Fugu Shh gene and compared it with the mammalian and zebrafish orthologs, identifying a number of novel conserved, non-coding sequences upstream of exon one and within the two introns. Additional conserved sequences serve to delineate activator regions and enhancers previously characterized through functional analysis. Control elements can thus be rapidly and effectively predicted by comparative methodology in its own right as well as complementing other, functional methods. This work demonstrates the value of using Fugu in comparative genomics, which has allowed identification of new putative regulatory elements, as well as corroborating enhancers identified by the more traditional deletion mapping method.


Assuntos
DNA Intergênico , Takifugu/genética , Transativadores/genética , Animais , Sequência de Bases , Mapeamento Cromossômico , Sequência Conservada , Proteínas Hedgehog , Humanos , Camundongos , Dados de Sequência Molecular , Regiões Promotoras Genéticas , Alinhamento de Sequência , Análise de Sequência de DNA
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA