Search | BVS Regional Portal

1.

An encyclopedia of enhancer-gene regulatory interactions in the human genome.

Gschwind, Andreas R; Mualim, Kristy S; Karbalayghareh, Alireza; Sheth, Maya U; Dey, Kushal K; Jagoda, Evelyn; Nurtdinov, Ramil N; Xi, Wang; Tan, Anthony S; Jones, Hank; Ma, X Rosa; Yao, David; Nasser, Joseph; Avsec, Ziga; James, Benjamin T; Shamim, Muhammad S; Durand, Neva C; Rao, Suhas S P; Mahajan, Ragini; Doughty, Benjamin R; Andreeva, Kalina; Ulirsch, Jacob C; Fan, Kaili; Perez, Elizabeth M; Nguyen, Tri C; Kelley, David R; Finucane, Hilary K; Moore, Jill E; Weng, Zhiping; Kellis, Manolis; Bassik, Michael C; Price, Alkes L; Beer, Michael A; Guigó, Roderic; Stamatoyannopoulos, John A; Lieberman Aiden, Erez; Greenleaf, William J; Leslie, Christina S; Steinmetz, Lars M; Kundaje, Anshul; Engreitz, Jesse M.

bioRxiv ; 2023 Nov 13.

Article in English | MEDLINE | ID: mdl-38014075

ABSTRACT

Identifying transcriptional enhancers and their target genes is essential for understanding gene regulation and the impact of human genetic variation on disease [1-6]. Here we create and evaluate a resource of >13 million enhancer-gene regulatory interactions across 352 cell types and tissues, by integrating predictive models, measurements of chromatin state and 3D contacts, and large-scale genetic perturbations generated by the ENCODE Consortium [7]. We first create a systematic benchmarking pipeline to compare predictive models, assembling a dataset of 10,411 element-gene pairs measured in CRISPR perturbation experiments, >30,000 fine-mapped eQTLs, and 569 fine-mapped GWAS variants linked to a likely causal gene. Using this framework, we develop a new predictive model, ENCODE-rE2G, that achieves state-of-the-art performance across multiple prediction tasks, demonstrating a strategy involving iterative perturbations and supervised machine learning to build increasingly accurate predictive models of enhancer regulation. Using the ENCODE-rE2G model, we build an encyclopedia of enhancer-gene regulatory interactions in the human genome, which reveals global properties of enhancer networks, identifies differences in the functions of genes that have more or less complex regulatory landscapes, and improves analyses to link noncoding variants to target genes and cell types for common, complex diseases. By interpreting the model, we find evidence that, beyond enhancer activity and 3D enhancer-promoter contacts, additional features guide enhancer-promoter communication, including promoter class and enhancer-enhancer synergy. Altogether, these genome-wide maps of enhancer-gene regulatory interactions, benchmarking software, predictive models, and insights about enhancer function provide a valuable resource for future studies of gene regulation and human genetics.

2.

Chromosome-length genome assembly and linkage map of a critically endangered Australian bird: the helmeted honeyeater.

Robledo-Ruiz, Diana A; Gan, Han Ming; Kaur, Parwinder; Dudchenko, Olga; Weisz, David; Khan, Ruqayya; Lieberman Aiden, Erez; Osipova, Ekaterina; Hiller, Michael; Morales, Hernán E; Magrath, Michael J L; Clarke, Rohan H; Sunnucks, Paul; Pavlova, Alexandra.

Gigascience ; 11, 2022 03 29.

Article in English | MEDLINE | ID: mdl-35348671

ABSTRACT

BACKGROUND: The helmeted honeyeater (Lichenostomus melanops cassidix) is a Critically Endangered bird endemic to Victoria, Australia. To aid its conservation, the population is the subject of genetic rescue. To understand, monitor, and modulate the effects of genetic rescue on the helmeted honeyeater genome, a chromosome-length genome and a high-density linkage map are required. RESULTS: We used a combination of Illumina, Oxford Nanopore, and Hi-C sequencing technologies to assemble a chromosome-length genome of the helmeted honeyeater, comprising 906 scaffolds, with length of 1.1 Gb and scaffold N50 of 63.8 Mb. Annotation comprised 57,181 gene models. Using a pedigree of 257 birds and 53,111 single-nucleotide polymorphisms, we obtained high-density linkage and recombination maps for 25 autosomes and Z chromosome. The total sex-averaged linkage map was 1,347 cM long, with the male map being 6.7% longer than the female map. Recombination maps revealed sexually dimorphic recombination rates (overall higher in males), with average recombination rate of 1.8 cM/Mb. Comparative analyses revealed high synteny of the helmeted honeyeater genome with that of 3 passerine species (e.g., 32 Hi-C scaffolds mapped to 30 zebra finch autosomes and Z chromosome). The genome assembly and linkage map suggest that the helmeted honeyeater exhibits a fission of chromosome 1A into 2 chromosomes relative to zebra finch. PSMC analysis showed a ~15-fold decline in effective population size to ~60,000 from mid- to late Pleistocene. CONCLUSIONS: The annotated chromosome-length genome and high-density linkage map provide rich resources for evolutionary studies and will be fundamental in guiding conservation efforts for the helmeted honeyeater.

Subjects

Passeriformes , Animals , Australia , Chromosome Mapping , Female , Genetic Linkage , Male , Passeriformes/genetics , Sex Chromosomes
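
Note on the scaffold N50 quoted in the abstract of this record: N50 is the length of the scaffold at which the cumulative length of scaffolds, sorted from longest to shortest, first reaches half of the total assembly length. The sketch below shows that standard definition only; the function name and the toy lengths are hypothetical and are not taken from the helmeted honeyeater assembly.

```python
def scaffold_n50(lengths):
    """Return the N50 of a collection of scaffold lengths (any consistent unit)."""
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)   # longest scaffold first
    half_total = sum(ordered) / 2             # half of the assembly span
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            # First scaffold at which the cumulative length covers half the assembly.
            return length


# Hypothetical toy assembly (total span 300): 80 + 70 = 150 reaches half the span.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(scaffold_n50(toy_lengths))  # prints 70
```

A larger N50 at a similar total span generally indicates a more contiguous assembly, which is why the statistic is reported alongside the scaffold count and total length.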

3.

The Earth BioGenome Project 2020: Starting the clock.

Lewin, Harris A; Richards, Stephen; Lieberman Aiden, Erez; Allende, Miguel L; Archibald, John M; Bálint, Miklós; Barker, Katharine B; Baumgartner, Bridget; Belov, Katherine; Bertorelle, Giorgio; Blaxter, Mark L; Cai, Jing; Caperello, Nicolette D; Carlson, Keith; Castilla-Rubio, Juan Carlos; Chaw, Shu-Miaw; Chen, Lei; Childers, Anna K; Coddington, Jonathan A; Conde, Dalia A; Corominas, Montserrat; Crandall, Keith A; Crawford, Andrew J; DiPalma, Federica; Durbin, Richard; Ebenezer, ThankGod E; Edwards, Scott V; Fedrigo, Olivier; Flicek, Paul; Formenti, Giulio; Gibbs, Richard A; Gilbert, M Thomas P; Goldstein, Melissa M; Graves, Jennifer Marshall; Greely, Henry T; Grigoriev, Igor V; Hackett, Kevin J; Hall, Neil; Haussler, David; Helgen, Kristofer M; Hogg, Carolyn J; Isobe, Sachiko; Jakobsen, Kjetill Sigurd; Janke, Axel; Jarvis, Erich D; Johnson, Warren E; Jones, Steven J M; Karlsson, Elinor K; Kersey, Paul J; Kim, Jin-Hyoung.

Proc Natl Acad Sci U S A ; 119(4), 2022 01 25.

Article in English | MEDLINE | ID: mdl-35042800

Subjects

Base Sequence/genetics , Eukaryota/genetics , Animals , Biodiversity , Genomics , Humans

4.

RedChIP identifies noncoding RNAs associated with genomic sites occupied by Polycomb and CTCF proteins.

Gavrilov, Alexey A; Sultanov, Rinat I; Magnitov, Mikhail D; Galitsyna, Aleksandra A; Dashinimaev, Erdem B; Lieberman Aiden, Erez; Razin, Sergey V.

Proc Natl Acad Sci U S A ; 119(1), 2022 01 04.

Article in English | MEDLINE | ID: mdl-34969862

ABSTRACT

Nuclear noncoding RNAs (ncRNAs) are key regulators of gene expression and chromatin organization. The progress in studying nuclear ncRNAs depends on the ability to identify the genome-wide spectrum of contacts of ncRNAs with chromatin. To address this question, a panel of RNA-DNA proximity ligation techniques has been developed. However, none of these techniques examines proteins involved in RNA-chromatin interactions. Here, we introduce RedChIP, a technique combining RNA-DNA proximity ligation and chromatin immunoprecipitation for identifying RNA-chromatin interactions mediated by a particular protein. Using antibodies against architectural protein CTCF and the EZH2 subunit of the Polycomb repressive complex 2, we identify a spectrum of cis- and trans-acting ncRNAs enriched at Polycomb- and CTCF-binding sites in human cells, which may be involved in Polycomb-mediated gene repression and CTCF-dependent chromatin looping. By providing a protein-centric view of RNA-DNA interactions, RedChIP represents an important tool for studies of nuclear ncRNAs.

Subjects

CCCTC-Binding Factor/metabolism , Polycomb-Group Proteins/metabolism , RNA, Untranslated/metabolism , Chromatin Immunoprecipitation , DNA-Binding Proteins/metabolism , Humans

5.

MCPH1 inhibits Condensin II during interphase by regulating its SMC2-Kleisin interface.

Houlard, Martin; Cutts, Erin E; Shamim, Muhammad S; Godwin, Jonathan; Weisz, David; Presser Aiden, Aviva; Lieberman Aiden, Erez; Schermelleh, Lothar; Vannini, Alessandro; Nasmyth, Kim.

Elife ; 10, 2021 12 01.

Article in English | MEDLINE | ID: mdl-34850681

ABSTRACT

Dramatic change in chromosomal DNA morphology between interphase and mitosis is a defining feature of the eukaryotic cell cycle. Two types of enzymes, namely cohesin and condensin, confer the topology of chromosomal DNA by extruding DNA loops. While condensin normally configures chromosomes exclusively during mitosis, cohesin does so during interphase. The processivity of cohesin's loop extrusion during interphase is limited by a regulatory factor called WAPL, which induces cohesin to dissociate from chromosomes via a mechanism that requires dissociation of its kleisin from the neck of SMC3. We show here that a related mechanism may be responsible for blocking condensin II from acting during interphase. Cells derived from patients affected by microcephaly caused by mutations in the MCPH1 gene undergo premature chromosome condensation. We show that deletion of Mcph1 in mouse embryonic stem cells unleashes an activity of condensin II that triggers formation of compact chromosomes in G1 and G2 phases, accompanied by enhanced mixing of A and B chromatin compartments, and this occurs even in the absence of CDK1 activity. Crucially, inhibition of condensin II by MCPH1 depends on the binding of a short linear motif within MCPH1 to condensin II's NCAPG2 subunit. MCPH1's ability to block condensin II's association with chromatin is abrogated by the fusion of SMC2 with NCAPH2, and hence may work by a mechanism similar to that of cohesin. Remarkably, in the absence of both WAPL and MCPH1, cohesin and condensin II transform chromosomal DNAs of G2 cells into chromosomes with a solenoidal axis.

Subjects

Cell Cycle Proteins/genetics , Cell Cycle Proteins/metabolism , Cytoskeletal Proteins/genetics , Cytoskeletal Proteins/metabolism , Embryonic Stem Cells/drug effects , Interphase/genetics , Interphase/physiology , Animals , Gene Expression Regulation , Metabolic Networks and Pathways , Mice

6.

The Easter Egg Weevil (Pachyrhynchus) genome reveals syntenic patterns in Coleoptera across 200 million years of evolution.

Van Dam, Matthew H; Cabras, Analyn Anzano; Henderson, James B; Rominger, Andrew J; Pérez Estrada, Cynthia; Omer, Arina D; Dudchenko, Olga; Lieberman Aiden, Erez; Lam, Athena W.

PLoS Genet ; 17(8): e1009745, 2021 08.

Article in English | MEDLINE | ID: mdl-34460814

ABSTRACT

Patterns of genomic architecture across insects remain largely undocumented or decoupled from a broader phylogenetic context. For instance, it is unknown whether translocation rates differ between insect orders. We address broad scale patterns of genome architecture across Insecta by examining synteny in a phylogenetic framework from open-source insect genomes. To accomplish this, we add a chromosome level genome to a crucial lineage, Coleoptera. Our assembly of the Pachyrhynchus sulphureomaculatus genome is the first chromosome scale genome for the hyperdiverse Phytophaga lineage and currently the largest insect genome assembled to this scale. The genome is significantly larger than those of other weevils, and this increase in size is caused by repetitive elements. Our results also indicate that, among beetles, there are instances of long-lasting (>200 Ma) localization of genes to a particular chromosome with few translocation events. While some chromosomes have a paucity of translocations, intra-chromosomal synteny was almost absent, with gene order thoroughly shuffled along a chromosome. This large amount of reshuffling within chromosomes with few inter-chromosomal events contrasts with patterns seen in mammals in which the chromosomes tend to exchange larger blocks of material more readily. To place our findings in an evolutionary context, we compared syntenic patterns across Insecta in a phylogenetic framework. For the first time, we find that synteny decays at an exponential rate relative to phylogenetic distance. Additionally, there are significant differences in decay rates between insect orders; this pattern was not driven by Lepidoptera alone, which has a substantially different rate.

Subjects

Coleoptera/genetics , Synteny/genetics , Weevils/genetics , Animals , Biological Evolution , Chromosomes/genetics , Evolution, Molecular , Genome, Insect/genetics , Genomics/methods , Phylogeny
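
Note on the exponential decay of synteny reported in the abstract of this record: one common way to describe such a relationship is s(d) = s0 * exp(-k * d), where d is phylogenetic distance, s is the fraction of conserved synteny, and k is the decay rate. The abstract does not state how the authors fitted their data, so the following is only a minimal sketch under that assumed model, using an ordinary least-squares fit on log-transformed values. The function name, variable names, and the synthetic data are all hypothetical.

```python
import math


def fit_exponential_decay(distances, synteny):
    """Fit s = s0 * exp(-k * d) by least squares on ln(s); return (s0, k).

    Assumes every synteny value is strictly positive.
    """
    if len(distances) != len(synteny) or len(distances) < 2:
        raise ValueError("need two or more paired observations")
    logs = [math.log(s) for s in synteny]
    n = len(distances)
    mean_d = sum(distances) / n
    mean_l = sum(logs) / n
    sxx = sum((d - mean_d) ** 2 for d in distances)
    if sxx == 0:
        raise ValueError("distances must not all be equal")
    sxy = sum((d - mean_d) * (l - mean_l) for d, l in zip(distances, logs))
    slope = sxy / sxx                      # equals -k under the assumed model
    intercept = mean_l - slope * mean_d    # equals ln(s0)
    return math.exp(intercept), -slope


# Synthetic, noise-free data generated from the assumed model (not real measurements).
example_distances = [50.0, 100.0, 200.0, 300.0]
example_synteny = [0.9 * math.exp(-0.01 * d) for d in example_distances]
s0, k = fit_exponential_decay(example_distances, example_synteny)
print(round(s0, 3), round(k, 4))
```

Under this model, comparing decay rates between insect orders would amount to fitting k separately per order and comparing the estimates.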

7.

Delineating the Tnt1 Insertion Landscape of the Model Legume Medicago truncatula cv. R108 at the Hi-C Resolution Using a Chromosome-Length Genome Assembly.

Kaur, Parwinder; Lui, Christopher; Dudchenko, Olga; Nandety, Raja Sekhar; Hurgobin, Bhavna; Pham, Melanie; Lieberman Aiden, Erez; Wen, Jiangqi; Mysore, Kirankumar.

Int J Mol Sci ; 22(9), 2021 Apr 21.

Article in English | MEDLINE | ID: mdl-33919286

ABSTRACT

Legumes are of great interest for sustainable agricultural production as they fix atmospheric nitrogen to improve the soil. Medicago truncatula is a well-established model legume, and extensive studies in fundamental molecular, physiological, and developmental biology have been undertaken to translate into trait improvements in economically important legume crops worldwide. However, the M. truncatula reference genome was generated in the accession Jemalong A17, which is highly recalcitrant to transformation. M. truncatula R108 is more attractive for genetic studies due to its high transformation efficiency and Tnt1-insertion population resource for functional genomics. The need to perform accurate synteny analysis and comprehensive genome-scale comparisons necessitates a chromosome-length genome assembly for M. truncatula cv. R108. Here, we performed in situ Hi-C (48×) to anchor, order, and orient scaffolds, and to correct misjoins of contigs in a previously published genome assembly (R108 v1.0), resulting in an improved genome assembly containing eight chromosome-length scaffolds that span 97.62% of the sequenced bases in the input assembly. The long-range physical information data generated using Hi-C allowed us to obtain a chromosome-length ordering of the genome assembly, better validate previous draft misjoins, and provide further insights by accurately predicting synteny between A17 and R108 regions corresponding to the known chromosome 4/8 translocation. Furthermore, mapping the Tnt1 insertion landscape on this reference assembly presents an important resource for M. truncatula functional genomics by supporting efficient mutant gene identification in Tnt1 insertion lines. Our data provide a much-needed foundational resource that supports functional and molecular research into the Leguminosae for sustainable agriculture and feeding the future.

Subjects

Chromosome Mapping , Genome, Plant , Medicago truncatula/genetics , Genomics , Retroelements , Sequence Analysis, DNA

8.

Simple biochemical features underlie transcriptional activation domain diversity and dynamic, fuzzy binding to Mediator.

Sanborn, Adrian L; Yeh, Benjamin T; Feigerle, Jordan T; Hao, Cynthia V; Townshend, Raphael Jl; Lieberman Aiden, Erez; Dror, Ron O; Kornberg, Roger D.

Elife ; 10, 2021 04 27.

Article in English | MEDLINE | ID: mdl-33904398

ABSTRACT

Gene activator proteins comprise distinct DNA-binding and transcriptional activation domains (ADs). Because few ADs have been described, we tested domains tiling all yeast transcription factors for activation in vivo and identified 150 ADs. By mRNA display, we showed that 73% of ADs bound the Med15 subunit of Mediator, and that binding strength was correlated with activation. AD-Mediator interaction in vitro was unaffected by a large excess of free activator protein, pointing to a dynamic mechanism of interaction. Structural modeling showed that ADs interact with Med15 without shape complementarity ('fuzzy' binding). ADs shared no sequence motifs, but mutagenesis revealed biochemical and structural constraints. Finally, a neural network trained on AD sequences accurately predicted ADs in human proteins and in other yeast proteins, including chromosomal proteins and chromatin remodeling complexes. These findings solve the longstanding enigma of AD structure and function and provide a rationale for their role in biology.

Cells adapt and respond to changes by regulating the activity of their genes. To turn genes on or off, they use a family of proteins called transcription factors. Transcription factors influence specific but overlapping groups of genes, so that each gene is controlled by several transcription factors that act together like a dimmer switch to regulate gene activity. The presence of transcription factors attracts proteins such as the Mediator complex, which activates genes by gathering the protein machines that read the genes. The more transcription factors are found near a specific gene, the more strongly they attract Mediator and the more active the gene is. A specific region on the transcription factor called the activation domain is necessary for this process. The biochemical sequences of these domains vary greatly between species, yet activation domains from, for example, yeast and human proteins are often interchangeable. To understand why this is the case, Sanborn et al. analyzed the genome of baker's yeast and identified 150 activation domains, each very different in sequence. Three-quarters of them bound to a subunit of the Mediator complex called Med15. Sanborn et al. then developed a machine learning algorithm to predict activation domains in both yeast and humans. This algorithm also showed that negatively charged and greasy regions on the activation domains were essential to be activated by the Mediator complex. Further analyses revealed that activation domains used different poses to bind multiple sites on Med15, a behavior known as 'fuzzy' binding. This creates a high overall affinity even though the binding strength at each individual site is low, enabling the protein complexes to remain dynamic. These weak interactions together permit fine control over the activity of several genes, allowing cells to respond quickly and precisely to many changes. 
The computer algorithm used here provides a new way to identify activation domains across species and could improve our understanding of how living things grow, adapt and evolve. It could also give new insights into mechanisms of disease, particularly cancer, where transcription factors are often faulty.

Subjects

Mediator Complex/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Transcriptional Activation/genetics , Catalytic Domain/genetics , Genetic Variation/genetics , High-Throughput Screening Assays , Humans , Mediator Complex/genetics , Saccharomyces cerevisiae , Saccharomyces cerevisiae Proteins/genetics , Transcription Factors/genetics , Transcription Factors/metabolism

9.

CTCF loss has limited effects on global genome architecture in Drosophila despite critical regulatory functions.

Kaushal, Anjali; Mohana, Giriram; Dorier, Julien; Özdemir, Isa; Omer, Arina; Cousin, Pascal; Semenova, Anastasiia; Taschner, Michael; Dergai, Oleksandr; Marzetta, Flavia; Iseli, Christian; Eliaz, Yossi; Weisz, David; Shamim, Muhammad Saad; Guex, Nicolas; Lieberman Aiden, Erez; Gambetta, Maria Cristina.

Nat Commun ; 12(1): 1011, 2021 02 12.

Article in English | MEDLINE | ID: mdl-33579945

ABSTRACT

Vertebrate genomes are partitioned into contact domains defined by enhanced internal contact frequency and formed by two principal mechanisms: compartmentalization of transcriptionally active and inactive domains, and stalling of chromosomal loop-extruding cohesin by CTCF bound at domain boundaries. While Drosophila has widespread contact domains and CTCF, it is currently unclear whether CTCF-dependent domains exist in flies. We genetically ablate CTCF in Drosophila and examine impacts on genome folding and transcriptional regulation in the central nervous system. We find that CTCF is required to form a small fraction of all domain boundaries, while critically controlling expression patterns of certain genes and supporting nervous system function. We also find that CTCF recruits the pervasive boundary-associated factor Cp190 to CTCF-occupied boundaries and co-regulates a subset of genes near boundaries together with Cp190. These results highlight a profound difference in CTCF-requirement for genome folding between flies and vertebrates, in which a large fraction of boundaries are CTCF-dependent, and suggest that CTCF has played mutable roles in genome architecture and direct gene expression control during metazoan evolution.

Subjects

CCCTC-Binding Factor/genetics , CCCTC-Binding Factor/metabolism , Drosophila/genetics , Genome , Animals , Chromatin , Chromosomes/metabolism , Developmental Biology , Drosophila Proteins/genetics , Drosophila Proteins/metabolism , Female , Gene Knockout Techniques , Male , Microtubule-Associated Proteins/metabolism

10.

H3K27me3-rich genomic regions can function as silencers to repress gene expression via chromatin interactions.

Cai, Yichao; Zhang, Ying; Loh, Yan Ping; Tng, Jia Qi; Lim, Mei Chee; Cao, Zhendong; Raju, Anandhkumar; Lieberman Aiden, Erez; Li, Shang; Manikandan, Lakshmanan; Tergaonkar, Vinay; Tucker-Kellogg, Greg; Fullwood, Melissa Jane.

Nat Commun ; 12(1): 719, 2021 01 29.

Article in English | MEDLINE | ID: mdl-33514712

ABSTRACT

The mechanisms underlying gene repression and silencers are poorly understood. Here we investigate the hypothesis that H3K27me3-rich regions of the genome, defined from clusters of H3K27me3 peaks, may be used to identify silencers that can regulate gene expression via proximity or looping. We find that H3K27me3-rich regions are associated with chromatin interactions and interact preferentially with each other. Removal of H3K27me3-rich region components at interaction anchors by CRISPR leads to upregulation of interacting target genes, altered H3K27me3 and H3K27ac levels at interacting regions, and altered chromatin interactions. Chromatin interactions did not change at regions with high H3K27me3, but regions with low H3K27me3 and high H3K27ac levels showed changes in chromatin interactions. Cells with knockout of H3K27me3-rich regions also show changes in phenotype associated with cell identity, and altered xenograft tumor growth. Finally, we observe that genes associated with H3K27me3-rich regions and long-range chromatin interactions are susceptible to H3K27me3 depletion. Our results characterize H3K27me3-rich regions and their mechanisms of functioning via looping.

Subjects

Chromatin/metabolism , Epigenetic Repression , Histones/genetics , Neoplasms/genetics , Silencer Elements, Transcriptional/genetics , Animals , Cell Line, Tumor , Chromatin/genetics , Chromatin Immunoprecipitation Sequencing , Female , Fibroblast Growth Factors/genetics , Gene Expression Regulation, Neoplastic , Gene Knockdown Techniques , Gene Knockout Techniques , Histones/metabolism , Humans , Insulin-Like Growth Factor II/genetics , Mice , RNA-Seq , Xenograft Model Antitumor Assays

11.

The Nucleome Data Bank: web-based resources to simulate and analyze the three-dimensional genome.

Contessoto, Vinícius G; Cheng, Ryan R; Hajitaheri, Arya; Dodero-Rojas, Esteban; Mello, Matheus F; Lieberman-Aiden, Erez; Wolynes, Peter G; Di Pierro, Michele; Onuchic, José N.

Nucleic Acids Res ; 49(D1): D172-D182, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33021634

ABSTRACT

We introduce the Nucleome Data Bank (NDB), a web-based platform to simulate and analyze the three-dimensional (3D) organization of genomes. The NDB enables physics-based simulation of chromosomal structural dynamics through the MEGABASE + MiChroM computational pipeline. The input of the pipeline consists of epigenetic information sourced from the ENCODE database; the output consists of the trajectories of chromosomal motions that accurately predict Hi-C and fluorescence in situ hybridization data, as well as multiple observations of chromosomal dynamics in vivo. As an intermediate step, users can also generate chromosomal sub-compartment annotations directly from the same epigenetic input, without the use of any DNA-DNA proximity ligation data. Additionally, the NDB freely hosts both experimental and computational structural genomics data. Besides being able to perform their own genome simulations and download the hosted data, users can also analyze and visualize the same data through custom-designed web-based tools. In particular, the one-dimensional genetic and epigenetic data can be overlaid onto accurate 3D structures of chromosomes, to study the spatial distribution of genetic and epigenetic features. The NDB aims to be a shared resource to biologists, biophysicists and all genome scientists. The NDB is available at https://ndb.rice.edu.

Subjects

Chromatin/ultrastructure , Computational Biology/methods , Databases, Genetic , Epigenesis, Genetic , Genome, Human , A549 Cells , Chromatin/metabolism , Humans , In Situ Hybridization, Fluorescence , Internet , Molecular Conformation , Molecular Sequence Annotation , Software

12.

Exploring chromosomal structural heterogeneity across multiple cell lines.

Cheng, Ryan R; Contessoto, Vinicius G; Lieberman Aiden, Erez; Wolynes, Peter G; Di Pierro, Michele; Onuchic, Jose N.

Elife ; 9, 2020 10 13.

Article in English | MEDLINE | ID: mdl-33047670

ABSTRACT

Using computer simulations, we generate cell-specific 3D chromosomal structures and compare them to recently published chromatin structures obtained through microscopy. We demonstrate using machine learning and polymer physics simulations that epigenetic information can be used to predict the structural ensembles of multiple human cell lines. Theory predicts that chromosome structures are fluid and can only be described by an ensemble, which is consistent with the observation that chromosomes exhibit no unique fold. Nevertheless, our analysis of both structures from simulation and microscopy reveals that short segments of chromatin make two-state transitions between closed conformations and open dumbbell conformations. Finally, we study the conformational changes associated with the switching of genomic compartments observed in human cell lines. The formation of genomic compartments resembles hydrophobic collapse in protein folding, with the aggregation of denser and predominantly inactive chromatin driving the positioning of active chromatin toward the surface of individual chromosomal territories.

Subjects

Chromosomes, Human/ultrastructure , Cell Line , Cell Line, Tumor , Chromatin/metabolism , Chromatin/ultrastructure , Computer Simulation , Epigenesis, Genetic , Genetic Loci , Humans , Imaging, Three-Dimensional

13.

Hi-C chromosome conformation capture sequencing of avian genomes using the BGISEQ-500 platform.

Sandoval-Velasco, Marcela; Rodríguez, Juan Antonio; Perez Estrada, Cynthia; Zhang, Guojie; Lieberman Aiden, Erez; Marti-Renom, Marc A; Gilbert, M Thomas P; Smith, Oliver.

Gigascience ; 9(8), 2020 08 01.

Article in English | MEDLINE | ID: mdl-32845983

ABSTRACT

BACKGROUND: Hi-C experiments couple DNA-DNA proximity with next-generation sequencing to yield an unbiased description of genome-wide interactions. Previous methods describing Hi-C experiments have focused on the industry-standard Illumina sequencing. With new next-generation sequencing platforms such as BGISEQ-500 becoming more widely available, protocol adaptations to fit platform-specific requirements are useful to give increased choice to researchers who routinely generate sequencing data. RESULTS: We describe an in situ Hi-C protocol adapted to be compatible with the BGISEQ-500 high-throughput sequencing platform. Using zebra finch (Taeniopygia guttata) as a biological sample, we demonstrate how Hi-C libraries can be constructed to generate informative data using the BGISEQ-500 platform, following circularization and DNA nanoball generation. Our protocol is a modification of an Illumina-compatible method, based around blunt-end ligations in library construction, using un-barcoded, distally overhanging double-stranded adapters, followed by amplification using indexed primers. The resulting libraries are ready for circularization and subsequent sequencing on the BGISEQ series of platforms and yield data similar to what can be expected using Illumina-compatible approaches. CONCLUSIONS: Our straightforward modification to an Illumina-compatible in situ Hi-C protocol enables data generation on the BGISEQ series of platforms, thus expanding the options available for researchers who wish to utilize the powerful Hi-C techniques in their research.

Subjects

Chromosomes , High-Throughput Nucleotide Sequencing , DNA , Genome, Human , Humans , Sequence Analysis, DNA

14.

GSDB: a database of 3D chromosome and genome structures reconstructed from Hi-C data.

Oluwadare, Oluwatosin; Highsmith, Max; Turner, Douglass; Lieberman Aiden, Erez; Cheng, Jianlin.

BMC Mol Cell Biol ; 21(1): 60, 2020 Aug 05.

Article in English | MEDLINE | ID: mdl-32758136

ABSTRACT

Advances in the study of chromosome conformation capture technologies, such as the Hi-C technique, which is capable of capturing chromosomal interactions on a genome-wide scale, have led to the development of three-dimensional chromosome and genome structure reconstruction methods from Hi-C data. The three-dimensional genome structure is important because it plays a role in a variety of important biological activities such as DNA replication, gene regulation, genome interaction, and gene expression. In recent years, numerous Hi-C datasets have been generated, and likewise, a number of genome structure construction algorithms have been developed. In this work, we outline the construction of a novel Genome Structure Database (GSDB) to create a comprehensive repository that contains 3D structures for Hi-C datasets constructed by a variety of 3D structure reconstruction tools. The GSDB contains over 50,000 structures from 12 state-of-the-art Hi-C data structure prediction algorithms for 32 Hi-C datasets. GSDB functions as a centralized collection of genome structures which will enable the exploration of the dynamic architectures of chromosomes and genomes for biomedical research. GSDB is accessible at http://sysbio.rnet.missouri.edu/3dgenome/GSDB.

Subjects

Chromosomes/genetics , Databases, Genetic , Genome , Algorithms , Nucleic Acid Conformation , Principal Component Analysis

15.

Correction to: GSDB: a database of 3D chromosome and genome structures reconstructed from Hi-C data.

Oluwadare, Oluwatosin; Highsmith, Max; Turner, Douglass; Lieberman Aiden, Erez; Cheng, Jianlin.

BMC Mol Cell Biol ; 21(1): 62, 2020 08 18.

Article in English | MEDLINE | ID: mdl-32811439

ABSTRACT

An amendment to this paper has been published and can be accessed via the original article.

16.

The genome sequence of the Eurasian red squirrel, Sciurus vulgaris Linnaeus 1758.

Mead, Daniel; Fingland, Kathryn; Cripps, Rachel; Portela Miguez, Roberto; Smith, Michelle; Corton, Craig; Oliver, Karen; Skelton, Jason; Betteridge, Emma; Dolucan, Jale; Dudchenko, Olga; Omer, Arina D; Weisz, David; Lieberman Aiden, Erez; Fedrigo, Olivier; Mountcastle, Jacquelyn; Jarvis, Erich; McCarthy, Shane A; Sims, Ying; Torrance, James; Tracey, Alan; Howe, Kerstin; Challis, Richard; Durbin, Richard; Blaxter, Mark.

Wellcome Open Res ; 5: 18, 2020.

Article in English | MEDLINE | ID: mdl-32587897

ABSTRACT

We present a genome assembly from an individual male Sciurus vulgaris (the Eurasian red squirrel; Vertebrata; Mammalia; Eutheria; Rodentia; Sciuridae). The genome sequence is 2.88 gigabases in span. The majority of the assembly is scaffolded into 21 chromosomal-level scaffolds, with both X and Y sex chromosomes assembled.

17.

Chromosomal-level genome assembly of the scimitar-horned oryx: Insights into diversity and demography of a species extinct in the wild.

Humble, Emily; Dobrynin, Pavel; Senn, Helen; Chuven, Justin; Scott, Alan F; Mohr, David W; Dudchenko, Olga; Omer, Arina D; Colaric, Zane; Lieberman Aiden, Erez; Al Dhaheri, Shaikha Salem; Wildt, David; Oliaji, Shireen; Tamazian, Gaik; Pukazhenthi, Budhan; Ogden, Rob; Koepfli, Klaus-Peter.

Mol Ecol Resour ; 20(6): 1668-1681, 2020 Nov.

Article in English | MEDLINE | ID: mdl-32365406

ABSTRACT

Captive populations provide a valuable insurance against extinctions in the wild. However, they are also vulnerable to the negative impacts of inbreeding, selection and drift. Genetic information is therefore considered a critical aspect of conservation management. Recent developments in sequencing technologies have the potential to improve the outcomes of management programmes; however, the transfer of these approaches to applied conservation has been slow. The scimitar-horned oryx (Oryx dammah) is a North African antelope that has been extinct in the wild since the early 1980s and is the focus of a large-scale and long-term reintroduction project. To enable the selection of suitable founder individuals, facilitate post-release monitoring and improve captive breeding management, comprehensive genomic resources are required. Here, we used 10X Chromium sequencing together with Hi-C contact mapping to develop a chromosomal-level genome assembly for the species. The resulting assembly contained 29 chromosomes with a scaffold N50 of 100.4 Mb, and displayed strong chromosomal synteny with the cattle genome. Using resequencing data from six additional individuals, we demonstrated relatively high genetic diversity in the scimitar-horned oryx compared to other mammals, despite it having experienced a strong founding event in captivity. Additionally, the level of diversity across populations varied according to management strategy. Finally, we uncovered a dynamic demographic history that coincided with periods of climate variation during the Pleistocene. Overall, our study provides a clear example of how genomic data can uncover valuable insights into captive populations and contributes important resources to guide future management decisions of an endangered species.

Subjects

Antelopes; Endangered Species; Genome; Animals; Antelopes/genetics; Chromosomes; Inbreeding; Synteny

18.

The genome sequence of the Eurasian river otter, Lutra lutra Linnaeus 1758.

Mead, Dan; Hailer, Frank; Chadwick, Elisabeth; Portela Miguez, Roberto; Smith, Michelle; Corton, Craig; Oliver, Karen; Skelton, Jason; Betteridge, Emma; Doulcan, Jale; Dudchenko, Olga; Omer, Arina; Weisz, David; Lieberman Aiden, Erez; McCarthy, Shane; Howe, Kerstin; Sims, Ying; Torrance, James; Tracey, Alan; Challis, Richard; Durbin, Richard; Blaxter, Mark.

Wellcome Open Res ; 5: 33, 2020.

Article in English | MEDLINE | ID: mdl-32258427

ABSTRACT

We present a genome assembly from an individual male Lutra lutra (the Eurasian river otter; Vertebrata; Mammalia; Eutheria; Carnivora; Mustelidae). The genome sequence is 2.44 gigabases in span. The majority of the assembly is scaffolded into 20 chromosomal pseudomolecules, with both X and Y sex chromosomes assembled.

19.

Analysis of Hi-C data using SIP effectively identifies loops in organisms from C. elegans to mammals.

Rowley, M Jordan; Poulet, Axel; Nichols, Michael H; Bixler, Brianna J; Sanborn, Adrian L; Brouhard, Elizabeth A; Hermetz, Karen; Linsenbaum, Hannah; Csankovszki, Gyorgyi; Lieberman Aiden, Erez; Corces, Victor G.

Genome Res ; 30(3): 447-458, 2020 03.

Article in English | MEDLINE | ID: mdl-32127418

ABSTRACT

Chromatin loops are a major component of 3D nuclear organization, visually apparent as intense point-to-point interactions in Hi-C maps. Identification of these loops is a critical part of most Hi-C analyses. However, current methods often miss visually evident CTCF loops in Hi-C data sets from mammals, and they completely fail to identify high intensity loops in other organisms. We present SIP, Significant Interaction Peak caller, and SIPMeta, which are platform independent programs to identify and characterize these loops in a time- and memory-efficient manner. We show that SIP is resistant to noise and sequencing depth, and can be used to detect loops that were previously missed in human cells as well as loops in other organisms. SIPMeta corrects for a common visualization artifact by accounting for Manhattan distance to create average plots of Hi-C and HiChIP data. We then demonstrate that the use of SIP and SIPMeta can lead to biological insights by characterizing the contribution of several transcription factors to CTCF loop stability in human cells. We also annotate loops associated with the SMC component of the dosage compensation complex (DCC) in Caenorhabditis elegans and demonstrate that loop anchors represent bidirectional blocks for symmetrical loop extrusion. This is in contrast to the asymmetrical extrusion until unidirectional blockage by CTCF that is presumed to occur in mammals. Using HiChIP and multiway ligation events, we then show that DCC loops form a network of strong interactions that may contribute to X Chromosome-wide condensation in C. elegans hermaphrodites.

Subjects

Caenorhabditis elegans/genetics; Chromatin/chemistry; Sequence Analysis, DNA; Software; Aedes/genetics; Animals; CCCTC-Binding Factor/metabolism; Drosophila melanogaster/genetics; Humans; Transcription Factors/metabolism; X Chromosome Inactivation

20.

ESCO1 and CTCF enable formation of long chromatin loops by protecting cohesin^STAG1 from WAPL.

Wutz, Gordana; Ladurner, Rene; St Hilaire, Brian Glenn; Stocsits, Roman R; Nagasaka, Kota; Pignard, Benoit; Sanborn, Adrian; Tang, Wen; Várnai, Csilla; Ivanov, Miroslav P; Schoenfelder, Stefan; van der Lelij, Petra; Huang, Xingfan; Dürnberger, Gerhard; Roitinger, Elisabeth; Mechtler, Karl; Davidson, Iain Finley; Fraser, Peter; Lieberman-Aiden, Erez; Peters, Jan-Michael.

Elife ; 9, 2020 02 17.

Article in English | MEDLINE | ID: mdl-32065581

ABSTRACT

Eukaryotic genomes are folded into loops. It is thought that these are formed by cohesin complexes via extrusion, either until loop expansion is arrested by CTCF or until cohesin is removed from DNA by WAPL. Although WAPL limits cohesin's chromatin residence time to minutes, it has been reported that some loops exist for hours. How these loops can persist is unknown. We show that during G1-phase, mammalian cells contain acetylated cohesin^STAG1 which binds chromatin for hours, whereas cohesin^STAG2 binds chromatin for minutes. Our results indicate that CTCF and the acetyltransferase ESCO1 protect a subset of cohesin^STAG1 complexes from WAPL, thereby enabling formation of long and presumably long-lived loops, and that ESCO1, like CTCF, contributes to boundary formation in chromatin looping. Our data are consistent with a model of nested loop extrusion, in which acetylated cohesin^STAG1 forms stable loops between CTCF sites, demarcating the boundaries of more transient cohesin^STAG2 extrusion activity.

Subjects

Acetyltransferases/physiology; CCCTC-Binding Factor/physiology; Carrier Proteins/metabolism; Cell Cycle Proteins/metabolism; Chromatin/metabolism; Chromosomal Proteins, Non-Histone/metabolism; Nuclear Proteins/metabolism; Proto-Oncogene Proteins/metabolism; Acetylation; Carrier Proteins/genetics; Computer Simulation; G1 Phase; Genome, Human; Humans; Nuclear Proteins/genetics; Protein Binding; Proto-Oncogene Proteins/genetics; Cohesins
