Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
1.
Cell ; 187(14): 3541-3562.e51, 2024 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-38996487

RESUMO

Analyses of ancient DNA typically involve sequencing the surviving short oligonucleotides and aligning to genome assemblies from related, modern species. Here, we report that skin from a female woolly mammoth (†Mammuthus primigenius) that died 52,000 years ago retained its ancient genome architecture. We use PaleoHi-C to map chromatin contacts and assemble its genome, yielding 28 chromosome-length scaffolds. Chromosome territories, compartments, loops, Barr bodies, and inactive X chromosome (Xi) superdomains persist. The active and inactive genome compartments in mammoth skin more closely resemble Asian elephant skin than other elephant tissues. Our analyses uncover new biology. Differences in compartmentalization reveal genes whose transcription was potentially altered in mammoths vs. elephants. Mammoth Xi has a tetradic architecture, not bipartite like human and mouse. We hypothesize that, shortly after this mammoth's death, the sample spontaneously freeze-dried in the Siberian cold, leading to a glass transition that preserved subfossils of ancient chromosomes at nanometer scale.


Assuntos
Genoma , Mamutes , Pele , Animais , Mamutes/genética , Genoma/genética , Feminino , Elefantes/genética , Cromatina/genética , Fósseis , DNA Antigo/análise , Camundongos , Humanos , Cromossomo X/genética
2.
Cell ; 173(5): 1165-1178.e20, 2018 05 17.
Artigo em Inglês | MEDLINE | ID: mdl-29706548

RESUMO

Cohesin extrusion is thought to play a central role in establishing the architecture of mammalian genomes. However, extrusion has not been visualized in vivo, and thus, its functional impact and energetics are unknown. Using ultra-deep Hi-C, we show that loop domains form by a process that requires cohesin ATPases. Once formed, however, loops and compartments are maintained for hours without energy input. Strikingly, without ATP, we observe the emergence of hundreds of CTCF-independent loops that link regulatory DNA. We also identify architectural "stripes," where a loop anchor interacts with entire domains at high frequency. Stripes often tether super-enhancers to cognate promoters, and in B cells, they facilitate Igh transcription and recombination. Stripe anchors represent major hotspots for topoisomerase-mediated lesions, which promote chromosomal translocations and cancer. In plasmacytomas, stripes can deregulate Igh-translocated oncogenes. We propose that higher organisms have coopted cohesin extrusion to enhance transcription and recombination, with implications for tumor development.


Assuntos
Trifosfato de Adenosina/metabolismo , Proteínas de Ciclo Celular/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Genoma , Animais , Linfócitos B/citologia , Linfócitos B/metabolismo , Fator de Ligação a CCCTC/genética , Fator de Ligação a CCCTC/metabolismo , Proteínas de Ciclo Celular/química , Proteínas de Ciclo Celular/genética , Linhagem Celular , Proteoglicanas de Sulfatos de Condroitina/genética , Proteoglicanas de Sulfatos de Condroitina/metabolismo , Cromatina/metabolismo , Proteínas Cromossômicas não Histona/química , Proteínas Cromossômicas não Histona/genética , Cromossomos/metabolismo , Proteínas de Ligação a DNA , Humanos , Camundongos , Mutagênese , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Fosfoproteínas/genética , Fosfoproteínas/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Transcrição Gênica , Coesinas
3.
Cell ; 159(7): 1665-80, 2014 Dec 18.
Artigo em Inglês | MEDLINE | ID: mdl-25497547

RESUMO

We use in situ Hi-C to probe the 3D architecture of genomes, constructing haploid and diploid maps of nine cell types. The densest, in human lymphoblastoid cells, contains 4.9 billion contacts, achieving 1 kb resolution. We find that genomes are partitioned into contact domains (median length, 185 kb), which are associated with distinct patterns of histone marks and segregate into six subcompartments. We identify ∼10,000 loops. These loops frequently link promoters and enhancers, correlate with gene activation, and show conservation across cell types and species. Loop anchors typically occur at domain boundaries and bind CTCF. CTCF sites at loop anchors occur predominantly (>90%) in a convergent orientation, with the asymmetric motifs "facing" one another. The inactive X chromosome splits into two massive domains and contains large loops anchored at CTCF-binding repeats.


Assuntos
Núcleo Celular/genética , Cromatina/química , Genoma Humano , Animais , Fator de Ligação a CCCTC , Linhagem Celular , Núcleo Celular/química , Regulação da Expressão Gênica , Código das Histonas , Humanos , Camundongos , Conformação Molecular , Sequências Reguladoras de Ácido Nucleico , Proteínas Repressoras/metabolismo
5.
Genome Res ; 31(6): 981-994, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-34006569

RESUMO

Chromatin architecture mapping in 3D formats has increased our understanding of how regulatory sequences and gene expression are connected and regulated in a genome. The 3D chromatin genome shows extensive remodeling during embryonic development, and although the cleavage-stage embryos of most species lack structure before zygotic genome activation (pre-ZGA), zebrafish has been reported to have structure. Here, we aimed to determine the chromosomal architecture in paternal/sperm zebrafish gamete cells to discern whether it either resembles or informs early pre-ZGA zebrafish embryo chromatin architecture. First, we assessed the higher-order architecture through advanced low-cell in situ Hi-C. The structure of zebrafish sperm, packaged by histones, lacks topological associated domains and instead displays "hinge-like" domains of ∼150 kb that repeat every 1-2 Mbs, suggesting a condensed repeating structure resembling mitotic chromosomes. The pre-ZGA embryos lacked chromosomal structure, in contrast to prior work, and only developed structure post-ZGA. During post-ZGA, we find chromatin architecture beginning to form at small contact domains of a median length of ∼90 kb. These small contact domains are established at enhancers, including super-enhancers, and chemical inhibition of Ep300a (p300) and Crebbpa (CBP) activity, lowering histone H3K27ac, but not transcription inhibition, diminishes these contacts. Together, this study reveals hinge-like domains in histone-packaged zebrafish sperm chromatin and determines that the initial formation of high-order chromatin architecture in zebrafish embryos occurs after ZGA primarily at enhancers bearing high H3K27ac.


Assuntos
Cromatina , Peixe-Zebra , Animais , Cromatina/genética , Cromatina/metabolismo , Cromossomos/genética , Desenvolvimento Embrionário/genética , Regulação da Expressão Gênica no Desenvolvimento , Masculino , Espermatozoides/metabolismo , Peixe-Zebra/genética , Zigoto
6.
Genome Res ; 31(6): 968-980, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-34006570

RESUMO

Chromatin looping plays an important role in genome regulation. However, because ChIP-seq and loop-resolution Hi-C (DNA-DNA proximity ligation) are extremely challenging in mammalian early embryos, the developmental stage at which cohesin-mediated loops form remains unknown. Here, we study early development in medaka (the Japanese killifish, Oryzias latipes) at 12 time points before, during, and after gastrulation (the onset of cell differentiation) and characterize transcription, protein binding, and genome architecture. We find that gastrulation is associated with drastic changes in genome architecture, including the formation of the first loops between sites bound by the insulator protein CTCF and a large increase in the size of contact domains. In contrast, the binding of the CTCF is fixed throughout embryogenesis. Loops form long after genome-wide transcriptional activation, and long after domain formation seen in mouse embryos. These results suggest that, although loops may play a role in differentiation, they are not required for zygotic transcription. When we repeated our experiments in zebrafish, loops did not emerge until gastrulation, that is, well after zygotic genome activation. We observe that loop positions are highly conserved in synteny blocks of medaka and zebrafish, indicating that the 3D genome architecture has been maintained for >110-200 million years of evolution.


Assuntos
Oryzias , Animais , Fator de Ligação a CCCTC/genética , Fator de Ligação a CCCTC/metabolismo , Proteínas de Ciclo Celular/genética , Cromatina/genética , Gastrulação/genética , Camundongos , Oryzias/genética , Peixe-Zebra/genética
7.
PLoS Genet ; 14(12): e1007872, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30586358

RESUMO

Chromosome organization is crucial for genome function. Here, we present a method for visualizing chromosomal DNA at super-resolution and then integrating Hi-C data to produce three-dimensional models of chromosome organization. Using the super-resolution microscopy methods of OligoSTORM and OligoDNA-PAINT, we trace 8 megabases of human chromosome 19, visualizing structures ranging in size from a few kilobases to over a megabase. Focusing on chromosomal regions that contribute to compartments, we discover distinct structures that, in spite of considerable variability, can predict whether such regions correspond to active (A-type) or inactive (B-type) compartments. Imaging through the depths of entire nuclei, we capture pairs of homologous regions in diploid cells, obtaining evidence that maternal and paternal homologous regions can be differentially organized. Finally, using restraint-based modeling to integrate imaging and Hi-C data, we implement a method-integrative modeling of genomic regions (IMGR)-to increase the genomic resolution of our traces to 10 kb.


Assuntos
Passeio de Cromossomo/métodos , Cromossomos Humanos Par 19/genética , Cromossomos Humanos Par 19/ultraestrutura , Modelos Genéticos , Células Cultivadas , Coloração Cromossômica/métodos , Estruturas Cromossômicas/química , Estruturas Cromossômicas/genética , Estruturas Cromossômicas/ultraestrutura , Cromossomos Humanos Par 19/química , Feminino , Corantes Fluorescentes , Humanos , Imageamento Tridimensional , Hibridização in Situ Fluorescente/métodos , Masculino , Sondas de Oligonucleotídeos , Linhagem
8.
Proc Natl Acad Sci U S A ; 113(31): E4504-12, 2016 08 02.
Artigo em Inglês | MEDLINE | ID: mdl-27432957

RESUMO

During interphase, the inactive X chromosome (Xi) is largely transcriptionally silent and adopts an unusual 3D configuration known as the "Barr body." Despite the importance of X chromosome inactivation, little is known about this 3D conformation. We recently showed that in humans the Xi chromosome exhibits three structural features, two of which are not shared by other chromosomes. First, like the chromosomes of many species, Xi forms compartments. Second, Xi is partitioned into two huge intervals, called "superdomains," such that pairs of loci in the same superdomain tend to colocalize. The boundary between the superdomains lies near DXZ4, a macrosatellite repeat whose Xi allele extensively binds the protein CCCTC-binding factor. Third, Xi exhibits extremely large loops, up to 77 megabases long, called "superloops." DXZ4 lies at the anchor of several superloops. Here, we combine 3D mapping, microscopy, and genome editing to study the structure of Xi, focusing on the role of DXZ4 We show that superloops and superdomains are conserved across eutherian mammals. By analyzing ligation events involving three or more loci, we demonstrate that DXZ4 and other superloop anchors tend to colocate simultaneously. Finally, we show that deleting DXZ4 on Xi leads to the disappearance of superdomains and superloops, changes in compartmentalization patterns, and changes in the distribution of chromatin marks. Thus, DXZ4 is essential for proper Xi packaging.


Assuntos
Cromossomos Humanos X/genética , Deleção de Genes , Genoma Humano/genética , Repetições de Microssatélites/genética , Inativação do Cromossomo X , Animais , Sítios de Ligação/genética , Fator de Ligação a CCCTC/metabolismo , Cromatina/genética , Cromatina/metabolismo , Mapeamento Cromossômico , Feminino , Humanos , Macaca mulatta , Camundongos , Ligação Proteica
9.
Proc Natl Acad Sci U S A ; 112(47): E6456-65, 2015 Nov 24.
Artigo em Inglês | MEDLINE | ID: mdl-26499245

RESUMO

We recently used in situ Hi-C to create kilobase-resolution 3D maps of mammalian genomes. Here, we combine these maps with new Hi-C, microscopy, and genome-editing experiments to study the physical structure of chromatin fibers, domains, and loops. We find that the observed contact domains are inconsistent with the equilibrium state for an ordinary condensed polymer. Combining Hi-C data and novel mathematical theorems, we show that contact domains are also not consistent with a fractal globule. Instead, we use physical simulations to study two models of genome folding. In one, intermonomer attraction during polymer condensation leads to formation of an anisotropic "tension globule." In the other, CCCTC-binding factor (CTCF) and cohesin act together to extrude unknotted loops during interphase. Both models are consistent with the observed contact domains and with the observation that contact domains tend to form inside loops. However, the extrusion model explains a far wider array of observations, such as why loops tend not to overlap and why the CTCF-binding motifs at pairs of loop anchors lie in the convergent orientation. Finally, we perform 13 genome-editing experiments examining the effect of altering CTCF-binding sites on chromatin folding. The convergent rule correctly predicts the affected loops in every case. Moreover, the extrusion model accurately predicts in silico the 3D maps resulting from each experiment using only the location of CTCF-binding sites in the WT. Thus, we show that it is possible to disrupt, restore, and move loops and domains using targeted mutations as small as a single base pair.


Assuntos
Cromatina/química , Cromatina/genética , Engenharia Genética , Genoma/genética , Conformação de Ácido Nucleico , Anisotropia , Pareamento de Bases , Fator de Ligação a CCCTC , Proteínas de Ciclo Celular/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Simulação por Computador , Difusão , Fractais , Humanos , Hibridização in Situ Fluorescente , Modelos Moleculares , Motivos de Nucleotídeos/genética , Polímeros/química , Probabilidade , Proteínas Repressoras/metabolismo , Coesinas
10.
BMC Biol ; 15(1): 110, 2017 Nov 16.
Artigo em Inglês | MEDLINE | ID: mdl-29145861

RESUMO

BACKGROUND: The de novo assembly of repeat-rich mammalian genomes using only high-throughput short read sequencing data typically results in highly fragmented genome assemblies that limit downstream applications. Here, we present an iterative approach to hybrid de novo genome assembly that incorporates datasets stemming from multiple genomic technologies and methods. We used this approach to improve the gray mouse lemur (Microcebus murinus) genome from early draft status to a near chromosome-scale assembly. METHODS: We used a combination of advanced genomic technologies to iteratively resolve conflicts and super-scaffold the M. murinus genome. RESULTS: We improved the M. murinus genome assembly to a scaffold N50 of 93.32 Mb. Whole genome alignments between our primary super-scaffolds and 23 human chromosomes revealed patterns that are congruent with historical comparative cytogenetic data, thus demonstrating the accuracy of our de novo scaffolding approach and allowing assignment of scaffolds to M. murinus chromosomes. Moreover, we utilized our independent datasets to discover and characterize sequences associated with centromeres across the mouse lemur genome. Quality assessment of the final assembly found 96% of mouse lemur canonical transcripts nearly complete, comparable to other published high-quality reference genome assemblies. CONCLUSIONS: We describe a new assembly of the gray mouse lemur (Microcebus murinus) genome with chromosome-scale scaffolds produced using a hybrid bioinformatic and sequencing approach. The approach is cost effective and produces superior results based on metrics of contiguity and completeness. Our results show that emerging genomic technologies can be used in combination to characterize centromeres of non-model species and to produce accurate de novo chromosome-scale genome assemblies of complex mammalian genomes.


Assuntos
Centrômero/genética , Cheirogaleidae/genética , Genoma , Animais , Biologia Computacional , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA
11.
bioRxiv ; 2024 Jul 22.
Artigo em Inglês | MEDLINE | ID: mdl-39026740

RESUMO

Enhancers are key drivers of gene regulation thought to act via 3D physical interactions with the promoters of their target genes. However, genome-wide depletions of architectural proteins such as cohesin result in only limited changes in gene expression, despite a loss of contact domains and loops. Consequently, the role of cohesin and 3D contacts in enhancer function remains debated. Here, we developed CRISPRi of regulatory elements upon degron operation (CRUDO), a novel approach to measure how changes in contact frequency impact enhancer effects on target genes by perturbing enhancers with CRISPRi and measuring gene expression in the presence or absence of cohesin. We systematically perturbed all 1,039 candidate enhancers near five cohesin-dependent genes and identified 34 enhancer-gene regulatory interactions. Of 26 regulatory interactions with sufficient statistical power to evaluate cohesin dependence, 18 show cohesin-dependent effects. A decrease in enhancer-promoter contact frequency upon removal of cohesin is frequently accompanied by a decrease in the regulatory effect of the enhancer on gene expression, consistent with a contact-based model for enhancer function. However, changes in contact frequency and regulatory effects on gene expression vary as a function of distance, with distal enhancers (e.g., >50Kb) experiencing much larger changes than proximal ones (e.g., <50Kb). Because most enhancers are located close to their target genes, these observations can explain how only a small subset of genes - those with strong distal enhancers - are sensitive to cohesin. Together, our results illuminate how 3D contacts, influenced by both cohesin and genomic distance, tune enhancer effects on gene expression.

12.
PLoS One ; 18(11): e0294283, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38032990

RESUMO

Early detection of SARS-CoV-2 infection is key to managing the current global pandemic, as evidence shows the virus is most contagious on or before symptom onset. Here, we introduce a low-cost, high-throughput method for diagnosing and studying SARS-CoV-2 infection. Dubbed Pathogen-Oriented Low-Cost Assembly & Re-Sequencing (POLAR), this method amplifies the entirety of the SARS-CoV-2 genome. This contrasts with typical RT-PCR-based diagnostic tests, which amplify only a few loci. To achieve this goal, we combine a SARS-CoV-2 enrichment method developed by the ARTIC Network (https://artic.network/) with short-read DNA sequencing and de novo genome assembly. Using this method, we can reliably (>95% accuracy) detect SARS-CoV-2 at a concentration of 84 genome equivalents per milliliter (GE/mL). The vast majority of diagnostic methods meeting our analytical criteria that are currently authorized for use by the United States Food and Drug Administration with the Coronavirus Disease 2019 (COVID-19) Emergency Use Authorization require higher concentrations of the virus to achieve this degree of sensitivity and specificity. In addition, we can reliably assemble the SARS-CoV-2 genome in the sample, often with no gaps and perfect accuracy given sufficient viral load. The genotypic data in these genome assemblies enable the more effective analysis of disease spread than is possible with an ordinary binary diagnostic. These data can also help identify vaccine and drug targets. Finally, we show that the diagnoses obtained using POLAR of positive and negative clinical nasal mid-turbinate swab samples 100% match those obtained in a clinical diagnostic lab using the Center for Disease Control's 2019-Novel Coronavirus test. Using POLAR, a single person can manually process 192 samples over an 8-hour experiment at the cost of ~$36 per patient (as of December 7th, 2022), enabling a 24-hour turnaround with sequencing and data analysis time. We anticipate that further testing and refinement will allow greater sensitivity using this approach.


Assuntos
COVID-19 , SARS-CoV-2 , Estados Unidos , Humanos , SARS-CoV-2/genética , COVID-19/diagnóstico , Teste para COVID-19 , Sensibilidade e Especificidade , Análise de Sequência de DNA
13.
bioRxiv ; 2023 Nov 13.
Artigo em Inglês | MEDLINE | ID: mdl-38014075

RESUMO

Identifying transcriptional enhancers and their target genes is essential for understanding gene regulation and the impact of human genetic variation on disease1-6. Here we create and evaluate a resource of >13 million enhancer-gene regulatory interactions across 352 cell types and tissues, by integrating predictive models, measurements of chromatin state and 3D contacts, and largescale genetic perturbations generated by the ENCODE Consortium7. We first create a systematic benchmarking pipeline to compare predictive models, assembling a dataset of 10,411 elementgene pairs measured in CRISPR perturbation experiments, >30,000 fine-mapped eQTLs, and 569 fine-mapped GWAS variants linked to a likely causal gene. Using this framework, we develop a new predictive model, ENCODE-rE2G, that achieves state-of-the-art performance across multiple prediction tasks, demonstrating a strategy involving iterative perturbations and supervised machine learning to build increasingly accurate predictive models of enhancer regulation. Using the ENCODE-rE2G model, we build an encyclopedia of enhancer-gene regulatory interactions in the human genome, which reveals global properties of enhancer networks, identifies differences in the functions of genes that have more or less complex regulatory landscapes, and improves analyses to link noncoding variants to target genes and cell types for common, complex diseases. By interpreting the model, we find evidence that, beyond enhancer activity and 3D enhancer-promoter contacts, additional features guide enhancerpromoter communication including promoter class and enhancer-enhancer synergy. Altogether, these genome-wide maps of enhancer-gene regulatory interactions, benchmarking software, predictive models, and insights about enhancer function provide a valuable resource for future studies of gene regulation and human genetics.

14.
bioRxiv ; 2023 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-37066421

RESUMO

The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses.

15.
Res Sq ; 2023 Jul 19.
Artigo em Inglês | MEDLINE | ID: mdl-37503119

RESUMO

The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses.

16.
J Adv Res ; 42: 315-329, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36513421

RESUMO

INTRODUCTION: Legume crops are an important source of protein and oil for human health and in fixing atmospheric N2 for soil enrichment. With an objective to accelerate much-needed genetic analyses and breeding applications, draft genome assemblies were generated in several legume crops; many of them are not high quality because they are mainly based on short reads. However, the superior quality of genome assembly is crucial for a detailed understanding of genomic architecture, genome evolution, and crop improvement. OBJECTIVES: Present study was undertaken with an objective of developing improved chromosome-length genome assemblies in six different legumes followed by their systematic investigation to unravel different aspects of genome organization and legume evolution. METHODS: We employed in situ Hi-C data to improve the existing draft genomes and performed different evolutionary and comparative analyses using improved genome assemblies. RESULTS: We have developed chromosome-length genome assemblies in chickpea, pigeonpea, soybean, subterranean clover, and two wild progenitor species of cultivated groundnut (A. duranensis and A. ipaensis). A comprehensive comparative analysis of these genome assemblies offered improved insights into various evolutionary events that shaped the present-day legume species. We highlighted the expansion of gene families contributing to unique traits such as nodulation in legumes, gravitropism in groundnut, and oil biosynthesis in oilseed legume crops such as groundnut and soybean. As examples, we have demonstrated the utility of improved genome assemblies for enhancing the resolution of "QTL-hotspot" identification for drought tolerance in chickpea and marker-trait associations for agronomic traits in pigeonpea through genome-wide association study. Genomic resources developed in this study are publicly available through an online repository, 'Legumepedia'. CONCLUSION: This study reports chromosome-length genome assemblies of six legume species and demonstrates the utility of these assemblies in crop improvement. The genomic resources developed here will have significant role in accelerating genetic improvement applications of legume crops.


Assuntos
Cicer , Fabaceae , Humanos , Fabaceae/genética , Mapeamento Cromossômico , Genoma de Planta , Estudo de Associação Genômica Ampla , Melhoramento Vegetal , Cicer/genética , Produtos Agrícolas/genética , Glycine max/genética , Cromossomos
17.
Nat Genet ; 51(12): 1664-1669, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31784727

RESUMO

Enhancer elements in the human genome control how genes are expressed in specific cell types and harbor thousands of genetic variants that influence risk for common diseases1-4. Yet, we still do not know how enhancers regulate specific genes, and we lack general rules to predict enhancer-gene connections across cell types5,6. We developed an experimental approach, CRISPRi-FlowFISH, to perturb enhancers in the genome, and we applied it to test >3,500 potential enhancer-gene connections for 30 genes. We found that a simple activity-by-contact model substantially outperformed previous methods at predicting the complex connections in our CRISPR dataset. This activity-by-contact model allows us to construct genome-wide maps of enhancer-gene connections in a given cell type, on the basis of chromatin state measurements. Together, CRISPRi-FlowFISH and the activity-by-contact model provide a systematic approach to map and predict which enhancers regulate which genes, and will help to interpret the functions of the thousands of disease risk variants in the noncoding genome.


Assuntos
Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Elementos Facilitadores Genéticos , Regiões Promotoras Genéticas , Animais , Fator de Transcrição GATA1/genética , Regulação da Expressão Gênica , Desacetilase 6 de Histona/genética , Humanos , Hibridização in Situ Fluorescente , Células K562 , Camundongos , Modelos Genéticos , RNA Guia de Cinetoplastídeos
18.
Cell Syst ; 6(2): 256-258.e1, 2018 Feb 28.
Artigo em Inglês | MEDLINE | ID: mdl-29428417

RESUMO

Contact mapping experiments such as Hi-C explore how genomes fold in 3D. Here, we introduce Juicebox.js, a cloud-based web application for exploring the resulting datasets. Like the original Juicebox application, Juicebox.js allows users to zoom in and out of such datasets using an interface similar to Google Earth. Juicebox.js also has many features designed to facilitate data reproducibility and sharing. Furthermore, Juicebox.js encodes the exact state of the browser in a shareable URL. Creating a public browser for a new Hi-C dataset does not require coding and can be accomplished in under a minute. The web app also makes it possible to create interactive figures online that can complement or replace ordinary journal figures. When combined with Juicer, this makes the entire process of data analysis transparent, insofar as every step from raw reads to published figure is publicly available as open source code.


Assuntos
Biologia Computacional/métodos , Processamento de Imagem Assistida por Computador/métodos , Computação em Nuvem , Gráficos por Computador , Computadores , Análise de Dados , Genoma/genética , Internet , Reprodutibilidade dos Testes , Software
19.
Nat Genet ; 50(8): 1140-1150, 2018 08.
Artigo em Inglês | MEDLINE | ID: mdl-29988122

RESUMO

Over 90% of genetic variants associated with complex human traits map to non-coding regions, but little is understood about how they modulate gene regulation in health and disease. One possible mechanism is that genetic variants affect the activity of one or more cis-regulatory elements leading to gene expression variation in specific cell types. To identify such cases, we analyzed ATAC-seq and RNA-seq profiles from stimulated primary CD4+ T cells in up to 105 healthy donors. We found that regions of accessible chromatin (ATAC-peaks) are co-accessible at kilobase and megabase resolution, consistent with the three-dimensional chromatin organization measured by in situ Hi-C in T cells. Fifteen percent of genetic variants located within ATAC-peaks affected the accessibility of the corresponding peak (local-ATAC-QTLs). Local-ATAC-QTLs have the largest effects on co-accessible peaks, are associated with gene expression and are enriched for autoimmune disease variants. Our results provide insights into how natural genetic variants modulate cis-regulatory elements, in isolation or in concert, to influence gene expression.


Assuntos
Linfócitos T CD4-Positivos/fisiologia , Cromatina/genética , Polimorfismo de Nucleotídeo Único , Adulto , Doenças Autoimunes/genética , Feminino , Regulação da Expressão Gênica , Genótipo , Humanos , Masculino , Sequências Reguladoras de Ácido Nucleico
20.
Science ; 356(6333): 92-95, 2017 04 07.
Artigo em Inglês | MEDLINE | ID: mdl-28336562

RESUMO

The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67× coverage). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Aeaegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that almost all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, and accurate, and can be applied to many species.


Assuntos
Aedes/genética , Mapeamento de Sequências Contíguas/métodos , Genoma de Inseto , Animais , Sequência Conservada , Culex/genética , Rearranjo Gênico , Humanos , Conformação de Ácido Nucleico
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa