ABSTRACT
Spatial genome organization and its effect on transcription remain a fundamental question. We applied an advanced chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) strategy to comprehensively map higher-order chromosome folding and specific chromatin interactions mediated by CCCTC-binding factor (CTCF) and RNA polymerase II (RNAPII) with haplotype specificity and nucleotide resolution in different human cell lineages. We find that CTCF/cohesin-mediated interaction anchors serve as structural foci for spatial organization of constitutive genes concordant with CTCF-motif orientation, whereas RNAPII interacts within these structures by selectively drawing cell-type-specific genes toward CTCF foci for coordinated transcription. Furthermore, we show that haplotype variants and allelic interactions have differential effects on chromosome configuration, influencing gene expression, and may provide mechanistic insights into functions associated with disease susceptibility. 3D genome simulation suggests a model of chromatin folding around chromosomal axes, where CTCF is involved in defining the interface between condensed and open compartments for structural regulation. Our 3D genome strategy thus provides unique insights into the topological mechanism of human variations and diseases.
Subjects
Chromatin/chemistry; Genome, Human; Repressor Proteins/metabolism; Transcription, Genetic; Animals; CCCTC-Binding Factor; Cell Cycle Proteins/metabolism; Cell Line; Chromatin/genetics; Chromatin/metabolism; Chromosomal Proteins, Non-Histone/metabolism; Chromosomes/metabolism; DNA Packaging; Humans; RNA Polymerase II/metabolism; Salamandridae; Cohesins
ABSTRACT
A key finding of the ENCODE project is that the enhancer landscape of mammalian cells undergoes marked alterations during ontogeny. However, the nature and extent of these changes are unclear. As part of the NIH Mouse Regulome Project, we here combined DNaseI hypersensitivity, ChIP-seq, and ChIA-PET technologies to map the promoter-enhancer interactomes of pluripotent ES cells and differentiated B lymphocytes. We confirm that enhancer usage varies widely across tissues. Unexpectedly, we find that this feature extends to broadly transcribed genes, including Myc and Pim1 cell-cycle regulators, which associate with an entirely different set of enhancers in ES and B cells. By means of high-resolution CpG methylomes, genome editing, and digital footprinting, we show that these enhancers recruit lineage-determining factors. Furthermore, we demonstrate that the turning on and off of enhancers during development correlates with promoter activity. We propose that organisms rely on a dynamic enhancer landscape to control basic cellular functions in a tissue-specific manner.
Subjects
B-Lymphocytes/metabolism; Embryonic Stem Cells/metabolism; Enhancer Elements, Genetic; Gene Expression Regulation, Developmental; Promoter Regions, Genetic; Regulon; Animals; Cell Lineage; Cells, Cultured; CpG Islands; DNA Methylation; Genetic Techniques; Mice; Organ Specificity; RNA, Long Noncoding/genetics; Transcription Factors/metabolism; Transcription, Genetic
ABSTRACT
Higher-order chromosomal organization for transcription regulation is poorly understood in eukaryotes. Using genome-wide Chromatin Interaction Analysis with Paired-End-Tag sequencing (ChIA-PET), we mapped long-range chromatin interactions associated with RNA polymerase II in human cells and uncovered widespread promoter-centered intragenic, extragenic, and intergenic interactions. These interactions further aggregated into higher-order clusters, wherein proximal and distal genes were engaged through promoter-promoter interactions. Most genes with promoter-promoter interactions were active and transcribed cooperatively, and some interacting promoters could influence each other implying combinatorial complexity of transcriptional controls. Comparative analyses of different cell lines showed that cell-specific chromatin interactions could provide structural frameworks for cell-specific transcription, and suggested significant enrichment of enhancer-promoter interactions for cell-specific functions. Furthermore, genetically-identified disease-associated noncoding elements were found to be spatially engaged with corresponding genes through long-range interactions. Overall, our study provides insights into transcription regulation by three-dimensional chromatin interactions for both housekeeping and cell-specific genes in human cells.
Subjects
Chromatin/metabolism; Gene Expression Regulation; Promoter Regions, Genetic; RNA Polymerase II/metabolism; Transcription, Genetic; Cell Line, Tumor; Chromatin Immunoprecipitation; Enhancer Elements, Genetic; Genome-Wide Association Study; Humans
ABSTRACT
The genomes of multicellular organisms are extensively folded into 3D chromosome territories within the nucleus [1]. Advanced 3D genome-mapping methods that combine proximity ligation and high-throughput sequencing (such as chromosome conformation capture, Hi-C) [2], and chromatin immunoprecipitation techniques (such as chromatin interaction analysis by paired-end tag sequencing, ChIA-PET) [3], have revealed topologically associating domains [4] with frequent chromatin contacts, and have identified chromatin loops mediated by specific protein factors for insulation and regulation of transcription [5-7]. However, these methods rely on pairwise proximity ligation and reflect population-level views, and thus cannot reveal the detailed nature of chromatin interactions. Although single-cell Hi-C [8] potentially overcomes this issue, this method may be limited by the sparsity of data that is inherent to current single-cell assays. Recent advances in microfluidics have opened opportunities for droplet-based genomic analysis [9], but this approach has not yet been adapted for chromatin interaction analysis. Here we describe a strategy for multiplex chromatin-interaction analysis via droplet-based and barcode-linked sequencing, which we name ChIA-Drop. We demonstrate the robustness of ChIA-Drop in capturing complex chromatin interactions with single-molecule precision, which has not been possible using methods based on population-level pairwise contacts. By applying ChIA-Drop to Drosophila cells, we show that chromatin topological structures predominantly consist of multiplex chromatin interactions with high heterogeneity; ChIA-Drop also reveals promoter-centred multivalent interactions, which provide topological insights into transcription.
Subjects
Chromatin/genetics; Chromatin/metabolism; Microfluidics/methods; Sequence Analysis, DNA/methods; Single Molecule Imaging/methods; Single Molecule Imaging/standards; Animals; Binding Sites/genetics; Cell Line; Chromatin/chemistry; Drosophila melanogaster/cytology; Drosophila melanogaster/genetics; Microfluidics/standards; Nucleic Acid Conformation; Promoter Regions, Genetic/genetics; Protein Binding; RNA Polymerase II/chemistry; RNA Polymerase II/metabolism; Transcription, Genetic
ABSTRACT
Histone H3 lysine-9 di-methylation (H3K9me2) and lysine-27 tri-methylation (H3K27me3) are linked to repression of gene expression, but the functions of repressive histone methylation dynamics during inflammatory responses remain enigmatic. Here, we report that lysine demethylases 7A (KDM7A) and 6A (UTX) play crucial roles in tumor necrosis factor (TNF)-α signaling in endothelial cells (ECs), where they are regulated by a novel TNF-α-responsive microRNA, miR-3679-5p. TNF-α rapidly induces co-occupancy of KDM7A and UTX at nuclear factor kappa-B (NF-κB)-associated elements in human ECs. KDM7A and UTX demethylate H3K9me2 and H3K27me3, respectively, and are both required for activation of NF-κB-dependent inflammatory genes. Chromosome conformation capture-based methods furthermore uncover increased interactions between TNF-α-induced super enhancers at NF-κB-relevant loci, coinciding with KDM7A and UTX recruitments. Simultaneous pharmacological inhibition of KDM7A and UTX significantly reduces leukocyte adhesion in mice, establishing the biological and potential translational relevance of this mechanism. Collectively, these findings suggest that rapid erasure of repressive histone marks by KDM7A and UTX is essential for NF-κB-dependent regulation of genes that control inflammatory responses of ECs.
Subjects
Endothelial Cells/immunology; Histone Demethylases/metabolism; Histones/metabolism; Jumonji Domain-Containing Histone Demethylases/metabolism; MicroRNAs/genetics; Animals; Cell Adhesion; Endothelial Cells/cytology; Endothelial Cells/metabolism; Gene Expression Regulation; Gene Regulatory Networks; Histones/chemistry; Human Umbilical Vein Endothelial Cells; Humans; Lysine/metabolism; Male; Methylation; Mice; Signal Transduction; Tumor Necrosis Factor-alpha/metabolism
ABSTRACT
The aberrant activities of transcription factors such as the androgen receptor (AR) underpin prostate cancer development. While the AR cis-regulation has been extensively studied in prostate cancer, information pertaining to the spatial architecture of the AR transcriptional circuitry remains limited. In this paper, we propose a novel framework to profile long-range chromatin interactions associated with AR and its collaborative transcription factor, erythroblast transformation-specific related gene (ERG), using chromatin interaction analysis by paired-end tag (ChIA-PET). We identified ERG-associated long-range chromatin interactions as a cooperative component in the AR-associated chromatin interactome, acting in concert to achieve coordinated regulation of a subset of AR target genes. Through multifaceted functional data analysis, we found that AR-ERG interaction hub regions are characterized by distinct functional signatures, including bidirectional transcription and cotranscription factor binding. In addition, cancer-associated long noncoding RNAs were found to be connected near protein-coding genes through AR-ERG looping. Finally, we found strong enrichment of prostate cancer genome-wide association study (GWAS) single nucleotide polymorphisms (SNPs) at AR-ERG co-binding sites participating in chromatin interactions and gene regulation, suggesting GWAS target genes identified from chromatin looping data provide more biologically relevant findings than using the nearest gene approach. Taken together, our results revealed an AR-ERG-centric higher-order chromatin structure that drives coordinated gene expression in prostate cancer progression and the identification of potential target genes for therapeutic intervention.
Subjects
Chromatin/metabolism; Gene Expression Regulation, Neoplastic; Prostatic Neoplasms/genetics; Receptors, Androgen/metabolism; Transcription, Genetic; Cell Line, Tumor; Chromatin/chemistry; Gene Regulatory Networks; Genome, Human; Humans; Male; Oncogene Proteins, Fusion/analysis; Polymorphism, Single Nucleotide; Prostatic Neoplasms/metabolism; RNA, Long Noncoding/metabolism; Transcriptional Regulator ERG/metabolism; Transcriptional Regulator ERG/physiology
ABSTRACT
Chromosomal structural variations play an important role in determining the transcriptional landscape of human breast cancers. To assess the nature of these structural variations, we analyzed eight breast tumor samples with a focus on regions of gene amplification using mate-pair sequencing of long-insert genomic DNA with matched transcriptome profiling. We found that tandem duplications appear to be early events in tumor evolution, especially in the genesis of amplicons. In a detailed reconstruction of events on chromosome 17, we found large unpaired inversions and deletions connect a tandemly duplicated ERBB2 with neighboring 17q21.3 amplicons while simultaneously deleting the intervening BRCA1 tumor suppressor locus. This series of events appeared to be unusually common when examined in larger genomic data sets of breast cancers albeit using approaches with lesser resolution. Using siRNAs in breast cancer cell lines, we showed that the 17q21.3 amplicon harbored a significant number of weak oncogenes that appeared consistently coamplified in primary tumors. Down-regulation of BRCA1 expression augmented the cell proliferation in ERBB2-transfected human normal mammary epithelial cells. Coamplification of other functionally tested oncogenic elements in other breast tumors examined, such as RIPK2 and MYC on chromosome 8, also parallel these findings. Our analyses suggest that structural variations efficiently orchestrate the gain and loss of cancer gene cassettes that engage many oncogenic pathways simultaneously and that such oncogenic cassettes are favored during the evolution of a cancer.
Subjects
BRCA1 Protein/genetics; Breast Neoplasms/genetics; Chromosome Aberrations; Chromosomes, Human, Pair 17/genetics; Receptor, ErbB-2/genetics; Base Sequence; Cell Line, Tumor; Female; Gene Amplification; Gene Duplication; Gene Expression Profiling; Gene Expression Regulation, Neoplastic; Humans; MCF-7 Cells; Molecular Sequence Data; Sequence Analysis, DNA
ABSTRACT
Tumour necrosis factor alpha (TNFα) is a potent cytokine that signals through nuclear factor kappa B (NFκB) to activate a subset of human genes. It is usually assumed that this involves RNA polymerases transcribing responsive genes wherever they might be in the nucleus. Using primary human endothelial cells, variants of chromosome conformation capture (including 4C and chromatin interaction analysis with paired-end tag sequencing), and fluorescence in situ hybridization to detect single nascent transcripts, we show that TNFα induces responsive genes to congregate in discrete 'NFκB factories'. Some factories further specialize in transcribing responsive genes encoding micro-RNAs that target downregulated mRNAs. We expect all signalling pathways to contain this extra leg, where responding genes are transcribed in analogous specialized factories.
Subjects
Gene Expression Regulation; MicroRNAs/metabolism; Tumor Necrosis Factor-alpha/metabolism; Chromosomes/ultrastructure; Cytokines/biosynthesis; Cytoplasm/metabolism; DNA-Directed RNA Polymerases/metabolism; Endothelial Cells/cytology; Humans; In Situ Hybridization; In Situ Hybridization, Fluorescence; N-Acetylglucosaminyltransferases/metabolism; NF-kappa B/metabolism; Protein Conformation; Repressor Proteins/metabolism; Signal Transduction; Smad Proteins/metabolism; Time Factors; Transcription, Genetic; Transforming Growth Factor beta/metabolism
ABSTRACT
The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical and experimental approaches to investigate these genes have been hampered by the lack of comprehensive lncRNA annotation. Here, we present and analyze the most complete human lncRNA annotation to date, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts. Our analyses indicate that lncRNAs are generated through pathways similar to that of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths. In contrast to protein-coding genes, however, lncRNAs display a striking bias toward two-exon transcripts, they are predominantly localized in the chromatin and nucleus, and a fraction appear to be preferentially processed into small RNAs. They are under stronger selective pressure than neutrally evolving sequences-particularly in their promoter regions, which display levels of selection comparable to protein-coding genes. Importantly, about one-third seem to have arisen within the primate lineage. Comprehensive analysis of their expression in multiple human organs and brain regions shows that lncRNAs are generally lower expressed than protein-coding genes, and display more tissue-specific expression patterns, with a large fraction of tissue-specific lncRNAs expressed in the brain. Expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes. This GENCODE annotation represents a valuable resource for future studies of lncRNAs.
Subjects
Databases, Genetic; RNA, Long Noncoding/genetics; Alternative Splicing; Animals; Cell Nucleus/genetics; Cell Nucleus/metabolism; Cluster Analysis; Evolution, Molecular; Exons; Gene Expression Profiling; Gene Expression Regulation; Histones/metabolism; Humans; Molecular Sequence Annotation; Open Reading Frames; Organ Specificity/genetics; Primates/genetics; RNA Processing, Post-Transcriptional; RNA Splice Sites; RNA, Messenger/genetics; Selection, Genetic; Transcription, Genetic
ABSTRACT
Genomes are organized into high-level three-dimensional structures, and DNA elements separated by long genomic distances can in principle interact functionally. Many transcription factors bind to regulatory DNA elements distant from gene promoters. Although distal binding sites have been shown to regulate transcription by long-range chromatin interactions at a few loci, chromatin interactions and their impact on transcription regulation have not been investigated in a genome-wide manner. Here we describe the development of a new strategy, chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) for the de novo detection of global chromatin interactions, with which we have comprehensively mapped the chromatin interaction network bound by oestrogen receptor alpha (ER-alpha) in the human genome. We found that most high-confidence remote ER-alpha-binding sites are anchored at gene promoters through long-range chromatin interactions, suggesting that ER-alpha functions by extensive chromatin looping to bring genes together for coordinated transcriptional regulation. We propose that chromatin interactions constitute a primary mechanism for regulating transcription in mammalian genomes.
Subjects
Chromatin/genetics; Chromatin/metabolism; Estrogen Receptor alpha/metabolism; Genome, Human/genetics; Binding Sites; Cell Line; Chromatin Immunoprecipitation; Cross-Linking Reagents; Formaldehyde; Humans; Promoter Regions, Genetic/genetics; Protein Binding; Reproducibility of Results; Sequence Analysis, DNA; Transcription, Genetic; Transcriptional Activation
ABSTRACT
BACKGROUND: Because few studies exist to describe the unique molecular network regulation behind pig pre-implantation embryonic development (PED), genetic engineering in the pig embryo is limited. Also, this lack of research has hindered derivation and application of porcine embryonic stem cells and porcine induced pluripotent stem cells (iPSCs). RESULTS: We identified and analyzed the genome wide transcriptomes of pig in vivo-derived and somatic cell nuclear transferred (SCNT) as well as mouse in vivo-derived pre-implantation embryos at different stages using mRNA deep sequencing. Comparison of the pig embryonic transcriptomes with those of mouse and human pre-implantation embryos revealed unique gene expression patterns during pig PED. Pig zygotic genome activation was confirmed to occur at the 4-cell stage via genome-wide gene expression analysis. This activation was delayed to the 8-cell stage in SCNT embryos. Specific gene expression analysis of the putative inner cell mass (ICM) and the trophectoderm (TE) revealed that pig and mouse pre-implantation embryos share regulatory networks during the first lineage segregation and primitive endoderm differentiation, but not during ectoderm commitment. Also, fatty acid metabolism appears to be a unique characteristic of pig pre-implantation embryonic development. In addition, the global gene expression patterns in the pig SCNT embryos were different from those in in vivo-derived pig embryos. CONCLUSIONS: Our results provide a resource for pluripotent stem cell engineering and for understanding pig development.
Subjects
Blastocyst/metabolism; Embryo, Mammalian/metabolism; High-Throughput Nucleotide Sequencing; Animals; Biomarkers/metabolism; Ectoderm/metabolism; Embryo, Mammalian/cytology; Embryonic Development; Fatty Acids/metabolism; Female; Gene Expression Profiling; Gene Regulatory Networks; Genetic Engineering; Genetic Linkage; Genome; Mice; Mice, Inbred C57BL; Pluripotent Stem Cells/cytology; Pluripotent Stem Cells/metabolism; RNA, Messenger/chemistry; RNA, Messenger/metabolism; Sequence Analysis, RNA; Swine; Transcriptome
ABSTRACT
Using a long-span, paired-end deep sequencing strategy, we have comprehensively identified cancer genome rearrangements in eight breast cancer genomes. Herein, we show that 40%-54% of these structural genomic rearrangements result in different forms of fusion transcripts and that 44% are potentially translated. We find that single segmental tandem duplication spanning several genes is a major source of the fusion gene transcripts in both cell lines and primary tumors involving adjacent genes placed in the reverse-order position by the duplication event. Certain other structural mutations, however, tend to attenuate gene expression. From these candidate gene fusions, we have found a fusion transcript (RPS6KB1-VMP1) recurrently expressed in ~30% of breast cancers associated with potential clinical consequences. This gene fusion is caused by tandem duplication on 17q23 and appears to be an indicator of local genomic instability altering the expression of oncogenic components such as MIR21 and RPS6KB1.
Subjects
Breast Neoplasms/metabolism; Gene Rearrangement; Genome, Human/genetics; Membrane Proteins/genetics; Membrane Proteins/metabolism; Recombinant Fusion Proteins/metabolism; Ribosomal Protein S6 Kinases/metabolism; Transcription, Genetic; Breast Neoplasms/genetics; Cell Line, Tumor; Chromosome Mapping; Chromosomes, Human, Pair 17/genetics; Female; Gene Dosage; Gene Expression Profiling; Genomic Instability; High-Throughput Nucleotide Sequencing; Humans; Recombinant Fusion Proteins/genetics; Ribosomal Protein S6 Kinases/genetics; Sequence Analysis, DNA
ABSTRACT
Somatic genome rearrangements are thought to play important roles in cancer development. We optimized a long-span paired-end-tag (PET) sequencing approach using 10-Kb genomic DNA inserts to study human genome structural variations (SVs). The use of a 10-Kb insert size allows the identification of breakpoints within repetitive or homology-containing regions of a few kilobases in size and results in a higher physical coverage compared with small insert libraries with the same sequencing effort. We have applied this approach to comprehensively characterize the SVs of 15 cancer and two noncancer genomes and used a filtering approach to strongly enrich for somatic SVs in the cancer genomes. Our analyses revealed that most inversions, deletions, and insertions are germ-line SVs, whereas tandem duplications, unpaired inversions, interchromosomal translocations, and complex rearrangements are over-represented among somatic rearrangements in cancer genomes. We demonstrate that the quantitative and connective nature of DNA-PET data is precise in delineating the genealogy of complex rearrangement events, we observe signatures that are compatible with breakage-fusion-bridge cycles, and we discover that large duplications are among the initial rearrangements that trigger genome instability for extensive amplification in epithelial cancers.
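The physical-coverage claim above can be illustrated with a back-of-the-envelope calculation. The sketch below is not taken from the study: the genome size, the number of read pairs, and the 300-bp short-insert size are illustrative assumptions, and `physical_coverage` is a hypothetical helper that uses the common definition of physical coverage (number of read pairs times insert span, divided by genome size).

```python
def physical_coverage(n_pairs: int, insert_size_bp: int, genome_size_bp: int) -> float:
    """Average number of paired-end inserts spanning any one genomic position."""
    return n_pairs * insert_size_bp / genome_size_bp


# Illustrative assumptions, not values reported in the abstract.
GENOME_SIZE_BP = 3_000_000_000  # approximate size of the human genome
N_PAIRS = 30_000_000            # same sequencing effort for both libraries

long_span = physical_coverage(N_PAIRS, 10_000, GENOME_SIZE_BP)  # 10-kb inserts
short_span = physical_coverage(N_PAIRS, 300, GENOME_SIZE_BP)    # assumed 300-bp inserts

print(f"10-kb inserts:  {long_span:.1f}x physical coverage")
print(f"300-bp inserts: {short_span:.1f}x physical coverage")
```

Under these assumptions the same number of read pairs gives about 100x physical coverage with 10-kb inserts and about 3x with 300-bp inserts, so a breakpoint is spanned by many more inserts in the long-span library.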
Subjects
Base Pairing/genetics; Breast Neoplasms/genetics; Chromosome Mapping/methods; Genome, Human/genetics; Genomic Structural Variation/genetics; Stomach Neoplasms/genetics; Cell Line, Tumor; Computational Biology; DNA/genetics; Female; Gene Rearrangement; Humans; Sequence Analysis, DNA
ABSTRACT
Long-range chromatin contacts between specific DNA regulatory elements play a pivotal role in gene expression regulation, and a global characterization of these interactions in the 3-dimensional (3D) chromatin structure is imperative in understanding signaling networks and cell states. Chromatin Interaction Analysis using Paired-End Tag sequencing (ChIA-PET) is a method which converts functional chromatin structure into millions of short tag sequences. Combining Chromatin Immunoprecipitation (ChIP), proximity ligation and high-throughput sequencing, ChIA-PET provides a global and unbiased interrogation of higher-order chromatin structures associated with specific protein factors. Here, we describe the detailed procedures of the ChIA-PET methodology, unraveling transcription-associated chromatin contacts in a model human cell line.
Subjects
Chromatin/genetics; Sequence Analysis, DNA; Base Sequence; Cell Line; Chromatin/ultrastructure; Chromatin Immunoprecipitation; Chromosome Mapping; Cross-Linking Reagents/chemistry; DNA/chemistry; DNA/genetics; DNA/isolation & purification; Epistasis, Genetic; Gene Library; High-Throughput Nucleotide Sequencing; Humans; Molecular Sequence Data; Polymerase Chain Reaction; Sonication; Succinimides/chemistry
ABSTRACT
MicroRNAs (miRNAs) are a class of small, noncoding RNAs that function as posttranscriptional regulators of gene expression. Many miRNAs are expressed in the developing brain and regulate multiple aspects of neural development, including neurogenesis, dendritogenesis, and synapse formation. Rett syndrome (RTT) is a progressive neurodevelopmental disorder caused by mutations in the gene encoding methyl-CpG-binding protein 2 (MECP2). Although Mecp2 is known to act as a global transcriptional regulator, miRNAs that are directly regulated by Mecp2 in the brain are not known. Using massively parallel sequencing methods, we have identified miRNAs whose expression is altered in cerebella of Mecp2-null mice before and after the onset of severe neurological symptoms. In vivo genome-wide analyses indicate that promoter regions of a significant fraction of dysregulated miRNA transcripts, including a large polycistronic cluster of brain-specific miRNAs, are DNA-methylated and are bound directly by Mecp2. Functional analysis demonstrates that the 3' UTR of messenger RNA encoding Brain-derived neurotrophic factor (Bdnf) can be targeted by multiple miRNAs aberrantly up-regulated in the absence of Mecp2. Taken together, these results suggest that dysregulation of miRNAs may contribute to RTT pathoetiology and also may provide a valuable resource for further investigations of the role of miRNAs in RTT.
Subjects
Disease Models, Animal; Genome-Wide Association Study; Methyl-CpG-Binding Protein 2/physiology; MicroRNAs/genetics; Rett Syndrome/genetics; 3' Untranslated Regions; Animals; Chromatin Immunoprecipitation; Enzyme-Linked Immunosorbent Assay; Methyl-CpG-Binding Protein 2/genetics; Mice; Mice, Knockout; Promoter Regions, Genetic; Rett Syndrome/metabolism
ABSTRACT
Chromatin Interaction Analysis Using Paired-End Tag Sequencing (ChIA-PET) is an established method to map protein-mediated chromatin interactions. A limitation, however, is that it requires a hundred million cells per experiment, which hampers its broad application in biomedical research, particularly in studies in which it is impractical to obtain a large number of cells from rare samples. To reduce the required input cell number while retaining high data quality, we developed an in situ ChIA-PET protocol, which requires as few as 1 million cells. Here, we describe detailed step-by-step procedures for performing in situ ChIA-PET from cultured cells, including both an experimental protocol for sample preparation and data generation and a computational protocol for data processing and visualization using the ChIA-PIPE pipeline. As the protocol significantly simplifies the experimental procedure, reduces ligation noise, and decreases the required input of cells compared to previous versions of ChIA-PET protocols, it can be applied to generate high-resolution chromatin contact maps mediated by various protein factors for a wide range of human and mouse primary cells. © 2021 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Sample preparation and data generation Support Protocol: Bridge linker preparation Basic Protocol 2: Data processing and visualization.
Subjects
Chromatin; Genetic Techniques; Animals; Cell Line; Data Interpretation, Statistical; Humans; Mice; Sequence Analysis, DNA
ABSTRACT
OBJECTIVES: The availability of matched sequencing data for the same sample across different sequencing platforms is a necessity for validation and effective comparison of sequencing platforms. A commonly sequenced sample is the lab-adapted MG1655 strain of Escherichia coli; however, this strain is not fully representative of more complex and dynamic genomes of pathogenic E. coli strains. DATA DESCRIPTION: We present six new sequencing data sets for another E. coli strain, UTI89, which is an extraintestinal pathogenic strain isolated from a patient suffering from a urinary tract infection. We now provide matched whole genome sequencing data generated using the PacBio RSII, Oxford Nanopore MinION R9.4, Ion Torrent, ABI SOLiD, and Illumina NextSeq sequencers. Together with other publicly available datasets, UTI89 has a nearly complete suite of data generated on most second- and third-generation sequencers. These data can be used as an additional validation set for new sequencing technologies and analytical methods. More than being another E. coli strain, however, UTI89 is pathogenic, with a 10% larger genome, additional pathogenicity islands, and a large plasmid, features that are common among other naturally occurring and disease-causing E. coli isolates. These data therefore provide a more medically relevant test set for development of algorithms.
Subjects
Escherichia coli Infections; Escherichia coli Proteins; Urinary Tract Infections; Escherichia coli/genetics; Escherichia coli Proteins/genetics; High-Throughput Nucleotide Sequencing; Humans
ABSTRACT
BACKGROUND: Acute promyelocytic leukemia (APL) is characterized by the oncogenic fusion protein PML-RARα, a major etiological agent in APL. However, the molecular mechanisms underlying the role of PML-RARα in leukemogenesis remain largely unknown. RESULTS: Using an inducible system, we comprehensively analyze the 3D genome organization in myeloid cells and its reorganization after PML-RARα induction and perform additional analyses in patient-derived APL cells with native PML-RARα. We discover that PML-RARα mediates extensive chromatin interactions genome-wide. Globally, it redefines the chromatin topology of the myeloid genome toward a more condensed configuration in APL cells; locally, it intrudes into RNAPII-associated interaction domains, interrupts the binding of myeloid-specific transcription factors at enhancers and super-enhancers, and leads to transcriptional repression of genes critical for myeloid differentiation and maturation. CONCLUSIONS: Our results not only provide novel topological insights for the roles of PML-RARα in transforming myeloid cells into leukemia cells, but further uncover a topological framework of a molecular mechanism for oncogenic fusion proteins in cancers.
Subjects
Chromatin Assembly and Disassembly; Gene Expression Regulation, Neoplastic; Leukemia, Promyelocytic, Acute/metabolism; Oncogene Proteins, Fusion/metabolism; Cell Line, Tumor; Humans; Leukemia, Promyelocytic, Acute/etiology
ABSTRACT
Isogenic laboratory mouse strains enhance reproducibility because individual animals are genetically identical. For the most widely used isogenic strain, C57BL/6, there exists a wealth of genetic, phenotypic, and genomic data, including a high-quality reference genome (GRCm38.p6). Now 20 years after the first release of the mouse reference genome, C57BL/6J mice are at least 26 inbreeding generations removed from GRCm38 and the strain is now maintained with periodic reintroduction of cryorecovered mice derived from a single breeder pair, aptly named Adam and Eve. To provide an update to the mouse reference genome that more accurately represents the genome of today's C57BL/6J mice, we took advantage of long read, short read, and optical mapping technologies to generate a de novo assembly of the C57BL/6J Eve genome (B6Eve). Using these data, we have addressed recurring variants observed in previous mouse genomic studies. We have also identified structural variations, closed gaps in the mouse reference assembly, and revealed previously unannotated coding sequences. This B6Eve assembly explains discrepant observations that have been associated with GRCm38-based analyses, and will inform a reference genome that is more representative of the C57BL/6J mice that are in use today.
Subjects
Genome; Genomics; Animals; Computational Biology/methods; Female; Genomics/methods; Inbreeding; Male; Mice; Mice, Inbred C57BL; Pedigree; Phenotype; Polymorphism, Single Nucleotide
ABSTRACT
Proper gene expression involves communication between the regulatory elements and promoters of genes. Because regulatory elements can be located over a large range of genomic distances (from as close as a few hundred bp to as much as several Mb away), contact and communication between regulators and the core transcriptional machinery at promoters are mediated through DNA looping. Today, chromosome conformation capture (3C)-based methods efficiently probe chromosome folding in the nucleus and thus provide a molecular description of physical proximity between enhancer(s) and their target promoter(s). One such method, chromatin interaction analysis using paired-end-tag (ChIA-PET) sequencing, is a leading high-throughput method for detection of genome-wide chromatin interactions. Briefly, the method involves cross-linkage of chromatin (-DNA) fibers in cells in situ, fragmentation of the fixed chromatin-DNA complexes by sonication, followed by enrichment of the chromatin complexes with a dedicated antibody through the process of immunoprecipitation (IP). Next, application of the ChIA-PET protocol followed by deep sequencing and mapping of reads to the reference genome reveals both binding sites and remote chromatin interactions mediated by the protein factors of interest. The method detailed here focuses on ChIP sample preparation and can be completed in ~5 d. The ChIA-PET method is detailed in an associated protocol. Because not all chromatin immunoprecipitation protocols are suitable for ChIA-PET, it is important to strictly follow this procedure before performing the ChIA-PET protocol.