Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
1.
Genome Res ; 32(4): 682-698, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35354608

RESUMO

The DNA in many organisms, including humans, is shown to be organized in topologically associating domains (TADs). In Drosophila, several architectural proteins are enriched at TAD borders, but it is still unclear whether these proteins play a functional role in the formation and maintenance of TADs. Here, we show that depletion of BEAF-32, Cp190, Chro, and Dref leads to changes in TAD organization and chromatin loops. Their depletion predominantly affects TAD borders located in regions moderately enriched in repressive modifications and depleted in active ones, whereas TAD borders located in euchromatin are resilient to these knockdowns. Furthermore, transcriptomic data has revealed hundreds of genes displaying differential expression in these knockdowns and showed that the majority of differentially expressed genes are located within reorganized TADs. Our work identifies a novel and functional role for architectural proteins at TAD borders in Drosophila and a link between TAD reorganization and subsequent changes in gene expression.


Assuntos
Cromatina , Proteínas de Drosophila , Animais , Cromatina/genética , Cromossomos/metabolismo , Proteínas de Ligação a DNA/genética , Drosophila/genética , Drosophila/metabolismo , Proteínas de Drosophila/genética , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Proteínas do Olho/genética , Proteínas Associadas aos Microtúbulos/genética , Proteínas Nucleares/genética , Fatores de Transcrição/metabolismo
2.
Nucleic Acids Res ; 51(16): 8480-8495, 2023 09 08.
Artigo em Inglês | MEDLINE | ID: mdl-37486787

RESUMO

Transcription factors (TFs) are proteins that affect gene expression by binding to regulatory regions of DNA in a sequence specific manner. The binding of TFs to DNA is controlled by many factors, including the DNA sequence, concentration of TF, chromatin accessibility and co-factors. Here, we systematically investigated the binding mechanism of hundreds of TFs by analysing ChIP-seq data with our explainable statistical model, ChIPanalyser. This tool uses as inputs the DNA sequence binding motif; the capacity to distinguish between strong and weak binding sites; the concentration of TF; and chromatin accessibility. We found that approximately one third of TFs are predicted to bind the genome in a DNA accessibility independent fashion, which includes TFs that can open the chromatin, their co-factors and TFs with similar motifs. Our model predicted this to be the case when the TF binds to its strongest binding regions in the genome, and only a small number of TFs have the capacity to bind dense chromatin at their weakest binding regions, such as CTCF, USF2 and CEBPB. Our study demonstrated that the binding of hundreds of human and mouse TFs is predicted by ChIPanalyser with high accuracy and showed that many TFs can bind dense chromatin.


Assuntos
Cromatina , Fatores de Transcrição , Humanos , Animais , Camundongos , Cromatina/genética , Fatores de Transcrição/metabolismo , Cromossomos/metabolismo , DNA/química , Sítios de Ligação/genética , Ligação Proteica , Mamíferos/genética
3.
Nucleic Acids Res ; 50(6): e33, 2022 04 08.
Artigo em Inglês | MEDLINE | ID: mdl-34908129

RESUMO

Despite a vast expansion in the availability of epigenomic data, our knowledge of the chromatin landscape at interspersed repeats remains highly limited by difficulties in mapping short-read sequencing data to these regions. In particular, little is known about the locus-specific regulation of evolutionarily young transposable elements (TEs), which have been implicated in genome stability, gene regulation and innate immunity in a variety of developmental and disease contexts. Here we propose an approach for generating locus-specific protein-DNA binding profiles at interspersed repeats, which leverages information on the spatial proximity between repetitive and non-repetitive genomic regions. We demonstrate that the combination of HiChIP and a newly developed mapping tool (PAtChER) yields accurate protein enrichment profiles at individual repetitive loci. Using this approach, we reveal previously unappreciated variation in the epigenetic profiles of young TE loci in mouse and human cells. Insights gained using our method will be invaluable for dissecting the molecular determinants of TE regulation and their impact on the genome.


Assuntos
Cromatina , Elementos de DNA Transponíveis , Animais , Cromatina/genética , Elementos de DNA Transponíveis/genética , Regulação da Expressão Gênica , Genômica , Humanos , Camundongos
4.
Bioinformatics ; 38(14): 3523-3531, 2022 07 11.
Artigo em Inglês | MEDLINE | ID: mdl-35678507

RESUMO

MOTIVATION: Several computational and statistical methods have been developed to analyze data generated through the 3C-based methods, especially the Hi-C. Most of the existing methods do not account for dependency in Hi-C data. RESULTS: Here, we present ZipHiC, a novel statistical method to explore Hi-C data focusing on the detection of enriched contacts. ZipHiC implements a Bayesian method based on a hidden Markov random field (HMRF) model and the Approximate Bayesian Computation (ABC) to detect interactions in two-dimensional space based on a Hi-C contact frequency matrix. ZipHiC uses data on the sources of biases related to the contact frequency matrix, allows borrowing information from neighbours using the Potts model and improves computation speed using the ABC model. In addition to outperforming existing tools on both simulated and real data, our model also provides insights into different sources of biases that affects Hi-C data. We show that some datasets display higher biases from DNA accessibility or Transposable Elements content. Furthermore, our analysis in Drosophila melanogaster showed that approximately half of the detected significant interactions connect promoters with other parts of the genome indicating a functional biological role. Finally, we found that the micro-C datasets display higher biases from DNA accessibility compared to a similar Hi-C experiment, but this can be corrected by ZipHiC. AVAILABILITY AND IMPLEMENTATION: The R scripts are available at https://github.com/igosungithub/HMRFHiC.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Drosophila melanogaster , Software , Animais , Teorema de Bayes , Viés , DNA , Cromatina
5.
Genome Res ; 29(4): 613-625, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30709849

RESUMO

The organization of the genome into topologically associating domains (TADs) was shown to have a regulatory role in development and cellular function, but the mechanism involved in TAD establishment is still unclear. Here, we present the first high-resolution contact map of Drosophila neuronal cells (BG3) and identify different classes of TADs by comparing this to genome organization in embryonic cells (Kc167). We find that only some TADs are conserved in both cell lines, whereas the rest are cell-type-specific. This is supported by a change in the enrichment of architectural proteins at TAD borders, with BEAF-32 present in embryonic cells and CTCF in neuronal cells. Furthermore, we observe strong divergent transcription, together with RNA Polymerase II occupancy and an increase in DNA accessibility at the TAD borders. TAD borders that are specific to neuronal cells are enriched in enhancers controlled by neuronal-specific transcription factors. Our results suggest that TADs are dynamic across developmental stages and reflect the interplay between insulators, transcriptional states, and enhancer activities.


Assuntos
Montagem e Desmontagem da Cromatina , Cromatina/genética , Regulação da Expressão Gênica no Desenvolvimento , Neurogênese , Animais , Fator de Ligação a CCCTC/metabolismo , Linhagem Celular , Cromatina/química , Proteínas de Ligação a DNA/metabolismo , Proteínas de Drosophila/metabolismo , Drosophila melanogaster , Proteínas do Olho/metabolismo , Genoma de Inseto , Neurônios/citologia , Neurônios/metabolismo , Ligação Proteica , RNA Polimerase II/metabolismo
6.
EMBO J ; 36(5): 617-628, 2017 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-28069706

RESUMO

Transgenerationally heritable epialleles are defined by the stable propagation of alternative transcriptional states through mitotic and meiotic cell cycles. Given that the propagation of DNA methylation at CpG sites, mediated in Arabidopsis by MET1, plays a central role in epigenetic inheritance, we examined genomewide DNA methylation in partial and complete loss-of-function met1 mutants. We interpreted the data in relation to transgenerational epiallelic stability, which allowed us to classify chromosomal targets of epigenetic regulation into (i) single copy and methylated exclusively at CpGs, readily forming epialleles, and (ii) transposon-derived, methylated at all cytosines, which may or may not form epialleles. We provide evidence that DNA sequence features such as density of CpGs and genomic repetitiveness of the loci predispose their susceptibility to epiallelic switching. The importance and predictive power of these genetic features were confirmed by analyses of common epialleles in natural Arabidopsis accessions, epigenetic recombinant inbred lines (epiRILs) and also verified in rice.


Assuntos
Proteínas de Arabidopsis/metabolismo , Arabidopsis/genética , DNA (Citosina-5-)-Metiltransferases/metabolismo , Metilação de DNA , Epigênese Genética , Regulação da Expressão Gênica de Plantas , Proteínas de Arabidopsis/genética , DNA (Citosina-5-)-Metiltransferases/genética , DNA de Plantas/química , DNA de Plantas/metabolismo , Mutação
7.
Nucleic Acids Res ; 46(19): e114, 2018 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-29986099

RESUMO

DNA methylation has been associated with transcriptional repression and detection of differential methylation is important in understanding the underlying causes of differential gene expression. Bisulfite-converted genomic DNA sequencing is the current gold standard in the field for building genome-wide maps at a base pair resolution of DNA methylation. Here we systematically investigate the underlying features of detecting differential DNA methylation in CpG and non-CpG contexts, considering both the case of mammalian systems and plants. In particular, we introduce DMRcaller, a highly efficient R/Bioconductor package, which implements several methods to detect differentially methylated regions (DMRs) between two samples. Most importantly, we show that different algorithms are required to compute DMRs and the most appropriate algorithm in each case depends on the sequence context and levels of methylation. Furthermore, we show that DMRcaller outperforms other available packages and we propose a new method to select the parameters for this tool and for other available tools. DMRcaller is a comprehensive tool for differential methylation analysis which displays high sensitivity and specificity for the detection of DMRs and performs entire genome wide analysis within a few hours.


Assuntos
Biologia Computacional/métodos , Ilhas de CpG/genética , Metilação de DNA , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Software , Algoritmos , Animais , Processamento Eletrônico de Dados , Humanos , Anotação de Sequência Molecular/métodos , Sensibilidade e Especificidade
8.
Nucleic Acids Res ; 45(7): 3777-3784, 2017 04 20.
Artigo em Inglês | MEDLINE | ID: mdl-28053115

RESUMO

Methylation of cytosine is an epigenetic mark involved in the regulation of transcription, usually associated with transcriptional repression. In mammals, methylated cytosines are found predominantly in CpGs but in plants non-CpG methylation (in the CpHpG or CpHpH contexts, where H is A, C or T) is also present and is associated with the transcriptional silencing of transposable elements. In addition, CpG methylation is found in coding regions of active genes. In the absence of the demethylase of lysine 9 of histone 3 (IBM1), a subset of body-methylated genes acquires non-CpG methylation. This was shown to alter their expression and affect plant development. It is not clear why only certain body-methylated genes gain non-CpG methylation in the absence of IBM1 and others do not. Here we describe a link between CpG methylation and the establishment of methylation in the CpHpG context that explains the two classes of body-methylated genes. We provide evidence that external cytosines of CpCpG sites can only be methylated when internal cytosines are methylated. CpCpG sites methylated in both cytosines promote spreading of methylation in the CpHpG context in genes protected by IBM1. In contrast, CpCpG sites remain unmethylated in IBM1-independent genes and do not promote spread of CpHpG methylation.


Assuntos
Ilhas de CpG , Citosina/metabolismo , Metilação de DNA , Epigênese Genética , Regulação da Expressão Gênica de Plantas , Arabidopsis/genética , Arabidopsis/metabolismo , Proteínas de Arabidopsis/genética , Genes de Plantas , Histona Desmetilases com o Domínio Jumonji/genética
9.
Nucleic Acids Res ; 43(1): 84-94, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25432957

RESUMO

The binding of transcription factors (TFs) is essential for gene expression. One important characteristic is the actual occupancy of a putative binding site in the genome. In this study, we propose an analytical model to predict genomic occupancy that incorporates the preferred target sequence of a TF in the form of a position weight matrix (PWM), DNA accessibility data (in the case of eukaryotes), the number of TF molecules expected to be bound specifically to the DNA and a parameter that modulates the specificity of the TF. Given actual occupancy data in the form of ChIP-seq profiles, we backwards inferred copy number and specificity for five Drosophila TFs during early embryonic development: Bicoid, Caudal, Giant, Hunchback and Kruppel. Our results suggest that these TFs display thousands of molecules that are specifically bound to the DNA and that whilst Bicoid and Caudal display a higher specificity, the other three TFs (Giant, Hunchback and Kruppel) display lower specificity in their binding (despite having PWMs with higher information content). This study gives further weight to earlier investigations into TF copy numbers that suggest a significant proportion of molecules are not bound specifically to the DNA.


Assuntos
Elementos Reguladores de Transcrição , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , Núcleo Celular/metabolismo , DNA/metabolismo , Drosophila melanogaster/embriologia , Drosophila melanogaster/genética , Genômica , Matrizes de Pontuação de Posição Específica , Ligação Proteica
10.
Nucleic Acids Res ; 42(7): 4196-207, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24476912

RESUMO

Site-specific transcription factors (TFs) bind to their target sites on the DNA, where they regulate the rate at which genes are transcribed. Bacterial TFs undergo facilitated diffusion (a combination of 3D diffusion around and 1D random walk on the DNA) when searching for their target sites. Using computer simulations of this search process, we show that the organization of the binding sites, in conjunction with TF copy number and binding site affinity, plays an important role in determining not only the steady state of promoter occupancy, but also the order at which TFs bind. These effects can be captured by facilitated diffusion-based models, but not by standard thermodynamics. We show that the spacing of binding sites encodes complex logic, which can be derived from combinations of three basic building blocks: switches, barriers and clusters, whose response alone and in higher orders of organization we characterize in detail. Effective promoter organizations are commonly found in the E. coli genome and are highly conserved between strains. This will allow studies of gene regulation at a previously unprecedented level of detail, where our framework can create testable hypothesis of promoter logic.


Assuntos
Escherichia coli/genética , Regiões Promotoras Genéticas , Sítios de Ligação , Fatores de Transcrição/metabolismo
11.
Bioinformatics ; 28(9): 1287-9, 2012 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-22426343

RESUMO

MOTIVATION: Transcription factors (TFs) are proteins that regulate gene activity by binding to specific sites on the DNA. Understanding the way these molecules locate their target site is of great importance in understanding gene regulation. We developed a comprehensive computational model of this process and estimated the model parameters in (N.R.Zabet and B.Adryan, submitted for publication). RESULTS: GRiP (gene regulation in prokaryotes) is a highly versatile implementation of this model and simulates the search process in a computationally efficient way. This program aims to provide researchers in the field with a flexible and highly customizable simulation framework. Its features include representation of DNA sequence, TFs and the interaction between TFs and the DNA (facilitated diffusion mechanism), or between various TFs (cooperative behaviour). The software will record both information on the dynamics associated with the search process (locations of molecules) and also steady-state results (affinity landscape, occupancy-bias and collision hotspots). AVAILABILITY: http://logic.sysbiol.cam.ac.uk/grip, program and source code


Assuntos
Software , Fatores de Transcrição/metabolismo , Algoritmos , Sequência de Bases , Sítios de Ligação , Simulação por Computador , DNA/química , DNA/genética , DNA/metabolismo , Escherichia coli K12/genética , Escherichia coli K12/metabolismo , Regulação da Expressão Gênica , Células Procarióticas/metabolismo , Ligação Proteica , Fatores de Transcrição/química , Fatores de Transcrição/genética
12.
Bioinformatics ; 28(11): 1517-24, 2012 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-22492644

RESUMO

MOTIVATION: Gene activity is mediated by site-specific transcription factors (TFs). Their binding to defined regions in the genome determines the rate at which their target genes are transcribed. RESULTS: We present a comprehensive computational model of the search process of TF for their genomic target site(s). The computational model considers: the DNA sequence, various TF species and the interaction of the individual molecules with the DNA or between themselves. We also demonstrate a systematic approach how to parametrize the system using available experimental data.


Assuntos
Simulação por Computador , DNA/metabolismo , Escherichia coli/metabolismo , Repressores Lac/metabolismo , DNA/química , Difusão Facilitada , Repressores Lac/química , Modelos Químicos , Termodinâmica , Ativação Transcricional
13.
Nat Struct Mol Biol ; 30(7): 935-947, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37308596

RESUMO

Mammalian genomes harbor abundant transposable elements (TEs) and their remnants, with numerous epigenetic repression mechanisms enacted to silence TE transcription. However, TEs are upregulated during early development, neuronal lineage, and cancers, although the epigenetic factors contributing to the transcription of TEs have yet to be fully elucidated. Here, we demonstrate that the male-specific lethal (MSL)-complex-mediated histone H4 acetylation at lysine 16 (H4K16ac) is enriched at TEs in human embryonic stem cells (hESCs) and cancer cells. This in turn activates transcription of subsets of full-length long interspersed nuclear elements (LINE1s, L1s) and endogenous retrovirus (ERV) long terminal repeats (LTRs). Furthermore, we show that the H4K16ac-marked L1 and LTR subfamilies display enhancer-like functions and are enriched in genomic locations with chromatin features associated with active enhancers. Importantly, such regions often reside at boundaries of topologically associated domains and loop with genes. CRISPR-based epigenetic perturbation and genetic deletion of L1s reveal that H4K16ac-marked L1s and LTRs regulate the expression of genes in cis. Overall, TEs enriched with H4K16ac contribute to the cis-regulatory landscape at specific genomic locations by maintaining an active chromatin landscape at TEs.


Assuntos
Elementos de DNA Transponíveis , Retrovirus Endógenos , Animais , Humanos , Masculino , Elementos de DNA Transponíveis/genética , Cromatina/genética , Sequências Reguladoras de Ácido Nucleico/genética , Retrovirus Endógenos/genética , Genômica , Mamíferos/genética
14.
Clin Epigenetics ; 14(1): 62, 2022 05 14.
Artigo em Inglês | MEDLINE | ID: mdl-35568878

RESUMO

BACKGROUND: Sex differences are known to play a role in disease aetiology, progression and outcome. Previous studies have revealed autosomal epigenetic differences between males and females in some tissues, including differences in DNA methylation patterns. Here, we report for the first time an analysis of autosomal sex differences in DNAme using the Illumina EPIC array in human whole blood by performing a discovery (n = 1171) and validation (n = 2471) analysis. RESULTS: We identified and validated 396 sex-associated differentially methylated CpG sites (saDMPs) with the majority found to be female-biased CpGs (74%). These saDMP's are enriched in CpG islands and CpG shores and located preferentially at 5'UTRs, 3'UTRs and enhancers. Additionally, we identified 266 significant sex-associated differentially methylated regions overlapping genes, which have previously been shown to exhibit epigenetic sex differences, and novel genes. Transcription factor binding site enrichment revealed enrichment of transcription factors related to critical developmental processes and sex determination such as SRY and ESR1. CONCLUSION: Our study reports a reliable catalogue of sex-associated CpG sites and elucidates several characteristics of these sites using large-scale discovery and validation data sets. This resource will benefit future studies aiming to investigate sex specific epigenetic signatures and further our understanding of the role of DNA methylation in sex differences in human whole blood.


Assuntos
Metilação de DNA , Caracteres Sexuais , Ilhas de CpG , Epigênese Genética , Epigenômica , Feminino , Humanos , Masculino
15.
Neuro Oncol ; 24(8): 1273-1285, 2022 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-35213723

RESUMO

BACKGROUND: Epigenetic changes play a key role in the pathogenesis of medulloblastoma (MB), the most common malignant pediatric brain tumor. METHODS: We explore the therapeutic potential of BMI1 and MAPK/ERK inhibition in BMI1High;CHD7Low MB cells and in a preclinical xenograft model. RESULTS: We identify a synergistic vulnerability of BMI1High;CHD7Low MB cells to a combination treatment with BMI1 and MAPK/ERK inhibitors. Mechanistically, CHD7-dependent binding of BMI1 to MAPK-regulated genes underpins the CHD7-BMI1-MAPK regulatory axis responsible of the antitumour effect of the inhibitors in vitro and in a preclinical mouse model. Increased ERK1 and ERK2 phosphorylation activity is found in BMI1High;CHD7Low G4 MB patients, raising the possibility that they could be amenable to a similar therapy. CONCLUSIONS: The molecular dissection of the CHD7-BMI1-MAPK regulatory axis in BMI1High;CHD7Low MB identifies this signature as a proxy to predict MAPK functional activation, which can be effectively drugged in preclinical models, and paves the way for further exploration of combined BMI1 and MAPK targeting in G4 MB patients.


Assuntos
Neoplasias Encefálicas , Neoplasias Cerebelares , Meduloblastoma , Inibidores de Proteínas Quinases , Animais , Neoplasias Encefálicas/genética , Linhagem Celular Tumoral , Proliferação de Células , Neoplasias Cerebelares/genética , Humanos , Meduloblastoma/genética , Camundongos , Complexo Repressor Polycomb 1/antagonistas & inibidores , Complexo Repressor Polycomb 1/genética , Inibidores de Proteínas Quinases/farmacologia , Proteínas Proto-Oncogênicas/genética
16.
J Theor Biol ; 284(1): 82-91, 2011 Sep 07.
Artigo em Inglês | MEDLINE | ID: mdl-21723295

RESUMO

This paper compares the auto-repressed gene to a simple one (a gene without auto-regulation) in terms of response time and output noise under the assumption of fixed metabolic cost. The analysis shows that, in the case of non-vanishing leak expression rate, the negative feedback reduces both the switching on and switching off times of a gene. The noise of the auto-repressed gene will be lower than the one of the simple gene only for low leak expression rates. Summing up, for low, but non-vanishing leak expression rates, the auto-repressed gene is both faster and less noisier compared to the simple one.


Assuntos
Retroalimentação Fisiológica/fisiologia , Regulação da Expressão Gênica/fisiologia , Modelos Genéticos , Homeostase/genética , Biossíntese de Proteínas/genética , Proteínas/metabolismo , Tempo de Reação/genética
17.
Methods Mol Biol ; 2250: 219-238, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33900608

RESUMO

DNA methylation is a transgenerational stable epigenetic modification able to regulate gene expression and genome stability. The analysis of DNA methylation by genome-wide bisulfite sequencing become the main genomic approach to study epigenetics in many organisms; leading to standardization of the alignment and methylation call procedures. However, subsequent steps of the computational analysis should be tailored to the biological questions and the organisms used. Since most bioinformatics tools designed for epigenetic studies are built using mammalian models, they are potentially unsuitable for organisms with substantially different epigenetic regulation, such as plants. Therefore, in this chapter we propose a computational workflow for the analysis, visualization, and interpretation of data obtained from alignment of whole genome bisulfite sequencing of plant samples. Using almost exclusively the R working environment we will examine in depth how to tackle some plant-related issues during epigenetic analysis.


Assuntos
Biologia Computacional/métodos , Metilação de DNA , Genoma de Planta/genética , Genômica/métodos , Plantas/genética , Sequenciamento Completo do Genoma/métodos , Algoritmos , Mapeamento Cromossômico/métodos , Cromossomos de Plantas/genética , Citosina/química , Citosina/metabolismo , Epigênese Genética , Regulação da Expressão Gênica de Plantas , Software , Sulfitos/química
18.
Genome Biol ; 22(1): 308, 2021 11 08.
Artigo em Inglês | MEDLINE | ID: mdl-34749786

RESUMO

BACKGROUND: Enhancers are non-coding regions of the genome that control the activity of target genes. Recent efforts to identify active enhancers experimentally and in silico have proven effective. While these tools can predict the locations of enhancers with a high degree of accuracy, the mechanisms underpinning the activity of enhancers are often unclear. RESULTS: Using machine learning (ML) and a rule-based explainable artificial intelligence (XAI) model, we demonstrate that we can predict the location of known enhancers in Drosophila with a high degree of accuracy. Most importantly, we use the rules of the XAI model to provide insight into the underlying combinatorial histone modifications code of enhancers. In addition, we identified a large set of putative enhancers that display the same epigenetic signature as enhancers identified experimentally. These putative enhancers are enriched in nascent transcription, divergent transcription and have 3D contacts with promoters of transcribed genes. However, they display only intermediary enrichment of mediator and cohesin complexes compared to previously characterised active enhancers. We also found that 10-15% of the predicted enhancers display similar characteristics to super enhancers observed in other species. CONCLUSIONS: Here, we applied an explainable AI model to predict enhancers with high accuracy. Most importantly, we identified that different combinations of epigenetic marks characterise different groups of enhancers. Finally, we discovered a large set of putative enhancers which display similar characteristics with previously characterised active enhancers.


Assuntos
Inteligência Artificial , Drosophila melanogaster/genética , Elementos Facilitadores Genéticos , Epigênese Genética , Código das Histonas , Animais , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala , Aprendizado de Máquina , Anotação de Sequência Molecular , Regiões Promotoras Genéticas
19.
Comput Struct Biotechnol J ; 18: 3590-3605, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33304457

RESUMO

Transcription Factors (TFs) bind to DNA and control activity of target genes. Here, we present ChIPanalyser, a user-friendly, versatile and powerful R/Bioconductor package predicting and modelling the binding of TFs to DNA. ChIPanalyser performs similarly to state-of-the-art tools, but is an explainable model and provides biological insights into binding mechanisms of TFs. We focused on investigating the binding mechanisms of three TFs that are known architectural proteins CTCF, BEAF-32 and su(Hw) in three Drosophila cell lines (BG3, Kc167 and S2). While CTCF preferentially binds only to a subset of high affinity sites located mainly in open chromatin, BEAF-32 binds to most of its high affinity binding sites available in open chromatin. In contrast, su(Hw) binds to both open chromatin and also partially closed chromatin. Most importantly, differences in TF binding profiles between cell lines for these TFs are mainly driven by differences in DNA accessibility and not by differences in TF concentrations between cell lines. Finally, we investigated binding of Hox TFs in Drosophila and found that Ubx binds only in open chromatin, while Abd-B and Dfd are capable to bind in both open and partially closed chromatin. Overall, our results show that TFs display different binding mechanisms and that our model is able to recapitulate their specific binding behaviour.

20.
J Theor Biol ; 257(3): 419-29, 2009 Apr 07.
Artigo em Inglês | MEDLINE | ID: mdl-19121637

RESUMO

We present three models of how transcription factors (TFs) bind to their specific binding sites on the DNA: a model based on statistical physics, a Markov-chain model and a computational simulation. Comparison of these models suggests that the effect of non-specific binding can be significant. We also investigate possible mechanisms for cooperativity. The simulation model suggests that direct interactions between TFs are unlikely to be the main source of cooperativity between specific binding sites, because such interactions tend to lead to the formation of clusters on the DNA with undesirable side-effects.


Assuntos
Modelos Genéticos , Fatores de Transcrição/metabolismo , Sítios de Ligação , DNA/metabolismo , Humanos , Cadeias de Markov , Modelos Estatísticos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA