Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 113
Filtrar
1.
bioRxiv ; 2024 Mar 14.
Artigo em Inglês | MEDLINE | ID: mdl-38559186

RESUMO

Paired-class homeodomain transcription factors (HD TFs) play essential roles in vertebrate development, and their mutations are linked to human diseases. One unique feature of paired-class HD is cooperative dimerization on specific palindrome DNA sequences. Yet, the functional significance of HD cooperative dimerization in animal development and its dysregulation in diseases remain elusive. Using the retinal TF Cone-rod Homeobox (CRX) as a model, we have studied how blindness-causing mutations in the paired HD, p.E80A and p.K88N, alter CRX's cooperative dimerization, lead to gene misexpression and photoreceptor developmental deficits in dominant manners. CRXE80A maintains binding at monomeric WT CRX motifs but is deficient in cooperative binding at dimeric motifs. CRXE80A's cooperativity defect impacts the exponential increase of photoreceptor gene expression in terminal differentiation and produces immature, non-functional photoreceptors in the CrxE80A retinas. CRXK88N is highly cooperative and localizes to ectopic genomic sites with strong enrichment of dimeric HD motifs. CRXK88N's altered biochemical properties disrupt CRX's ability to direct dynamic chromatin remodeling during development to activate photoreceptor differentiation programs and silence progenitor programs. Our study here provides in vitro and in vivo molecular evidence that paired-class HD cooperative dimerization regulates neuronal development and dysregulation of cooperative binding contributes to severe dominant blinding retinopathies.

2.
Bioinformatics ; 39(6)2023 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-37294804

RESUMO

MOTIVATION: Motifs play a crucial role in computational biology, as they provide valuable information about the binding specificity of proteins. However, conventional motif discovery methods typically rely on simple combinatoric or probabilistic approaches, which can be biased by heuristics such as substring-masking for multiple motif discovery. In recent years, deep neural networks have become increasingly popular for motif discovery, as they are capable of capturing complex patterns in data. Nonetheless, inferring motifs from neural networks remains a challenging problem, both from a modeling and computational standpoint, despite the success of these networks in supervised learning tasks. RESULTS: We present a principled representation learning approach based on a hierarchical sparse representation for motif discovery. Our method effectively discovers gapped, long, or overlapping motifs that we show to commonly exist in next-generation sequencing datasets, in addition to the short and enriched primary binding sites. Our model is fully interpretable, fast, and capable of capturing motifs in a large number of DNA strings. A key concept emerged from our approach-enumerating at the image level-effectively overcomes the k-mers paradigm, enabling modest computational resources for capturing the long and varied but conserved patterns, in addition to capturing the primary binding sites. AVAILABILITY AND IMPLEMENTATION: Our method is available as a Julia package under the MIT license at https://github.com/kchu25/MOTIFs.jl, and the results on experimental data can be found at https://zenodo.org/record/7783033.


Assuntos
Proteínas , Software , Proteínas/química , Sítios de Ligação , Redes Neurais de Computação , DNA
3.
Nucleic Acids Res ; 51(11): 5364-5376, 2023 06 23.
Artigo em Inglês | MEDLINE | ID: mdl-36951113

RESUMO

The human genome contains about 800 C2H2 zinc finger proteins (ZFPs), and most of them are composed of long arrays of zinc fingers. Standard ZFP recognition model asserts longer finger arrays should recognize longer DNA-binding sites. However, recent experimental efforts to identify in vivo ZFP binding sites contradict this assumption, with many exhibiting short motifs. Here we use ZFY, CTCF, ZIM3, and ZNF343 as examples to address three closely related questions: What are the reasons that impede current motif discovery methods? What are the functions of those seemingly unused fingers and how can we improve the motif discovery algorithms based on long ZFPs' biophysical properties? Using ZFY, we employed a variety of methods and find evidence for 'dependent recognition' where downstream fingers can recognize some previously undiscovered motifs only in the presence of an intact core site. For CTCF, high-throughput measurements revealed its upstream specificity profile depends on the strength of its core. Moreover, the binding strength of the upstream site modulates CTCF's sensitivity to different epigenetic modifications within the core, providing new insight into how the previously identified intellectual disability-causing and cancer-related mutant R567W disrupts upstream recognition and deregulates the epigenetic control by CTCF. Our results establish that, because of irregular motif structures, variable spacing and dependent recognition between sub-motifs, the specificities of long ZFPs are significantly underestimated, so we developed an algorithm, ModeMap, to infer the motifs and recognition models of ZIM3 and ZNF343, which facilitates high-confidence identification of specific binding sites, including repeats-derived elements. With revised concept, technique, and algorithm, we can discover the overlooked specificities and functions of those 'extra' fingers, and therefore decipher their broader roles in human biology and diseases.


Assuntos
DNA , Fatores de Transcrição , Dedos de Zinco , Humanos , Sítios de Ligação , Fatores de Transcrição/química , Fatores de Transcrição/metabolismo , Algoritmos , Motivos de Nucleotídeos , Motivos de Aminoácidos , DNA/química , DNA/metabolismo
5.
Mol Biol Evol ; 38(7): 2854-2868, 2021 06 25.
Artigo em Inglês | MEDLINE | ID: mdl-33720298

RESUMO

Transcription factor-driven cell fate engineering in pluripotency induction, transdifferentiation, and forward reprogramming requires efficiency, speed, and maturity for widespread adoption and clinical translation. Here, we used Oct4, Sox2, Klf4, and c-Myc driven pluripotency reprogramming to evaluate methods for enhancing and tailoring cell fate transitions, through directed evolution with iterative screening of pooled mutant libraries and phenotypic selection. We identified an artificially evolved and enhanced POU factor (ePOU) that substantially outperforms wild-type Oct4 in terms of reprogramming speed and efficiency. In contrast to Oct4, not only can ePOU induce pluripotency with Sox2 alone, but it can also do so in the absence of Sox2 in a three-factor ePOU/Klf4/c-Myc cocktail. Biochemical assays combined with genome-wide analyses showed that ePOU possesses a new preference to dimerize on palindromic DNA elements. Yet, the moderate capacity of Oct4 to function as a pioneer factor, its preference to bind octamer DNA and its capability to dimerize with Sox2 and Sox17 proteins remain unchanged in ePOU. Compared with Oct4, ePOU is thermodynamically stabilized and persists longer in reprogramming cells. In consequence, ePOU: 1) differentially activates several genes hitherto not implicated in reprogramming, 2) reveals an unappreciated role of thyrotropin-releasing hormone signaling, and 3) binds a distinct class of retrotransposons. Collectively, these features enable ePOU to accelerate the establishment of the pluripotency network. This demonstrates that the phenotypic selection of novel factor variants from mammalian cells with desired properties is key to advancing cell fate conversions with artificially evolved biomolecules.


Assuntos
Técnicas de Reprogramação Celular , Evolução Molecular Direcionada , Fatores do Domínio POU/genética , Animais , Fator 4 Semelhante a Kruppel , Camundongos , Engenharia de Proteínas
6.
Commun Biol ; 3(1): 761, 2020 12 11.
Artigo em Inglês | MEDLINE | ID: mdl-33311538

RESUMO

Post-transcriptional autoregulation of gene expression is common in bacteria but many fewer examples are known in eukaryotes. We used the yeast collection of genes fused to GFP as a rapid screen for examples of feedback regulation in ribosomal proteins by overexpressing a non-regulatable version of a gene and observing the effects on the expression of the GFP-fused version. We tested 95 ribosomal protein genes and found a wide continuum of effects, with 30% showing at least a 3-fold reduction in expression. Two genes, RPS22B and RPL1B, showed over a 10-fold repression. In both cases the cis-regulatory segment resides in the 5' UTR of the gene as shown by placing that segment of the mRNA upstream of GFP alone and demonstrating it is sufficient to cause repression of GFP when the protein is over-expressed. Further analyses showed that the intron in the 5' UTR of RPS22B is required for regulation, presumably because the protein inhibits splicing that is necessary for translation. The 5' UTR of RPL1B contains a sequence and structure motif that is conserved in the binding sites of Rpl1 orthologs from bacteria to mammals, and mutations within the motif eliminate repression.


Assuntos
Retroalimentação Fisiológica , Proteínas Fúngicas/genética , Regulação Fúngica da Expressão Gênica , Proteínas Ribossômicas/genética , Sequência de Bases , Proteínas Fúngicas/química , Proteínas Fúngicas/metabolismo , Ordem dos Genes , Genes Reporter , Homeostase , Plasmídeos/genética , Proteínas Ribossômicas/química , Proteínas Ribossômicas/metabolismo
7.
G3 (Bethesda) ; 10(10): 3797-3810, 2020 10 05.
Artigo em Inglês | MEDLINE | ID: mdl-32817123

RESUMO

Genome-wide analysis of transcriptome data in Chlamydomonas reinhardtii shows periodic patterns in gene expression levels when cultures are grown under alternating light and dark cycles so that G1 of the cell cycle occurs in the light phase and S/M/G0 occurs during the dark phase. However, alternative splicing, a process that enables a greater protein diversity from a limited set of genes, remains largely unexplored by previous transcriptome based studies in C. reinhardtii In this study, we used existing longitudinal RNA-seq data obtained during the light-dark cycle to investigate the changes in the alternative splicing pattern and found that 3277 genes (19.75% of 17,746 genes) undergo alternative splicing. These splicing events include Alternative 5' (Alt 5'), Alternative 3' (Alt 3') and Exon skipping (ES) events that are referred as alternative site selection (ASS) events and Intron retention (IR) events. By clustering analysis, we identified a subset of events (26 ASS events and 10 IR events) that show periodic changes in the splicing pattern during the cell cycle. About two-thirds of these 36 genes either introduce a pre-termination codon (PTC) or introduce insertions or deletions into functional domains of the proteins, which implicate splicing in altering gene function. These findings suggest that alternative splicing is also regulated during the Chlamydomonas cell cycle, although not as extensively as changes in gene expression. The longitudinal changes in the alternative splicing pattern during the cell cycle captured by this study provides an important resource to investigate alternative splicing in genes of interest during the cell cycle in Chlamydomonas reinhardtii and other eukaryotes.


Assuntos
Processamento Alternativo , Chlamydomonas reinhardtii , Ciclo Celular/genética , Chlamydomonas reinhardtii/genética , Éxons , Íntrons
8.
Nat Rev Genet ; 21(11): 699-714, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-32665585

RESUMO

Despite enormous progress in understanding the fundamentals of bacterial gene regulation, our knowledge remains limited when compared with the number of bacterial genomes and regulatory systems to be discovered. Derived from a small number of initial studies, classic definitions for concepts of gene regulation have evolved as the number of characterized promoters has increased. Together with discoveries made using new technologies, this knowledge has led to revised generalizations and principles. In this Expert Recommendation, we suggest precise, updated definitions that support a logical, consistent conceptual framework of bacterial gene regulation, focusing on transcription initiation. The resulting concepts can be formalized by ontologies for computational modelling, laying the foundation for improved bioinformatics tools, knowledge-based resources and scientific communication. Thus, this work will help researchers construct better predictive models, with different formalisms, that will be useful in engineering, synthetic biology, microbiology and genetics.


Assuntos
Bactérias/genética , Regulação Bacteriana da Expressão Gênica , Iniciação da Transcrição Genética , Óperon , Regiões Promotoras Genéticas , Regulon , Fatores de Transcrição/fisiologia
9.
BMC Bioinformatics ; 19(1): 86, 2018 03 06.
Artigo em Inglês | MEDLINE | ID: mdl-29510689

RESUMO

BACKGROUND: Transcription factor (TF) binding site specificity is commonly represented by some form of matrix model in which the positions in the binding site are assumed to contribute independently to the site's activity. The independence assumption is known to be an approximation, often a good one but sometimes poor. Alternative approaches have been developed that use k-mers (DNA "words" of length k) to account for the non-independence, and more recently DNA structural parameters have been incorporated into the models. ChIP-seq data are often used to assess the discriminatory power of motifs and to compare different models. However, to measure the improvement due to using more complex models, one must compare to optimized matrix models. RESULTS: We describe a program "Discriminative Additive Model Optimization" (DAMO) that uses positive and negative examples, as in ChIP-seq data, and finds the additive position weight matrix (PWM) that maximizes the Area Under the Receiver Operating Characteristic Curve (AUROC). We compare to a recent study where structural parameters, serving as features in a gradient boosting classifier algorithm, are shown to improve the AUROC over JASPAR position frequency matrices (PFMs). In agreement with the previous results, we find that adding structural parameters gives the largest improvement, but most of the gain can be obtained by an optimized PWM and nearly all of the gain can be obtained with a di-nucleotide extension to the PWM. CONCLUSION: To appropriately compare different models for TF bind sites, optimized models must be used. PWMs and their extensions are good representations of binding specificity for most TFs, and more complex models, including the incorporation of DNA shape features and gradient boosting classifiers, provide only moderate improvements for a few TFs.


Assuntos
Algoritmos , DNA/química , Modelos Moleculares , Motivos de Nucleotídeos/genética , Matrizes de Pontuação de Posição Específica , Área Sob a Curva , Sítios de Ligação , Bases de Dados de Ácidos Nucleicos , Humanos , Ligação Proteica
10.
BMC Mol Biol ; 19(1): 5, 2018 03 27.
Artigo em Inglês | MEDLINE | ID: mdl-29587652

RESUMO

BACKGROUND: BATF family transcription factors (BATF, BATF2 and BATF3) form hetero-trimers with JUNB and either IRF4 or IRF8 to regulate cell fate in T cells and dendritic cells in vivo. While each combination of the hetero-trimer has a distinct role, some degree of cross-compensation was observed. The basis for the differential actions of IRF4 and IRF8 with BATF factors and JUNB is still unknown. We propose that the differences in function between these hetero-trimers may be caused by differences in their DNA binding preferences. While all three BATF family transcription factors have similar binding preferences when binding as a hetero-dimer with JUNB, the cooperative binding of IRF4 or IRF8 to the hetero-dimer/DNA complex could change the preferences. We used Spec-seq, which allows for the efficient and accurate determination of relative affinity to a large collection of sequences in parallel, to find differences between cooperative DNA binding of IRF4, IRF8 and BATF family members. RESULTS: We found that without IRF binding, all three hetero-dimer pairs exhibit nearly the same binding preferences to both expected wildtype binding sites TRE (TGA(C/G)TCA) and CRE (TGACGTCA). IRF4 and IRF8 show the very similar DNA binding preferences when binding with any of the three hetero-dimers. No major change of binding preferences was found in the half-sites between different hetero-trimers. IRF proteins bind with substantially lower affinity with either a single nucleotide spacer between IRF and BATF binding site or with an alternative mode of binding in the opposite orientation. In addition, the preference to CRE binding site was reduced with either IRF binding in all BATF-JUNB combinations. CONCLUSIONS: The specificities of BATF, BATF2 and BATF3 are all very similar as are their interactions with IRF4 and IRF8. IRF proteins binding adjacent to BATF sites increases affinity substantially compared to sequences with spacings between the sites, indicating cooperative binding through protein-protein interactions. The preference for the type of BATF binding site, TRE or CRE, is also altered when IRF proteins bind. These in vitro preferences aid in the understanding of in vivo binding activities.


Assuntos
Fatores de Transcrição de Zíper de Leucina Básica/metabolismo , Fatores Reguladores de Interferon/genética , Análise de Sequência de DNA/métodos , Fatores de Transcrição/genética , Animais , Fatores de Transcrição de Zíper de Leucina Básica/química , Fatores de Transcrição de Zíper de Leucina Básica/genética , Sítios de Ligação , Humanos , Fatores Reguladores de Interferon/química , Fatores Reguladores de Interferon/metabolismo , Camundongos , Multimerização Proteica , Proteínas Repressoras/química , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , Fatores de Transcrição/química , Fatores de Transcrição/metabolismo , Proteínas Supressoras de Tumor/química , Proteínas Supressoras de Tumor/genética , Proteínas Supressoras de Tumor/metabolismo
11.
Sci Adv ; 3(11): eaao1799, 2017 11.
Artigo em Inglês | MEDLINE | ID: mdl-29159284

RESUMO

Methylation of CpG (cytosine-phosphate-guanine) dinucleotides is a common epigenetic mark that influences gene expression. The effects of methylation on transcription factor (TF) binding are unknown for most TFs and, even when known, such knowledge is often only qualitative. In reality, methylation sensitivity is a quantitative effect, just as changes to the DNA sequence have quantitative effects on TF binding affinity. We describe Methyl-Spec-seq, an easy-to-use method that measures the effects of CpG methylation (mCPG) on binding affinity for hundreds to thousands of variants in parallel, allowing one to quantitatively assess the effects at every position in a binding site. We demonstrate its use on several important DNA binding proteins. We calibrate the accuracy of Methyl-Spec-seq using a novel two-color competitive fluorescence anisotropy method that can accurately determine the relative affinities of two sequences in solution. We also present software that extends standard methods for representing, visualizing, and searching for matches to binding site motifs to include the effects of methylation. These tools facilitate the study of the consequences for gene regulation of epigenetic marks on DNA.


Assuntos
Metilação de DNA , DNA/metabolismo , Fatores de Transcrição/metabolismo , Animais , Fator de Ligação a CCCTC/química , Fator de Ligação a CCCTC/metabolismo , Ilhas de CpG , DNA/química , Polarização de Fluorescência , Proteínas de Homeodomínio/química , Proteínas de Homeodomínio/metabolismo , Camundongos , Receptor Patched-1/química , Receptor Patched-1/metabolismo , Ligação Proteica , Proteínas Recombinantes/biossíntese , Proteínas Recombinantes/química , Proteínas Recombinantes/isolamento & purificação , Proteínas Repressoras/química , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , Fatores de Transcrição/química
12.
J Mol Biol ; 429(23): 3626-3634, 2017 11 24.
Artigo em Inglês | MEDLINE | ID: mdl-29050852

RESUMO

Sox2 and Pax6 co-regulate genes in neural lineages and the lens by forming a ternary complex likely facilitated allosterically through DNA. We used the quantitative and scalable cooperativity-by-sequencing (Coop-seq) approach to interrogate Sox2/Pax6 dimerization on a DNA library where five positions of the Pax6 half-site were randomized yielding 1024 cooperativity factors. Consensus positions normally required for the high-affinity DNA binding by Pax6 need to be mutated for effective dimerization with Sox2. Out of the five randomized bases, a 5' thymidine is present in most of the top ranking elements. However, this thymidine maps to a region outside of the Pax half site and is not expected to directly interact with Pax6 in known binding modes suggesting structural reconfigurations. Re-analysis of ChIP-seq data identified several genomic regions where the cooperativity promoting sequence pattern is co-bound by Sox2 and Pax6. A highly conserved Sox2/Pax6 bound site near the Sprouty2 locus was verified to promote cooperative dimerization designating Sprouty2 as a potential target reliant on Sox2/Pax6 cooperativity in several neural cell types. Collectively, the functional interplay of Sox2 and Pax6 demands the relaxation of high-affinity binding sites and is enabled by alternative DNA sequences. We conclude that this binding mode evolved to warrant that a subset of target genes is only regulated in the presence of suitable partner factors.


Assuntos
DNA/metabolismo , Fator de Transcrição PAX6/metabolismo , Fatores de Transcrição SOXB1/metabolismo , Análise de Sequência de DNA/métodos , DNA/química , DNA/genética , Humanos , Modelos Moleculares , Fator de Transcrição PAX6/química , Fator de Transcrição PAX6/genética , Ligação Proteica , Conformação Proteica , Multimerização Proteica , Fatores de Transcrição SOXB1/química , Fatores de Transcrição SOXB1/genética
13.
PLoS Comput Biol ; 13(7): e1005638, 2017 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-28686588

RESUMO

The specificities of transcription factors are most commonly represented with probabilistic models. These models provide a probability for each base occurring at each position within the binding site and the positions are assumed to contribute independently. The model is simple and intuitive and is the basis for many motif discovery algorithms. However, the model also has inherent limitations that prevent it from accurately representing true binding probabilities, especially for the highest affinity sites under conditions of high protein concentration. The limitations are not due to the assumption of independence between positions but rather are caused by the non-linear relationship between binding affinity and binding probability and the fact that independent normalization at each position skews the site probabilities. Generally probabilistic models are reasonably good approximations, but new high-throughput methods allow for biophysical models with increased accuracy that should be used whenever possible.


Assuntos
DNA/química , DNA/metabolismo , Modelos Estatísticos , Fatores de Transcrição/química , Fatores de Transcrição/metabolismo , Biologia Computacional , Simulação por Computador , Software
14.
Nucleic Acids Res ; 45(14): 8199-8207, 2017 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-28510715

RESUMO

The quantitative specificity of the STAT1 transcription factor was determined by measuring the relative affinity to hundreds of variants of the consensus binding site including variations in the length of the site. The known consensus sequence is observed to have the highest affinity, with all variants decreasing binding affinity considerably. There is very little loss of binding affinity when the CpG within the consensus binding site is methylated. Additionally, the specificity of mutant proteins, with variants of amino acids that interact with the DNA, was determined and nearly all of them are observed to lose specificity across the entire binding site. The change of Asn at position 460 to His, which corresponds to the natural amino acid at the homologous position in STAT6, does not change the specificity nor does it change the length preference to match that of STAT6. These results provide the first quantitative analysis of changes in binding affinity for the STAT1 protein, and several variants of it, to hundreds of different binding sites including different spacer lengths, and the effect of CpG methylation.


Assuntos
Ilhas de CpG/genética , DNA/genética , Variação Genética , Fator de Transcrição STAT1/genética , Algoritmos , Sequência de Aminoácidos , Sequência de Bases , Sítios de Ligação/genética , Ligação Competitiva , DNA/metabolismo , Metilação de DNA , Eletroforese em Gel de Poliacrilamida , Cinética , Mutação de Sentido Incorreto , Ligação Proteica , Fator de Transcrição STAT1/metabolismo , Fator de Transcrição STAT6/genética , Fator de Transcrição STAT6/metabolismo , Homologia de Sequência de Aminoácidos
15.
Bioinformatics ; 33(15): 2288-2295, 2017 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-28379348

RESUMO

MOTIVATION: Characterizing the binding specificities of transcription factors (TFs) is crucial to the study of gene expression regulation. Recently developed high-throughput experimental methods, including protein binding microarrays (PBM) and high-throughput SELEX (HT-SELEX), have enabled rapid measurements of the specificities for hundreds of TFs. However, few studies have developed efficient algorithms for estimating binding motifs based on HT-SELEX data. Also the simple method of constructing a position weight matrix (PWM) by comparing the frequency of the preferred sequence with single-nucleotide variants has the risk of generating motifs with higher information content than the true binding specificity. RESULTS: We developed an algorithm called BEESEM that builds on a comprehensive biophysical model of protein-DNA interactions, which is trained using the expectation maximization method. BEESEM is capable of selecting the optimal motif length and calculating the confidence intervals of estimated parameters. By comparing BEESEM with the published motifs estimated using the same HT-SELEX data, we demonstrate that BEESEM provides significant improvements. We also evaluate several motif discovery algorithms on independent PBM and ChIP-seq data. BEESEM provides significantly better fits to in vitro data, but its performance is similar to some other methods on in vivo data under the criterion of the area under the receiver operating characteristic curve (AUROC). This highlights the limitations of the purely rank-based AUROC criterion. Using quantitative binding data to assess models, however, demonstrates that BEESEM improves on prior models. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://stormo.wustl.edu/resources.html . CONTACT: stormo@wustl.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Imunoprecipitação da Cromatina/métodos , DNA/metabolismo , Análise Serial de Proteínas/métodos , Software , Termodinâmica , Fatores de Transcrição/metabolismo , Algoritmos , Animais , Sítios de Ligação , DNA/química , Humanos , Camundongos , Matrizes de Pontuação de Posição Específica , Ligação Proteica , Análise de Sequência de DNA/métodos , Fatores de Transcrição/química
16.
Nucleic Acids Res ; 45(2): 832-845, 2017 01 25.
Artigo em Inglês | MEDLINE | ID: mdl-27915232

RESUMO

Cooperative binding of transcription factors is known to be important in the regulation of gene expression programs conferring cellular identities. However, current methods to measure cooperativity parameters have been laborious and therefore limited to studying only a few sequence variants at a time. We developed Coop-seq (cooperativity by sequencing) that is capable of efficiently and accurately determining the cooperativity parameters for hundreds of different DNA sequences in a single experiment. We apply Coop-seq to 12 dimer pairs from the Sox and POU families of transcription factors using 324 unique sequences with changed half-site orientation, altered spacing and discrete randomization within the binding elements. The study reveals specific dimerization profiles of different Sox factors with Oct4. By contrast, Oct4 and the three neural class III POU factors Brn2, Brn4 and Oct6 assemble with Sox2 in a surprisingly indistinguishable manner. Two novel half-site configurations can support functional Sox/Oct dimerization in addition to known composite motifs. Moreover, Coop-seq uncovers a nucleotide switch within the POU half-site when spacing is altered, which is mirrored in genomic loci bound by Sox2/Oct4 complexes.


Assuntos
Fatores do Domínio POU/metabolismo , Fatores de Transcrição SOX/metabolismo , Animais , DNA/química , DNA/metabolismo , Camundongos , Modelos Moleculares , Fator 3 de Transcrição de Octâmero/química , Fator 3 de Transcrição de Octâmero/metabolismo , Fatores do Domínio POU/química , Ligação Proteica , Conformação Proteica , Multimerização Proteica , Fatores de Transcrição SOX/química , Fatores de Transcrição SOXB1/química , Fatores de Transcrição SOXB1/metabolismo
17.
Cell Syst ; 3(3): 216-218, 2016 Sep 28.
Artigo em Inglês | MEDLINE | ID: mdl-27684185

RESUMO

Incorporating information about DNA structure can increase the reliability of predictions of transcription factor binding sites.


Assuntos
DNA/química , Sítios de Ligação , Ligação Proteica , Reprodutibilidade dos Testes , Fatores de Transcrição
18.
G3 (Bethesda) ; 6(3): 653-67, 2016 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-26772747

RESUMO

Transcriptional control of gene expression requires interactions between the cis-regulatory elements (CREs) controlling gene promoters. We developed a sensitive computational method to identify CRE combinations with conserved spacing that does not require genome alignments. When applied to seven sensu stricto and sensu lato Saccharomyces species, 80% of the predicted interactions displayed some evidence of combinatorial transcriptional behavior in several existing datasets including: (1) chromatin immunoprecipitation data for colocalization of transcription factors, (2) gene expression data for coexpression of predicted regulatory targets, and (3) gene ontology databases for common pathway membership of predicted regulatory targets. We tested several predicted CRE interactions with chromatin immunoprecipitation experiments in a wild-type strain and strains in which a predicted cofactor was deleted. Our experiments confirmed that transcription factor (TF) occupancy at the promoters of the CRE combination target genes depends on the predicted cofactor while occupancy of other promoters is independent of the predicted cofactor. Our method has the additional advantage of identifying regulatory differences between species. By analyzing the S. cerevisiae and S. bayanus genomes, we identified differences in combinatorial cis-regulation between the species and showed that the predicted changes in gene regulation explain several of the species-specific differences seen in gene expression datasets. In some instances, the same CRE combinations appear to regulate genes involved in distinct biological processes in the two different species. The results of this research demonstrate that (1) combinatorial cis-regulation can be inferred by multi-genome analysis and (2) combinatorial cis-regulation can explain differences in gene expression between species.


Assuntos
Regulação Fúngica da Expressão Gênica , Sequências Reguladoras de Ácido Nucleico , Saccharomyces/genética , Imunoprecipitação da Cromatina , Análise por Conglomerados , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Sequenciamento de Nucleotídeos em Larga Escala , Saccharomyces/metabolismo , Transdução de Sinais , Fatores de Transcrição/metabolismo , Transcrição Gênica , Transcriptoma
19.
Curr Protoc Bioinformatics ; 51: 2.15.1-2.15.6, 2015 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-26334922

RESUMO

Transcription factors (TFs) recognize and bind to specific DNA sequences. The specificity of a TF is usually represented as a position weight matrix (PWM). Several databases of DNA motifs exist and are used in biological research to address important biological questions. This overview describes PWMs and some of the most commonly used motif databases, as well as a few of their common applications.


Assuntos
DNA/genética , Mineração de Dados/métodos , Bases de Dados de Ácidos Nucleicos , Motivos de Nucleotídeos/genética , Análise de Sequência de DNA/métodos , Fatores de Transcrição/genética , Sítios de Ligação , DNA/química , Bases de Dados de Proteínas , Dados de Sequência Molecular , Ligação Proteica , Fatores de Transcrição/química
20.
J Biol Chem ; 290(32): 19756-69, 2015 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-26088140

RESUMO

Combinatorial gene regulation through feed-forward loops (FFLs) can bestow specificity and temporal control to client gene expression; however, characteristics of binding sites that mediate these effects are not established. We previously showed that the glucocorticoid receptor (GR) and KLF15 form coherent FFLs that cooperatively induce targets such as the amino acid-metabolizing enzymes AASS and PRODH and incoherent FFLs exemplified by repression of MT2A by KLF15. Here, we demonstrate that GR and KLF15 physically interact and identify low affinity GR binding sites within glucocorticoid response elements (GREs) for PRODH and AASS that contribute to combinatorial regulation with KLF15. We used deep sequencing and electrophoretic mobility shift assays to derive in vitro GR binding affinities across sequence space. We applied these data to show that AASS GRE activity correlated (r(2) = 0.73) with predicted GR binding affinities across a 50-fold affinity range in transfection assays; however, the slope of the linear relationship more than doubled when KLF15 was expressed. Whereas activity of the MT2A GRE was even more strongly (r(2) = 0.89) correlated with GR binding site affinity, the slope of the linear relationship was sharply reduced by KLF15, consistent with incoherent FFL logic. Thus, GRE architecture and co-regulator expression together determine the functional parameters that relate GR binding site affinity to hormone-induced transcriptional responses. Utilization of specific affinity response functions and GR binding sites by FFLs may contribute to the diversity of gene expression patterns within GR-regulated transcriptomes.


Assuntos
Fatores de Transcrição Kruppel-Like/metabolismo , Proteínas Nucleares/metabolismo , Prolina Oxidase/metabolismo , Receptores de Glucocorticoides/metabolismo , Elementos de Resposta , Sacaropina Desidrogenases/metabolismo , Transcrição Gênica , Animais , Sequência de Bases , Sítios de Ligação , Brônquios/citologia , Brônquios/efeitos dos fármacos , Brônquios/metabolismo , Linhagem Celular , Dexametasona/farmacologia , Ensaio de Desvio de Mobilidade Eletroforética , Células Epiteliais/citologia , Células Epiteliais/efeitos dos fármacos , Células Epiteliais/metabolismo , Fibroblastos/citologia , Fibroblastos/efeitos dos fármacos , Fibroblastos/metabolismo , Regulação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Fatores de Transcrição Kruppel-Like/química , Fatores de Transcrição Kruppel-Like/genética , Camundongos , Dados de Sequência Molecular , Proteínas Nucleares/química , Proteínas Nucleares/genética , Prolina Oxidase/química , Prolina Oxidase/genética , Regiões Promotoras Genéticas , Ligação Proteica , Receptores de Glucocorticoides/química , Receptores de Glucocorticoides/genética , Sacaropina Desidrogenases/química , Sacaropina Desidrogenases/genética , Transdução de Sinais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...