Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
1.
Genes Dev ; 36(3-4): 225-240, 2022 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-35144965

RESUMEN

The BEN domain is a recently recognized DNA binding module that is present in diverse metazoans and certain viruses. Several BEN domain factors are known as transcriptional repressors, but, overall, relatively little is known of how BEN factors identify their targets in humans. In particular, X-ray structures of BEN domain:DNA complexes are only known for Drosophila factors bearing a single BEN domain, which lack direct vertebrate orthologs. Here, we characterize several mammalian BEN domain (BD) factors, including from two NACC family BTB-BEN proteins and from BEND3, which has four BDs. In vitro selection data revealed sequence-specific binding activities of isolated BEN domains from all of these factors. We conducted detailed functional, genomic, and structural studies of BEND3. We show that BD4 is a major determinant for in vivo association and repression of endogenous BEND3 targets. We obtained a high-resolution structure of BEND3-BD4 bound to its preferred binding site, which reveals how BEND3 identifies cognate DNA targets and shows differences with one of its non-DNA-binding BEN domains (BD1). Finally, comparison with our previous invertebrate BEN structures, along with additional structural predictions using AlphaFold2 and RoseTTAFold, reveal distinct strategies for target DNA recognition by different types of BEN domain proteins. Together, these studies expand the DNA recognition activities of BEN factors and provide structural insights into sequence-specific DNA binding by mammalian BEN proteins.


Asunto(s)
Proteínas Represoras , Factores de Transcripción , Animales , Sitios de Unión , Drosophila/metabolismo , Mamíferos , Unión Proteica , Dominios Proteicos , Proteínas Represoras/genética , Factores de Transcripción/metabolismo
2.
Genome Res ; 2023 Nov 09.
Artículo en Inglés | MEDLINE | ID: mdl-37945377

RESUMEN

Mammalian mRNA and lncRNA exons are often small compared to introns. The exon definition model predicts that exons splice autonomously, dependent on proximal exon sequence features, explaining their delineation within large introns. This model has not been examined on a genome-wide scale, however, leaving open the question of how often mRNA and lncRNA exons are autonomous. It is also unknown how frequently such exons can arise by chance. Here, we directly assayed large fragments (500-1000 bp) of the human genome by exon trapping, which detects exons spliced into a heterologous transgene, here designed with a large intron context. We define the trapped exons as "autonomous." We obtained ∼1.25 million trapped exons, including most known mRNA and well-annotated lncRNA internal exons, demonstrating that human exons are predominantly autonomous. mRNA exons are trapped with the highest efficiency. Nearly a million of the trapped exons are unannotated, most located in intergenic regions and antisense to mRNA, with depletion from the forward strand of introns. These exons are not conserved, suggesting they are nonfunctional and arose from random mutations. They are nonetheless highly enriched with known splicing promoting sequence features that delineate known exons. Novel autonomous exons are more numerous than annotated lncRNA exons, and computational models also indicate they will occur with similar frequency in any randomly generated sequence. These results show that most human coding exons splice autonomously, and provide an explanation for the existence of many unconserved lncRNAs, as well as a new annotation and inclusion levels of spliceable loci in the human genome.

3.
Mol Biol Evol ; 38(5): 2070-2075, 2021 05 04.
Artículo en Inglés | MEDLINE | ID: mdl-33561217

RESUMEN

The human transcription factor (TF) CGGBP1 (CGG-binding protein) is conserved only in amniotes and is believed to derive from the zf-BED and Hermes transposase DNA-binding domains (DBDs) of a hAT DNA transposon. Here, we show that sequence-specific DNA-binding proteins with this bipartite domain structure have resulted from dozens of independent hAT domestications in different eukaryotic lineages. CGGBPs display a wide range of sequence specificity, usually including preferences for CGG or CGC trinucleotides, whereas some bind AT-rich motifs. The CGGBPs are almost entirely nonsyntenic, and their protein sequences, DNA-binding motifs, and patterns of presence or absence in genomes are uncharacteristic of ancestry via speciation. At least eight CGGBPs in the coelacanth Latimeria chalumnae bind distinct motifs, and the expression of the corresponding genes varies considerably across tissues, suggesting tissue-restricted function.


Asunto(s)
Elementos Transponibles de ADN , Proteínas de Unión al ADN/genética , Peces/genética , Animales , Proteínas de Unión al ADN/metabolismo , Peces/metabolismo , Humanos
4.
Biochem J ; 477(7): 1345-1362, 2020 04 17.
Artículo en Inglés | MEDLINE | ID: mdl-32207815

RESUMEN

We report the identification and characterization of a bacteriophage λ-encoded protein, NinH. Sequence homology suggests similarity between NinH and Fis, a bacterial nucleoid-associated protein (NAP) involved in numerous DNA topology manipulations, including chromosome condensation, transcriptional regulation and phage site-specific recombination. We find that NinH functions as a homodimer and is able to bind and bend double-stranded DNA in vitro. Furthermore, NinH shows a preference for a 15 bp signature sequence related to the degenerate consensus favored by Fis. Structural studies reinforced the proposed similarity to Fis and supported the identification of residues involved in DNA binding which were demonstrated experimentally. Overexpression of NinH proved toxic and this correlated with its capacity to associate with DNA. NinH is the first example of a phage-encoded Fis-like NAP that likely influences phage excision-integration reactions or bacterial gene expression.


Asunto(s)
Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Bacteriófago lambda/genética , Bacteriófago lambda/metabolismo , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismo , Proteínas Virales/genética , Proteínas Virales/metabolismo , Proteínas Bacterianas/química , Secuencia de Bases , Sitios de Unión , Simulación por Computador , ADN/metabolismo , ADN Viral/metabolismo , Proteínas de Unión al ADN/química , Escherichia coli/genética , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Factor Proteico para Inverción de Estimulación/química , Factor Proteico para Inverción de Estimulación/genética , Expresión Génica , Proteínas Mutantes/metabolismo , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Multimerización de Proteína/genética , Proteínas Virales/química
5.
J Biol Chem ; 293(9): 3218-3233, 2018 03 02.
Artículo en Inglés | MEDLINE | ID: mdl-29217772

RESUMEN

Plant nucleotide-binding leucine-rich repeat (NLR) proteins enable the immune system to recognize and respond to pathogen attack. An early consequence of immune activation is transcriptional reprogramming, and some NLRs have been shown to act in the nucleus and interact with transcription factors. The Rx1 NLR protein of potato is further able to bind and distort double-stranded DNA. However, Rx1 host targets that support a role for Rx1 in transcriptional reprogramming at DNA are unknown. Here, we report a functional interaction between Rx1 and NbGlk1, a Golden2-like transcription factor. Rx1 binds to NbGlk1 in vitro and in planta. NbGlk1 binds to known Golden2-like consensus DNA sequences. Rx1 reduces the binding affinity of NbGlk1 for DNA in vitro. NbGlk1 activates cellular responses to potato virus X, whereas Rx1 associates with NbGlk1 and prevents its assembly on DNA in planta unless activated by PVX. This study provides new mechanistic insight into how an NLR can coordinate an immune signaling response at DNA following pathogen perceptions.


Asunto(s)
ADN/metabolismo , Espacio Intracelular/metabolismo , Proteínas NLR/metabolismo , Proteínas de Plantas/metabolismo , Factores de Transcripción/metabolismo , Proteínas NLR/química , Proteínas de Plantas/química , Unión Proteica , Dominios Proteicos , Nicotiana
6.
Dev Cell ; 58(19): 1898-1916.e9, 2023 Oct 09.
Artículo en Inglés | MEDLINE | ID: mdl-37557175

RESUMEN

Chromatin accessibility is integral to the process by which transcription factors (TFs) read out cis-regulatory DNA sequences, but it is difficult to differentiate between TFs that drive accessibility and those that do not. Deep learning models that learn complex sequence rules provide an unprecedented opportunity to dissect this problem. Using zygotic genome activation in Drosophila as a model, we analyzed high-resolution TF binding and chromatin accessibility data with interpretable deep learning and performed genetic validation experiments. We identify a hierarchical relationship between the pioneer TF Zelda and the TFs involved in axis patterning. Zelda consistently pioneers chromatin accessibility proportional to motif affinity, whereas patterning TFs augment chromatin accessibility in sequence contexts where they mediate enhancer activation. We conclude that chromatin accessibility occurs in two tiers: one through pioneering, which makes enhancers accessible but not necessarily active, and the second when the correct combination of TFs leads to enhancer activation.

7.
Nat Genet ; 51(6): 981-989, 2019 06.
Artículo en Inglés | MEDLINE | ID: mdl-31133749

RESUMEN

Transcription factor (TF) binding specificities (motifs) are essential for the analysis of gene regulation. Accurate prediction of TF motifs is critical, because it is infeasible to assay all TFs in all sequenced eukaryotic genomes. There is ongoing controversy regarding the degree of motif diversification among related species that is, in part, because of uncertainty in motif prediction methods. Here we describe similarity regression, a significantly improved method for predicting motifs, which we use to update and expand the Cis-BP database. Similarity regression inherently quantifies TF motif evolution, and shows that previous claims of near-complete conservation of motifs between human and Drosophila are inflated, with nearly half of the motifs in each species absent from the other, largely due to extensive divergence in C2H2 zinc finger proteins. We conclude that diversification in DNA-binding motifs is pervasive, and present a new tool and updated resource to study TF diversity and gene regulation across eukaryotes.


Asunto(s)
Secuencia de Bases , Sitios de Unión , Evolución Molecular , Factores de Transcripción/metabolismo , Animales , Biología Computacional/métodos , Secuencia Conservada , Bases de Datos Genéticas , Regulación de la Expresión Génica , Humanos , Motivos de Nucleótidos , Unión Proteica
8.
Elife ; 42015 Apr 23.
Artículo en Inglés | MEDLINE | ID: mdl-25905672

RESUMEN

Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs.


Asunto(s)
Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , ADN de Helmintos/genética , Factores de Transcripción/genética , Dedos de Zinc/genética , Secuencia de Aminoácidos , Animales , Secuencia de Bases , Sitios de Unión , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/química , Proteínas de Caenorhabditis elegans/metabolismo , ADN de Helmintos/química , ADN de Helmintos/metabolismo , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Datos de Secuencia Molecular , Regiones Promotoras Genéticas , Unión Proteica , Dominios y Motivos de Interacción de Proteínas , Receptores Citoplasmáticos y Nucleares , Factores de Transcripción/química , Factores de Transcripción/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA