Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Genome Res ; 2023 Nov 09.
Artigo em Inglês | MEDLINE | ID: mdl-37945377

RESUMO

Mammalian mRNA and lncRNA exons are often small compared to introns. The exon definition model predicts that exons splice autonomously, dependent on proximal exon sequence features, explaining their delineation within large introns. This model has not been examined on a genome-wide scale, however, leaving open the question of how often mRNA and lncRNA exons are autonomous. It is also unknown how frequently such exons can arise by chance. Here, we directly assayed large fragments (500-1000 bp) of the human genome by exon trapping, which detects exons spliced into a heterologous transgene, here designed with a large intron context. We define the trapped exons as "autonomous." We obtained ∼1.25 million trapped exons, including most known mRNA and well-annotated lncRNA internal exons, demonstrating that human exons are predominantly autonomous. mRNA exons are trapped with the highest efficiency. Nearly a million of the trapped exons are unannotated, most located in intergenic regions and antisense to mRNA, with depletion from the forward strand of introns. These exons are not conserved, suggesting they are nonfunctional and arose from random mutations. They are nonetheless highly enriched with known splicing promoting sequence features that delineate known exons. Novel autonomous exons are more numerous than annotated lncRNA exons, and computational models also indicate they will occur with similar frequency in any randomly generated sequence. These results show that most human coding exons splice autonomously, and provide an explanation for the existence of many unconserved lncRNAs, as well as a new annotation and inclusion levels of spliceable loci in the human genome.

2.
Dev Cell ; 58(19): 1898-1916.e9, 2023 Oct 09.
Artigo em Inglês | MEDLINE | ID: mdl-37557175

RESUMO

Chromatin accessibility is integral to the process by which transcription factors (TFs) read out cis-regulatory DNA sequences, but it is difficult to differentiate between TFs that drive accessibility and those that do not. Deep learning models that learn complex sequence rules provide an unprecedented opportunity to dissect this problem. Using zygotic genome activation in Drosophila as a model, we analyzed high-resolution TF binding and chromatin accessibility data with interpretable deep learning and performed genetic validation experiments. We identify a hierarchical relationship between the pioneer TF Zelda and the TFs involved in axis patterning. Zelda consistently pioneers chromatin accessibility proportional to motif affinity, whereas patterning TFs augment chromatin accessibility in sequence contexts where they mediate enhancer activation. We conclude that chromatin accessibility occurs in two tiers: one through pioneering, which makes enhancers accessible but not necessarily active, and the second when the correct combination of TFs leads to enhancer activation.

3.
Genes Dev ; 36(3-4): 225-240, 2022 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-35144965

RESUMO

The BEN domain is a recently recognized DNA binding module that is present in diverse metazoans and certain viruses. Several BEN domain factors are known as transcriptional repressors, but, overall, relatively little is known of how BEN factors identify their targets in humans. In particular, X-ray structures of BEN domain:DNA complexes are only known for Drosophila factors bearing a single BEN domain, which lack direct vertebrate orthologs. Here, we characterize several mammalian BEN domain (BD) factors, including from two NACC family BTB-BEN proteins and from BEND3, which has four BDs. In vitro selection data revealed sequence-specific binding activities of isolated BEN domains from all of these factors. We conducted detailed functional, genomic, and structural studies of BEND3. We show that BD4 is a major determinant for in vivo association and repression of endogenous BEND3 targets. We obtained a high-resolution structure of BEND3-BD4 bound to its preferred binding site, which reveals how BEND3 identifies cognate DNA targets and shows differences with one of its non-DNA-binding BEN domains (BD1). Finally, comparison with our previous invertebrate BEN structures, along with additional structural predictions using AlphaFold2 and RoseTTAFold, reveal distinct strategies for target DNA recognition by different types of BEN domain proteins. Together, these studies expand the DNA recognition activities of BEN factors and provide structural insights into sequence-specific DNA binding by mammalian BEN proteins.


Assuntos
Proteínas Repressoras , Fatores de Transcrição , Animais , Sítios de Ligação , Drosophila/metabolismo , Mamíferos , Ligação Proteica , Domínios Proteicos , Proteínas Repressoras/genética , Fatores de Transcrição/metabolismo
4.
Mol Biol Evol ; 38(5): 2070-2075, 2021 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-33561217

RESUMO

The human transcription factor (TF) CGGBP1 (CGG-binding protein) is conserved only in amniotes and is believed to derive from the zf-BED and Hermes transposase DNA-binding domains (DBDs) of a hAT DNA transposon. Here, we show that sequence-specific DNA-binding proteins with this bipartite domain structure have resulted from dozens of independent hAT domestications in different eukaryotic lineages. CGGBPs display a wide range of sequence specificity, usually including preferences for CGG or CGC trinucleotides, whereas some bind AT-rich motifs. The CGGBPs are almost entirely nonsyntenic, and their protein sequences, DNA-binding motifs, and patterns of presence or absence in genomes are uncharacteristic of ancestry via speciation. At least eight CGGBPs in the coelacanth Latimeria chalumnae bind distinct motifs, and the expression of the corresponding genes varies considerably across tissues, suggesting tissue-restricted function.


Assuntos
Elementos de DNA Transponíveis , Proteínas de Ligação a DNA/genética , Peixes/genética , Animais , Proteínas de Ligação a DNA/metabolismo , Peixes/metabolismo , Humanos
5.
Biochem J ; 477(7): 1345-1362, 2020 04 17.
Artigo em Inglês | MEDLINE | ID: mdl-32207815

RESUMO

We report the identification and characterization of a bacteriophage λ-encoded protein, NinH. Sequence homology suggests similarity between NinH and Fis, a bacterial nucleoid-associated protein (NAP) involved in numerous DNA topology manipulations, including chromosome condensation, transcriptional regulation and phage site-specific recombination. We find that NinH functions as a homodimer and is able to bind and bend double-stranded DNA in vitro. Furthermore, NinH shows a preference for a 15 bp signature sequence related to the degenerate consensus favored by Fis. Structural studies reinforced the proposed similarity to Fis and supported the identification of residues involved in DNA binding which were demonstrated experimentally. Overexpression of NinH proved toxic and this correlated with its capacity to associate with DNA. NinH is the first example of a phage-encoded Fis-like NAP that likely influences phage excision-integration reactions or bacterial gene expression.


Assuntos
Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Bacteriófago lambda/genética , Bacteriófago lambda/metabolismo , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Proteínas Virais/genética , Proteínas Virais/metabolismo , Proteínas de Bactérias/química , Sequência de Bases , Sítios de Ligação , Simulação por Computador , DNA/metabolismo , DNA Viral/metabolismo , Proteínas de Ligação a DNA/química , Escherichia coli/genética , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Fator Proteico para Inversão de Estimulação/química , Fator Proteico para Inversão de Estimulação/genética , Expressão Gênica , Proteínas Mutantes/metabolismo , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Multimerização Proteica/genética , Proteínas Virais/química
6.
Nat Genet ; 51(6): 981-989, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-31133749

RESUMO

Transcription factor (TF) binding specificities (motifs) are essential for the analysis of gene regulation. Accurate prediction of TF motifs is critical, because it is infeasible to assay all TFs in all sequenced eukaryotic genomes. There is ongoing controversy regarding the degree of motif diversification among related species that is, in part, because of uncertainty in motif prediction methods. Here we describe similarity regression, a significantly improved method for predicting motifs, which we use to update and expand the Cis-BP database. Similarity regression inherently quantifies TF motif evolution, and shows that previous claims of near-complete conservation of motifs between human and Drosophila are inflated, with nearly half of the motifs in each species absent from the other, largely due to extensive divergence in C2H2 zinc finger proteins. We conclude that diversification in DNA-binding motifs is pervasive, and present a new tool and updated resource to study TF diversity and gene regulation across eukaryotes.


Assuntos
Sequência de Bases , Sítios de Ligação , Evolução Molecular , Fatores de Transcrição/metabolismo , Animais , Biologia Computacional/métodos , Sequência Conservada , Bases de Dados Genéticas , Regulação da Expressão Gênica , Humanos , Motivos de Nucleotídeos , Ligação Proteica
7.
Cancer Res ; 79(3): 467-481, 2019 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-30487138

RESUMO

Genome-wide association studies have identified 40 ovarian cancer risk loci. However, the mechanisms underlying these associations remain elusive. In this study, we conducted a two-pronged approach to identify candidate causal SNPs and assess underlying biological mechanisms at chromosome 9p22.2, the first and most statistically significant associated locus for ovarian cancer susceptibility. Three transcriptional regulatory elements with allele-specific effects and a scaffold/matrix attachment region were characterized and, through physical DNA interactions, BNC2 was established as the most likely target gene. We determined the consensus binding sequence for BNC2 in vitro, verified its enrichment in BNC2 ChIP-seq regions, and validated a set of its downstream target genes. Fine-mapping by dense regional genotyping in over 15,000 ovarian cancer cases and 30,000 controls identified SNPs in the scaffold/matrix attachment region as among the most likely causal variants. This study reveals a comprehensive regulatory landscape at 9p22.2 and proposes a likely mechanism of susceptibility to ovarian cancer. SIGNIFICANCE: Mapping the 9p22.2 ovarian cancer risk locus identifies BNC2 as an ovarian cancer risk gene.See related commentary by Choi and Brown, p. 439.


Assuntos
Carcinoma Epitelial do Ovário/genética , Cromossomos Humanos Par 9 , Neoplasias Ovarianas/genética , Sequência de Bases , Proteínas de Ciclo Celular/genética , Linhagem Celular Tumoral , Mapeamento Cromossômico , Cistadenocarcinoma Seroso/genética , DNA de Neoplasias/genética , Proteínas de Ligação a DNA/genética , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Células HEK293 , Humanos , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único
8.
J Biol Chem ; 293(9): 3218-3233, 2018 03 02.
Artigo em Inglês | MEDLINE | ID: mdl-29217772

RESUMO

Plant nucleotide-binding leucine-rich repeat (NLR) proteins enable the immune system to recognize and respond to pathogen attack. An early consequence of immune activation is transcriptional reprogramming, and some NLRs have been shown to act in the nucleus and interact with transcription factors. The Rx1 NLR protein of potato is further able to bind and distort double-stranded DNA. However, Rx1 host targets that support a role for Rx1 in transcriptional reprogramming at DNA are unknown. Here, we report a functional interaction between Rx1 and NbGlk1, a Golden2-like transcription factor. Rx1 binds to NbGlk1 in vitro and in planta. NbGlk1 binds to known Golden2-like consensus DNA sequences. Rx1 reduces the binding affinity of NbGlk1 for DNA in vitro. NbGlk1 activates cellular responses to potato virus X, whereas Rx1 associates with NbGlk1 and prevents its assembly on DNA in planta unless activated by PVX. This study provides new mechanistic insight into how an NLR can coordinate an immune signaling response at DNA following pathogen perceptions.


Assuntos
DNA/metabolismo , Espaço Intracelular/metabolismo , Proteínas NLR/metabolismo , Proteínas de Plantas/metabolismo , Fatores de Transcrição/metabolismo , Proteínas NLR/química , Proteínas de Plantas/química , Ligação Proteica , Domínios Proteicos , Nicotiana
9.
Elife ; 42015 Apr 23.
Artigo em Inglês | MEDLINE | ID: mdl-25905672

RESUMO

Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs.


Assuntos
Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , DNA de Helmintos/genética , Fatores de Transcrição/genética , Dedos de Zinco/genética , Sequência de Aminoácidos , Animais , Sequência de Bases , Sítios de Ligação , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/química , Proteínas de Caenorhabditis elegans/metabolismo , DNA de Helmintos/química , DNA de Helmintos/metabolismo , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Dados de Sequência Molecular , Regiões Promotoras Genéticas , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , Receptores Citoplasmáticos e Nucleares , Fatores de Transcrição/química , Fatores de Transcrição/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...