RESUMEN
Many animal and plant genomes are transcribed much more extensively than current annotations predict. However, the biological function of these unannotated transcribed regions is largely unknown. Approximately 7% and 23% of the detected transcribed nucleotides during D. melanogaster embryogenesis map to unannotated intergenic and intronic regions, respectively. Based on computational analysis of coordinated transcription, we conservatively estimate that 29% of all unannotated transcribed sequences function as missed or alternative exons of well-characterized protein-coding genes. We estimate that 15.6% of intergenic transcribed regions function as missed or alternative transcription start sites (TSS) used by 11.4% of the expressed protein-coding genes. Identification of P element mutations within or near newly identified 5' exons provides a strategy for mapping previously uncharacterized mutations to their respective genes. Collectively, these data indicate that at least 85% of the fly genome is transcribed and processed into mature transcripts representing at least 30% of the fly genome.
Asunto(s)
Drosophila melanogaster/embriología , Drosophila melanogaster/genética , Regulación del Desarrollo de la Expresión Génica , Transcripción Genética , Secuencia de Aminoácidos , Animales , ADN Intergénico , Proteínas de Drosophila/genética , Embrión no Mamífero , Exones , Genoma de los Insectos , Datos de Secuencia Molecular , Mutación , Análisis de Secuencia por Matrices de Oligonucleótidos , Sitio de Iniciación de la TranscripciónRESUMEN
The estrogen receptor is the master transcriptional regulator of breast cancer phenotype and the archetype of a molecular therapeutic target. We mapped all estrogen receptor and RNA polymerase II binding sites on a genome-wide scale, identifying the authentic cis binding sites and target genes, in breast cancer cells. Combining this unique resource with gene expression data demonstrates distinct temporal mechanisms of estrogen-mediated gene regulation, particularly in the case of estrogen-suppressed genes. Furthermore, this resource has allowed the identification of cis-regulatory sites in previously unexplored regions of the genome and the cooperating transcription factors underlying estrogen signaling in breast cancer.
Asunto(s)
Genoma Humano , Receptores de Estrógenos/metabolismo , Elementos de Respuesta , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Adenocarcinoma/genética , Neoplasias de la Mama/genética , Células Cultivadas , Mapeo Cromosómico/métodos , Secuencia Conservada , Proteínas de Unión al ADN/metabolismo , Regulación hacia Abajo , Expresión Génica , Regulación de la Expresión Génica , Humanos , Análisis por Micromatrices/métodos , Proteínas Nucleares/metabolismo , Proteína de Interacción con Receptores Nucleares 1 , Elementos de Respuesta/fisiología , Factores de Transcripción/fisiología , Sitio de Iniciación de la TranscripciónRESUMEN
Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. We used whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched in bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over 40 well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly bound regions are not involved in early-embryonic transcriptional regulation, and a significant proportion may be nonfunctional. Surprisingly, for five of the six factors, their recognition sites are not unambiguously more constrained evolutionarily than the immediate flanking DNA, even in more highly bound and presumably functional regions, indicating that comparative DNA sequence analysis is limited in its ability to identify functional transcription factor targets.
Asunto(s)
Blastodermo/metabolismo , Drosophila melanogaster/embriología , Factores de Transcripción/metabolismo , Animales , Sitios de Unión , ADN/metabolismo , Evolución Molecular , MicroARNs/metabolismoRESUMEN
ETS is a family of transcriptional regulators with functions in most biological processes. Dysregulated ETS factor function leads to altered expression of multiple genes that play critical roles in many of the processes required for cancer progression. While the Ets family gene, prostate-derived ETS factor (PDEF), is expressed in epithelial tissues including prostate, breast, and colon, PDEF protein expression has been found to be reduced or lost during prostate and breast cancer progression. The goal of this study was to examine the expression and biologic impact of altered PDEF expression in colon cancer. PDEF mRNA and protein are not detectable in several colon-cancer-derived cell lines. Re-expression of PDEF in colon cancer cells inhibits growth and migration. Growth affects are due to altered cellular proliferation, indicated by increased altered cell population in G(1) and S phases of the cell cycle, as well as increased apoptosis. Relevant to its modulation of growth and migration phenotypes, PDEF expression resulted in altered expression of genes with established roles in cell cycle, motility, and invasion. Furthermore, chromatin immunoprecipitation studies show that p21 and urokinase plasminogen activator (uPA) are direct PDEF transcriptional targets. While non-tumor colon epithelium expresses PDEF mRNA and protein, the majority of tumors showed decreased mRNA and/or protein expression. In human tumor tissue samples, PDEF expression was inversely correlated with the expression levels of uPA. Collectively, the data support the model that PDEF is a negative regulator of tumor progression by modulating the expression of growth and migration promoting genes.
Asunto(s)
Movimiento Celular , Neoplasias del Colon/metabolismo , Proteínas Proto-Oncogénicas c-ets/genética , Células CACO-2/metabolismo , Procesos de Crecimiento Celular , Línea Celular Tumoral , Neoplasias del Colon/genética , Humanos , Proteínas Proto-Oncogénicas c-ets/metabolismo , ARN Mensajero/metabolismo , Tiorredoxinas/genética , Tiorredoxinas/metabolismo , Activador de Plasminógeno de Tipo Uroquinasa/metabolismoRESUMEN
Ets transcription factors control multiple biological processes, including cell proliferation, differentiation, apoptosis, angiogenesis, transformation, and invasion. Pdef is an Ets transcription factor originally identified in prostate tissue. We demonstrate that human Pdef is expressed at high levels primarily in tissues with high epithelial cell content, including prostate, colon, and breast. We also determined that Pdef protein is reduced in human invasive breast cancer and is absent in invasive breast cancer cell lines. We next assessed the functional consequences of these observations. Significantly, expression of Pdef in breast cancer cells leads to inhibition of invasion, migration, and growth. Expression of Pdef also results in the down-regulation of urokinase-type plasminogen activator and activation of the promoter of the tumor suppressor gene, MASPIN: Growth-suppressive effects of Pdef expression are mediated in part by a G(0)-G(1) cell cycle arrest associated with elevated p21 levels. Collectively, these results indicate that Pdef loss may alter the expression of genes controlling progression to invasive breast cancer.
Asunto(s)
Neoplasias de la Mama/metabolismo , Neoplasias de la Mama/patología , Regulación Neoplásica de la Expresión Génica/fisiología , Factores de Transcripción/biosíntesis , Neoplasias de la Mama/genética , Ciclo Celular/fisiología , División Celular/fisiología , Movimiento Celular/fisiología , Regulación hacia Abajo , Humanos , Invasividad Neoplásica , Proteínas Proto-Oncogénicas c-ets , ARN Mensajero/biosíntesis , ARN Mensajero/genética , Factores de Transcripción/genética , Factores de Transcripción/fisiología , Células Tumorales CultivadasRESUMEN
The ETS1 transcription factor is a member of the Ets family of conserved sequence-specific DNA-binding proteins. ETS1 has been shown to play important roles in various cellular processes such as proliferation, differentiation, lymphoid development, motility, invasion and angiogenesis. These diverse roles of ETS1 are likely to be dependent on specific protein interactions. To identify proteins that interact with ETS1, a yeast two-hybrid screen was conducted. Here, we describe the functional interaction between SP100 and ETS1. SP100 protein interacts with ETS1 both in vitro and in vivo. SP100 is localized to nuclear bodies and ETS1 expression alters the nuclear body morphology in living cells. SP100 negatively modulates ETS1 transcriptional activation of the MMP1 and uPA promoters in a dose-dependent manner, decreases the expression of these endogenous genes, and reduces ETS1 DNA binding. Expression of SP100 inhibits the invasion of breast cancer cells and is induced by Interferon-alpha, which has been shown to inhibit the invasion of cancer cells. These data demonstrate that SP100 modulates ETS1-dependent biological processes.
Asunto(s)
Antígenos Nucleares/fisiología , Autoantígenos/fisiología , Proteínas Portadoras/fisiología , Péptidos y Proteínas de Señalización Intracelular , Proteínas Nucleares/fisiología , Proteínas Adaptadoras Transductoras de Señales , Secuencia de Bases , Neoplasias de la Mama/metabolismo , Neoplasias de la Mama/patología , Línea Celular Tumoral , Núcleo Celular/metabolismo , Proteínas Co-Represoras , Cartilla de ADN , Humanos , Chaperonas Moleculares , Invasividad Neoplásica , Técnicas del Sistema de Dos HíbridosRESUMEN
The Ets family of transcription factors regulates many biological processes. Within the Ets family are a subset of proteins that have epithelial restricted expression patterns, both in tissues and cell lines. These Epithelial-specific Ets (Ese) factors, cluster into two groups based on the sequence of the Ets DNA binding domain: (i) Ese1, Ese2 and Ese3, and (ii) the more divergent, Pdef. Gene targeting and cell culture studies have demonstrated that Ese factors are important for cellular differentiation. Furthermore, the ability of Ese factors to regulate expression of genes that are markers of differentiation supports this model. Significantly, the expression profile of these Ese factors is altered in epithelial cancers relative to the respective normal tissues. It is likely that this variation in expression affects downstream target genes important for carcinogenesis such as oncogenes and tumor suppressor genes. Understanding the role of these Ese factors in normal cellular differentiation may elucidate specific mechanisms by which cancer cells can become undifferentiated.
Asunto(s)
Neoplasias/metabolismo , Proteínas Proto-Oncogénicas/fisiología , Factores de Transcripción/fisiología , Animales , Diferenciación Celular/fisiología , División Celular/fisiología , Células Epiteliales/citología , Células Epiteliales/metabolismo , Humanos , Neoplasias/genética , Neoplasias/patología , Proteínas Proto-Oncogénicas/genética , Proteínas Proto-Oncogénicas/metabolismo , Proteínas Proto-Oncogénicas c-ets , Distribución Tisular , Factores de Transcripción/genética , Factores de Transcripción/metabolismoRESUMEN
Significant fractions of eukaryotic genomes give rise to RNA, much of which is unannotated and has reduced protein-coding potential. The genomic origins and the associations of human nuclear and cytosolic polyadenylated RNAs longer than 200 nucleotides (nt) and whole-cell RNAs less than 200 nt were investigated in this genome-wide study. Subcellular addresses for nucleotides present in detected RNAs were assigned, and their potential processing into short RNAs was investigated. Taken together, these observations suggest a novel role for some unannotated RNAs as primary transcripts for the production of short RNAs. Three potentially functional classes of RNAs have been identified, two of which are syntenically conserved and correlate with the expression state of protein-coding genes. These data support a highly interleaved organization of the human transcriptome.
Asunto(s)
Genoma Humano , Precursores del ARN/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo , ARN/genética , Transcripción Genética , Animales , Línea Celular Tumoral , Núcleo Celular/metabolismo , Citosol/metabolismo , Exones , Expresión Génica , Genoma , Células HeLa , Humanos , Ratones , Regiones Promotoras Genéticas , ARN/metabolismo , Precursores del ARN/metabolismo , Sintenía , Regiones Terminadoras GenéticasRESUMEN
Dorsal-ventral (DV) patterning of the Drosophila embryo is initiated by Dorsal, a sequence-specific transcription factor distributed in a broad nuclear gradient in the precellular embryo. Previous studies have identified as many as 70 protein-coding genes and one microRNA (miRNA) gene that are directly or indirectly regulated by this gradient. A gene regulation network, or circuit diagram, including the functional interconnections among 40 Dorsal target genes and 20 associated tissue-specific enhancers, has been determined for the initial stages of gastrulation. Here, we attempt to extend this analysis by identifying additional DV patterning genes using a recently developed whole-genome tiling array. This analysis led to the identification of another 30 protein-coding genes, including the Drosophila homolog of Idax, an inhibitor of Wnt signaling. In addition, remote 5' exons were identified for at least 10 of the approximately 100 protein-coding genes that were missed in earlier annotations. As many as nine intergenic uncharacterized transcription units were identified, including two that contain known microRNAs, miR-1 and -9a. We discuss the potential functions of these recently identified genes and suggest that intronic enhancers are a common feature of the DV gene network.
Asunto(s)
Tipificación del Cuerpo/genética , Drosophila melanogaster/embriología , Drosophila melanogaster/genética , Regulación del Desarrollo de la Expresión Génica , Genoma de los Insectos/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Animales , Embrión no Mamífero/embriología , Embrión no Mamífero/metabolismoRESUMEN
MicroRNAs (miRNAs) regulate posttranscriptional gene activity by binding to specific sequences in the 3' UTRs of target mRNAs. A number of metazoan miRNAs have been shown to exhibit tissue-specific patterns of expression. Here, we investigate the possibility that localized expression is mediated by tissue-specific enhancers, comparable to those seen for protein-coding genes. Two miRNA loci in Drosophila melanogaster are investigated, the mir-309-6 polycistron (8-miR) and the mir-1 gene. The 8-miR locus contains a cluster of eight distinct miRNAs that are transcribed in a common precursor RNA. The 8-miR primary transcript displays a dynamic pattern of expression in early embryos, including repression at the anterior and posterior poles. An 800-bp 5' enhancer was identified that recapitulates this complex pattern when attached to a RNA polymerase II core promoter fused to a lacZ-reporter gene. The miR-1 locus is specifically expressed in the mesoderm of gastrulating embryos. Bioinformatics methods were used to identify a mesoderm-specific enhancer located approximately 5 kb 5' of the miR-1 transcription unit. Evidence is presented that the 8-miR enhancer is regulated by the localized Huckebein repressor, whereas miR-1 is activated by Dorsal and Twist. These results provide evidence that restricted activities of the 8-miR and miR-1 miRNAs are mediated by classical tissue-specific enhancers.
Asunto(s)
Drosophila melanogaster/genética , Regulación del Desarrollo de la Expresión Génica/genética , MicroARNs/genética , Factores de Transcripción , Animales , Proteínas de Unión al ADN/fisiología , Proteínas de Drosophila/fisiología , Drosophila melanogaster/embriología , Embrión no Mamífero , Elementos de Facilitación Genéticos , Genes , Mesodermo/metabolismo , Proteínas Nucleares/fisiología , Fosfoproteínas/fisiología , Precursores del ARN , Distribución Tisular , Factores de Transcripción/fisiología , Transcripción Genética , Proteína 1 Relacionada con Twist/fisiologíaRESUMEN
Sites of transcription of polyadenylated and nonpolyadenylated RNAs for 10 human chromosomes were mapped at 5-base pair resolution in eight cell lines. Unannotated, nonpolyadenylated transcripts comprise the major proportion of the transcriptional output of the human genome. Of all transcribed sequences, 19.4, 43.7, and 36.9% were observed to be polyadenylated, nonpolyadenylated, and bimorphic, respectively. Half of all transcribed sequences are found only in the nucleus and for the most part are unannotated. Overall, the transcribed portions of the human genome are predominantly composed of interlaced networks of both poly A+ and poly A- annotated transcripts and unannotated transcripts of unknown function. This organization has important implications for interpreting genotype-phenotype associations, regulation of gene expression, and the definition of a gene.
Asunto(s)
Cromosomas Humanos/genética , Genoma Humano , ARN Mensajero/análisis , Transcripción Genética , Línea Celular , Línea Celular Tumoral , Núcleo Celular/metabolismo , Cromosomas Humanos Par 13/genética , Cromosomas Humanos Par 14/genética , Cromosomas Humanos Par 19/genética , Cromosomas Humanos Par 20/genética , Cromosomas Humanos Par 21/genética , Cromosomas Humanos Par 22/genética , Cromosomas Humanos Par 6/genética , Cromosomas Humanos Par 7/genética , Cromosomas Humanos X/genética , Cromosomas Humanos Y/genética , Biología Computacional , Citosol/metabolismo , ADN Complementario , ADN Intergénico , Exones , Femenino , Humanos , Intrones , Masculino , Datos de Secuencia Molecular , Técnicas de Amplificación de Ácido Nucleico , Análisis de Secuencia por Matrices de Oligonucleótidos , Mapeo Físico de Cromosoma , Empalme del ARNRESUMEN
In the past decade, microarray technology has become a major tool for high-throughput comprehensive analysis of gene expression, genotyping and resequencing applications. Currently, the most widely employed application of high-density oligonucleotide arrays (HDOAs) involves monitoring changes in gene expression. This application has been carried out in a variety of organisms ranging from Escherichia coli to humans. The recent near completion of the human and mouse genome sequences, however, as well as the genomes of other model experimental species, has allowed for novel applications of HDOAs, such as: the discovery of novel transcripts, mapping functionally important genomic regions and identifying functional domains in RNA molecules. Integrating all this information will provide novel global views of the locations of RNA transcription, DNA replication and the protein nucleic acid interactions that regulate these processes.
Asunto(s)
Perfilación de la Expresión Génica/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , ARN/análisis , Replicación del ADN , ADN Complementario/genética , Perfilación de la Expresión Génica/instrumentación , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos/instrumentación , ARN/genética , Transcripción GenéticaRESUMEN
Extracellular matrix (ECM) production and turnover are tightly controlled under normal physiological conditions. Ets factors regulate matrix turnover by activating transcription of several metalloproteinases (MMPs) and are frequently overexpressed in aggressive tumors and arthritis. Because of the prominent role of transforming growth factor beta (TGF-beta) in ECM synthesis, this study was undertaken to determine the possible interactions between Ets1 and the TGF-beta pathway. Experiments using adenoviral delivery of Ets1 in human fibroblasts have established that Ets1 strongly suppresses TGF-beta induction of collagen type I and other matrix-related genes and reverses TGF-beta-dependent inhibition of MMP-1. Subsequent experiments utilizing COL1A2 promoter demonstrated that Ets1 in the presence of TGF-beta signaling interferes with the stimulatory role of p300. To gain further insight into the mechanism of Ets1 inhibition of the TGF-beta signaling, the protein levels and post-translational modifications of Ets1 after TGF-beta treatment were analyzed. The level of total Ets1 protein was not affected after 24 h of TGF-beta stimulation. Moreover, TGF-beta did not affect either serine or threonine phosphorylation levels of Ets1. However, TGF-beta induced rapid and prolonged lysine acetylation of Ets1. In addition, analyses of endogenous p300.Ets1 complexes revealed that acetylated Ets1 is preferentially associated with the p300/CBP complexes. TGF-beta treatment leads to dissociation of Ets1 from the CBP/p300 complexes. Together, these findings suggest that elevated expression of Ets1 in fibroblasts fundamentally alters their responses to TGF-beta in favor of matrix degradation and away from matrix deposition as exemplified by arthritis and cancer.
Asunto(s)
Proteínas Proto-Oncogénicas/fisiología , Transducción de Señal/fisiología , Factores de Transcripción/fisiología , Factor de Crecimiento Transformador beta/metabolismo , Acetilación , Células Cultivadas , Fibrosis , Regulación de la Expresión Génica/fisiología , Humanos , Inhibidores de la Metaloproteinasa de la Matriz , Proteína Proto-Oncogénica c-ets-1 , Proteínas Proto-Oncogénicas/metabolismo , Proteínas Proto-Oncogénicas c-ets , Factores de Transcripción/metabolismo , Transcripción Genética/fisiología , Factor de Crecimiento Transformador beta/antagonistas & inhibidoresRESUMEN
Using high-density oligonucleotide arrays representing essentially all nonrepetitive sequences on human chromosomes 21 and 22, we map the binding sites in vivo for three DNA binding transcription factors, Sp1, cMyc, and p53, in an unbiased manner. This mapping reveals an unexpectedly large number of transcription factor binding site (TFBS) regions, with a minimal estimate of 12,000 for Sp1, 25,000 for cMyc, and 1600 for p53 when extrapolated to the full genome. Only 22% of these TFBS regions are located at the 5' termini of protein-coding genes while 36% lie within or immediately 3' to well-characterized genes and are significantly correlated with noncoding RNAs. A significant number of these noncoding RNAs are regulated in response to retinoic acid, and overlapping pairs of protein-coding and noncoding RNAs are often coregulated. Thus, the human genome contains roughly comparable numbers of protein-coding and noncoding genes that are bound by common transcription factors and regulated by common environmental signals.