RESUMO
In the process of seeking novel lung host defense regulators by analyzing genome-wide RNA sequence data from normal human airway epithelium, we detected expression of POU domain class 2-associating factor 1 (POU2AF1), a known transcription cofactor previously thought to be expressed only in lymphocytes. Lymphocyte contamination of human airway epithelial samples obtained by bronchoscopy and brushing was excluded by immunohistochemistry staining, the observation of upregulation of POU2AF1 in purified airway basal stem/progenitor cells undergoing differentiation, and analysis of differentiating single basal cell clones. Lentivirus-mediated upregulation of POU2AF1 in airway basal cells induced upregulation of host defense genes, including MX1, IFIT3, IFITM, and known POU2AF1 downstream genes HLA-DRA, ID2, ID3, IL6, and BCL6. Interestingly, expression of these genes paralleled changes of POU2AF1 expression during airway epithelium differentiation in vitro, suggesting POU2AF1 helps to maintain a host defense tone even in pathogen-free condition. Cigarette smoke, a known risk factor for airway infection, suppressed POU2AF1 expression both in vivo in humans and in vitro in human airway epithelial cultures, accompanied by deregulation of POU2AF1 downstream genes. Finally, enhancing POU2AF1 expression in human airway epithelium attenuated the suppression of host defense genes by smoking. Together, these findings suggest a novel function of POU2AF1 as a potential regulator of host defense genes in the human airway epithelium.
Assuntos
Regulação da Expressão Gênica , Imunidade/genética , Mucosa Respiratória/imunologia , Mucosa Respiratória/metabolismo , Transativadores/genética , Transativadores/metabolismo , Diferenciação Celular , Análise por Conglomerados , Células Epiteliais/metabolismo , Perfilação da Expressão Gênica , Humanos , Mucosa Respiratória/citologia , Fumar/efeitos adversosRESUMO
MPromDb (Mammalian Promoter Database) is a curated database that strives to annotate gene promoters identified from ChIP-seq results with the goal of providing an integrated resource for mammalian transcriptional regulation and epigenetics. We analyzed 507 million uniquely aligned RNAP-II ChIP-seq reads from 26 different data sets that include six human cell-types and 10 distinct mouse cell/tissues. The updated MPromDb version consists of computationally predicted (novel) and known active RNAP-II promoters (42,893 human and 48,366 mouse promoters) from various data sets freely available at NCBI GEO database. We found that 36% and 40% of protein-coding genes have alternative promoters in human and mouse genomes and â¼40% of promoters are tissue/cell specific. The identified RNAP-II promoters were annotated using various known and novel gene models. Additionally, for novel promoters we looked into other evidences-GenBank mRNAs, spliced ESTs, CAGE promoter tags and mRNA-seq reads. Users can search the database based on gene id/symbol, or by specific tissue/cell type and filter results based on any combination of tissue/cell specificity, Known/Novel, CpG/NonCpG, and protein-coding/non-coding gene promoters. We have also integrated GBrowse genome browser with MPromDb for visualization of ChIP-seq profiles and to display the annotations. The current release of MPromDb can be accessed at http://bioinformatics.wistar.upenn.edu/MPromDb/.
Assuntos
Imunoprecipitação da Cromatina , Bases de Dados de Ácidos Nucleicos , Regiões Promotoras Genéticas , Animais , Gráficos por Computador , Humanos , Camundongos , RNA Polimerase II/metabolismo , Análise de Sequência de DNA , Integração de SistemasRESUMO
Alternative promoters that are differentially used in various cellular contexts and tissue types add to the transcriptional complexity in mammalian genome. Identification of alternative promoters and the annotation of their activity in different tissues is one of the major challenges in understanding the transcriptional regulation of the mammalian genes and their isoforms. To determine the use of alternative promoters in different tissues, we performed ChIP-seq experiments using antibody against RNA Pol-II, in five adult mouse tissues (brain, liver, lung, spleen and kidney). Our analysis identified 38 639 Pol-II promoters, including 12 270 novel promoters, for both protein coding and non-coding mouse genes. Of these, 6384 promoters are tissue specific which are CpG poor and we find that only 34% of the novel promoters are located in CpG-rich regions, suggesting that novel promoters are mostly tissue specific. By identifying the Pol-II bound promoter(s) of each annotated gene in a given tissue, we found that 37% of the protein coding genes use alternative promoters in the five mouse tissues. The promoter annotations and ChIP-seq data presented here will aid ongoing efforts of characterizing gene regulatory regions in mammalian genomes.
Assuntos
Regiões Promotoras Genéticas , RNA Polimerase II/metabolismo , Animais , Imunoprecipitação da Cromatina , Mapeamento Cromossômico , Genoma , Camundongos , Análise de Sequência de DNA/normas , Transcrição GênicaRESUMO
Recent research has identified critical roles for microRNAs in a large number of cellular processes, including tumorigenic transformation. While significant progress has been made towards understanding the mechanisms of gene regulation by microRNAs, much less is known about factors affecting the expression of these noncoding transcripts. Here, we demonstrate for the first time a functional link between hypoxia, a well-documented tumor microenvironment factor, and microRNA expression. Microarray-based expression profiles revealed that a specific spectrum of microRNAs (including miR-23, -24, -26, -27, -103, -107, -181, -210, and -213) is induced in response to low oxygen, at least some via a hypoxia-inducible-factor-dependent mechanism. Select members of this group (miR-26, -107, and -210) decrease proapoptotic signaling in a hypoxic environment, suggesting an impact of these transcripts on tumor formation. Interestingly, the vast majority of hypoxia-induced microRNAs are also overexpressed in a variety of human tumors.
Assuntos
Hipóxia Celular , MicroRNAs/metabolismo , Apoptose , Neoplasias da Mama/patologia , Linhagem Celular Tumoral , Imunoprecipitação da Cromatina , Neoplasias do Colo/patologia , Feminino , Perfilação da Expressão Gênica , Genes Reporter , Células HCT116 , Células HT29 , Humanos , Luciferases/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos , Plasmídeos , Reação em Cadeia da Polimerase Via Transcriptase Reversa , TransfecçãoRESUMO
Understanding the genetics of field-based tolerance to high iron-associated (HIA) stress in rice can accelerate the development of new varieties with enhanced yield performance in West African lowland ecosystems. To date, few field-based studies have been undertaken to rigorously evaluate rice yield performance under HIA stress conditions. In this study, two NERICA × O. sativa bi-parental rice populations and one O.sativa diversity panel consisting of 296 rice accessions were evaluated for grain yield and leaf bronzing symptoms over multiple years in four West African HIA stress and control sites. Mapping of these traits identified a large number of QTLs and single nucleotide polymorphisms (SNPs) associated with stress tolerance in the field. Favorable alleles associated with tolerance to high levels of iron in anaerobic rice soils were rare and almost exclusively derived from the indica subpopulation, including the most favorable alleles identified in NERICA varieties. These findings highlight the complex genetic architecture underlying rice response to HIA stress and suggest that a recurrent selection program focusing on an expanded indica genepool could be productively used in combination with genomic selection to increase the efficiency of selection in breeding programs designed to enhance tolerance to this prevalent abiotic stress in West Africa.
RESUMO
Several studies have reported that a high expression ratio of HOXB13 to IL17BR predicts tumor recurrence in node-negative, estrogen receptor (ER) alpha-positive breast cancer patients treated with tamoxifen. The molecular mechanisms underlying this dysregulation of gene expression remain to be explored. Our epigenetic analysis has found that increased promoter methylation of one of these genes, HOXB13, correlate with the decreased expression of its transcript in breast cancer cell lines (P < 0.005). Transcriptional silencing of this gene can be reversed by a demethylation treatment. HOXB13 is suppressed by the activation of estrogen signaling in ERalpha-positive breast cancer cells. However, treatment with 4-hydroxytamoxifen (4-OHT), an antiestrogen, abrogates the ERalpha-mediated suppression in cancer cells. The notion that this transcriptional induction of HOXB13 occurs in vitro with simultaneous exposure to both estrogen and 4-OHT may provide a biological explanation for its aberrant expression in many node-negative patients undergoing tamoxifen therapy. Interestingly, promoter hypermethylation of HOXB13 is more frequently observed in ERalpha-positive patients with increased lymph node metastasis (P = 0.031) and large tumor sizes (>5 cm) (P = 0.008). In addition, this aberrant epigenetic event is associated with shorter disease-free survival (P = 0.029) in cancer patients. These results suggest that hypermethylation of HOXB13 is a late event of breast tumorigenesis and a poor prognostic indicator of node-positive cancer patients.
Assuntos
Neoplasias da Mama/genética , Metilação de DNA , Estradiol/farmacologia , Regulação Neoplásica da Expressão Gênica , Proteínas de Homeodomínio/genética , Neoplasias da Mama/metabolismo , Linhagem Celular Tumoral , Ilhas de CpG , Receptor alfa de Estrogênio/biossíntese , Estrogênios/metabolismo , Feminino , Inativação Gênica , Proteínas de Homeodomínio/biossíntese , Humanos , Regiões Promotoras Genéticas , Transdução de Sinais , Tamoxifeno/análogos & derivados , Tamoxifeno/farmacologiaRESUMO
As sequencing and genotyping technologies evolve, crop genetics researchers accumulate increasing numbers of genomic data sets from various genotyping platforms on different germplasm panels. Imputation is an effective approach to increase marker density of existing data sets toward the goal of integrating resources for downstream applications. While a number of imputation software packages are available, the limitations to utilization for the rice community include high computational demand and lack of a reference panel. To address these challenges, we develop the Rice Imputation Server, a publicly available web application leveraging genetic information from a globally diverse rice reference panel assembled here. This resource allows researchers to benefit from increased marker density without needing to perform imputation on their own machines. We demonstrate improvements that imputed data provide to rice genome-wide association (GWA) results of grain amylose content and show that the major functional nucleotide polymorphism is tagged only in the imputed data set.
Assuntos
Oryza/genética , Frequência do Gene/genética , Estudo de Associação Genômica Ampla , Genótipo , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
BACKGROUND: The canonical core promoter elements consist of the TATA box, initiator (Inr), downstream core promoter element (DPE), TFIIB recognition element (BRE) and the newly-discovered motif 10 element (MTE). The motifs for these core promoter elements are highly degenerate, which tends to lead to a high false discovery rate when attempting to detect them in promoter sequences. RESULTS: In this study, we have performed the first analysis of these core promoter elements in orthologous mouse and human promoters with experimentally-supported transcription start sites. We have identified these various elements using a combination of positional weight matrices (PWMs) and the degree of conservation of orthologous mouse and human sequences--a procedure that significantly reduces the false positive rate of motif discovery. Our analysis of 9,010 orthologous mouse-human promoter pairs revealed two combinations of three-way synergistic effects, TATA-Inr-MTE and BRE-Inr-MTE. The former has previously been putatively identified in human, but the latter represents a novel synergistic relationship. CONCLUSION: Our results demonstrate that DNA sequence conservation can greatly improve the identification of functional core promoter elements in the human genome. The data also underscores the importance of synergistic occurrence of two or more core promoter elements. Furthermore, the sequence data and results presented here can help build better computational models for predicting the transcription start sites in the promoter regions, which remains one of the most challenging problems.
Assuntos
Mapeamento Cromossômico/métodos , Sequência Conservada/genética , Evolução Molecular , Regiões Promotoras Genéticas/genética , Análise de Sequência de DNA/métodos , Animais , Sequência de Bases , Genoma Humano , Humanos , Camundongos , Dados de Sequência Molecular , Alinhamento de Sequência , Homologia de Sequência do Ácido NucleicoRESUMO
BACKGROUND: The TGF-beta/SMAD pathway is part of a broader signaling network in which crosstalk between pathways occurs. While the molecular mechanisms of TGF-beta/SMAD signaling pathway have been studied in detail, the global networks downstream of SMAD remain largely unknown. The regulatory effect of SMAD complex likely depends on transcriptional modules, in which the SMAD binding elements and partner transcription factor binding sites (SMAD modules) are present in specific context. RESULTS: To address this question and develop a computational model for SMAD modules, we simultaneously performed chromatin immunoprecipitation followed by microarray analysis (ChIP-chip) and mRNA expression profiling to identify TGF-beta/SMAD regulated and synchronously coexpressed gene sets in ovarian surface epithelium. Intersecting the ChIP-chip and gene expression data yielded 150 direct targets, of which 141 were grouped into 3 co-expressed gene sets (sustained up-regulated, transient up-regulated and down-regulated), based on their temporal changes in expression after TGF-beta activation. We developed a data-mining method driven by the Random Forest algorithm to model SMAD transcriptional modules in the target sequences. The predicted SMAD modules contain SMAD binding element and up to 2 of 7 other transcription factor binding sites (E2F, P53, LEF1, ELK1, COUPTF, PAX4 and DR1). CONCLUSION: Together, the computational results further the understanding of the interactions between SMAD and other transcription factors at specific target promoters, and provide the basis for more targeted experimental verification of the co-regulatory modules.