Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 17 de 17
Filter
1.
Genome Res ; 26(2): 151-62, 2016 Feb.
Article in English | MEDLINE | ID: mdl-26728717

ABSTRACT

An open question in the history of human migration is the identity of the earliest Eurasian populations that have left contemporary descendants. The Arabian Peninsula was the initial site of the out-of-Africa migrations that occurred between 125,000 and 60,000 yr ago, leading to the hypothesis that the first Eurasian populations were established on the Peninsula and that contemporary indigenous Arabs are direct descendants of these ancient peoples. To assess this hypothesis, we sequenced the entire genomes of 104 unrelated natives of the Arabian Peninsula at high coverage, including 56 of indigenous Arab ancestry. The indigenous Arab genomes defined a cluster distinct from other ancestral groups, and these genomes showed clear hallmarks of an ancient out-of-Africa bottleneck. Similar to other Middle Eastern populations, the indigenous Arabs had higher levels of Neanderthal admixture compared to Africans but had lower levels than Europeans and Asians. These levels of Neanderthal admixture are consistent with an early divergence of Arab ancestors after the out-of-Africa bottleneck but before the major Neanderthal admixture events in Europe and other regions of Eurasia. When compared to worldwide populations sampled in the 1000 Genomes Project, although the indigenous Arabs had a signal of admixture with Europeans, they clustered in a basal, outgroup position to all 1000 Genomes non-Africans when considering pairwise similarity across the entire genome. These results place indigenous Arabs as the most distant relatives of all other contemporary non-Africans and identify these people as direct descendants of the first Eurasian populations established by the out-of-Africa migrations.


Subject(s)
Arabs/genetics , Black People/genetics , Human Migration , Neanderthals/genetics , White People/genetics , Animals , Cluster Analysis , DNA, Mitochondrial/genetics , Gene Frequency , Humans , Hybridization, Genetic , Markov Chains , Models, Genetic , Phylogeny , Principal Component Analysis , Qatar , Sequence Analysis, DNA
2.
J Immunol ; 196(7): 3159-67, 2016 Apr 01.
Article in English | MEDLINE | ID: mdl-26927796

ABSTRACT

In the process of seeking novel lung host defense regulators by analyzing genome-wide RNA sequence data from normal human airway epithelium, we detected expression of POU domain class 2-associating factor 1 (POU2AF1), a known transcription cofactor previously thought to be expressed only in lymphocytes. Lymphocyte contamination of human airway epithelial samples obtained by bronchoscopy and brushing was excluded by immunohistochemistry staining, the observation of upregulation of POU2AF1 in purified airway basal stem/progenitor cells undergoing differentiation, and analysis of differentiating single basal cell clones. Lentivirus-mediated upregulation of POU2AF1 in airway basal cells induced upregulation of host defense genes, including MX1, IFIT3, IFITM, and known POU2AF1 downstream genes HLA-DRA, ID2, ID3, IL6, and BCL6. Interestingly, expression of these genes paralleled changes of POU2AF1 expression during airway epithelium differentiation in vitro, suggesting POU2AF1 helps to maintain a host defense tone even in pathogen-free condition. Cigarette smoke, a known risk factor for airway infection, suppressed POU2AF1 expression both in vivo in humans and in vitro in human airway epithelial cultures, accompanied by deregulation of POU2AF1 downstream genes. Finally, enhancing POU2AF1 expression in human airway epithelium attenuated the suppression of host defense genes by smoking. Together, these findings suggest a novel function of POU2AF1 as a potential regulator of host defense genes in the human airway epithelium.


Subject(s)
Gene Expression Regulation , Immunity/genetics , Respiratory Mucosa/immunology , Respiratory Mucosa/metabolism , Trans-Activators/genetics , Trans-Activators/metabolism , Cell Differentiation , Cluster Analysis , Epithelial Cells/metabolism , Gene Expression Profiling , Humans , Respiratory Mucosa/cytology , Smoking/adverse effects
3.
Am J Respir Crit Care Med ; 194(5): 587-95, 2016 09 01.
Article in English | MEDLINE | ID: mdl-27007171

ABSTRACT

RATIONALE: Waterpipes, also called hookahs, are currently used by millions of people worldwide. Despite the increasing use of waterpipe smoking, there is limited data on the health effects of waterpipe smoking and there are no federal regulations regarding its use. OBJECTIVES: To assess the effects of waterpipe smoking on the human lung using clinical and biological parameters in young, light-use waterpipe smokers. METHODS: We assessed young, light-use, waterpipe-only smokers in comparison with lifelong nonsmokers using clinical parameters of cough and sputum scores, lung function, and chest high-resolution computed tomography as well as biological parameters of lung epithelial lining fluid metabolome, small airway epithelial (SAE) cell differential and transcriptome, alveolar macrophage transcriptome, and plasma apoptotic endothelial cell microparticles. MEASUREMENTS AND MAIN RESULTS: Compared with nonsmokers, waterpipe smokers had more cough and sputum as well as a lower lung diffusing capacity, abnormal epithelial lining fluid metabolome profile, increased proportions of SAE secretory and intermediate cells, reduced proportions of SAE ciliated and basal cells, markedly abnormal SAE and alveolar macrophage transcriptomes, and elevated levels of apoptotic endothelial cell microparticles. CONCLUSIONS: Young, light-use, waterpipe-only smokers have a variety of abnormalities in multiple lung-related biological and clinical parameters, suggesting that even limited waterpipe use has broad consequences on human lung biology and health. We suggest that large epidemiological studies should be initiated to investigate the harmful effects of waterpipe smoking.


Subject(s)
Lung/pathology , Lung/physiopathology , Pulmonary Diffusing Capacity , Smoking/adverse effects , Tobacco Use Disorder/complications , Transcriptome/drug effects , Adult , Carbon Monoxide/analysis , Carboxyhemoglobin/analysis , Case-Control Studies , Cell-Derived Microparticles/drug effects , Cotinine/urine , Cough/etiology , Cough/microbiology , Epithelial Cells/drug effects , Female , Forced Expiratory Volume/physiology , Humans , Male , Nicotine/urine , Pulmonary Alveoli/cytology , Pulmonary Alveoli/drug effects , Sputum/chemistry , Sputum/drug effects , Thorax/diagnostic imaging , Tomography, X-Ray Computed , Young Adult
4.
BMC Genomics ; 16: 834, 2015 Oct 22.
Article in English | MEDLINE | ID: mdl-26490036

ABSTRACT

BACKGROUND: The populations of the Arabian Peninsula remain the least represented in public genetic databases, both in terms of single nucleotide variants and of larger genomic mutations. We present the first high-resolution copy number variation (CNV) map for a Gulf Arab population, using a hybrid approach that integrates array genotyping intensity data and next-generation sequencing reads to call CNVs in the Qatari population. METHODS: CNVs were detected in 97 unrelated Qatari individuals by running two calling algorithms on each of two primary datasets: high-resolution genotyping (Illumina Omni 2.5M) and high depth whole-genome sequencing (Illumina PE 100bp). The four call-sets were integrated to identify high confidence CNV regions, which were subsequently annotated for putative functional effect and compared to public databases of CNVs in other populations. The availability of genome sequence was leveraged to identify tagging SNPs in high LD with common deletions in this population, enabling their imputation from genotyping experiments in the future. RESULTS: Genotyping intensities and genome sequencing data from 97 Qataris were analyzed with four different algorithms and integrated to discover 16,660 high confidence CNV regions (CNVRs) in the total population, affecting ~28 Mb in the median Qatari genome. Up to 40% of all CNVs affected genes, including novel CNVs affecting Mendelian disease genes, segregating at different frequencies in the 3 major Qatari subpopulations, including those with Bedouin, Persian/South Asian, and African ancestry. Consistent with high consanguinity levels in the Bedouin subpopulation, we found an increased burden for homozygous deletions in this group. In comparison to known CNVs in the comprehensive Database of Genomic Variants, we found that 5% of all CNVRs in Qataris were completely novel, with an enrichment of CNVs affecting several known chromosomal disorder loci and genes known to regulate sugar metabolism and type 2 diabetes in the Qatari cohort. Finally, we leveraged the availability of genome sequence to find suitable tagging SNPs for common deletions in this population. CONCLUSION: We combine four independently generated datasets from 97 individuals to study CNVs for the first time at high-resolution in a Gulf Arab population.


Subject(s)
DNA Copy Number Variations , Gene Dosage , Genetics, Population , Genome, Human , Genomics , Computational Biology/methods , Genetic Predisposition to Disease , Genome-Wide Association Study , Genomics/methods , High-Throughput Nucleotide Sequencing , Humans , Molecular Sequence Annotation , Polymorphism, Single Nucleotide , Qatar
5.
Hum Mutat ; 35(1): 105-16, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24123366

ABSTRACT

Exome sequencing of families of related individuals has been highly successful in identifying genetic polymorphisms responsible for Mendelian disorders. Here, we demonstrate the value of the reverse approach, where we use exome sequencing of a sample of unrelated individuals to analyze allele frequencies of known causal mutations for Mendelian diseases. We sequenced the exomes of 100 individuals representing the three major genetic subgroups of the Qatari population (Q1 Bedouin, Q2 Persian-South Asian, Q3 African) and identified 37 variants in 33 genes with effects on 36 clinically significant Mendelian diseases. These include variants not present in 1000 Genomes and variants at high frequency when compared with 1000 Genomes populations. Several of these Mendelian variants were only segregating in one Qatari subpopulation, where the observed subpopulation specificity trends were confirmed in an independent population of 386 Qataris. Premarital genetic screening in Qatar tests for only four out of the 37, such that this study provides a set of Mendelian disease variants with potential impact on the epidemiological profile of the population that could be incorporated into the testing program if further experimental and clinical characterization confirms high penetrance.


Subject(s)
Chromosomes, Human/genetics , Genetic Diseases, Inborn/genetics , Genetic Predisposition to Disease , Genetic Testing , Genetic Variation , Sequence Analysis, DNA , Databases, Genetic , Exome , Female , Gene Frequency , Genetic Diseases, Inborn/epidemiology , Humans , Male , Prevalence , Qatar/epidemiology
6.
Eur Respir J ; 42(5): 1332-44, 2013 Nov.
Article in English | MEDLINE | ID: mdl-23645403

ABSTRACT

Lung cancer, including lung adenocarcinoma, is a heterogeneous disease, which evolves from molecular alterations in the airway epithelium. This study explores whether a subtype of lung adenocarcinomas expresses the unique molecular features of human airway basal cells (BCs), and how expression of the airway BC features correlates with the molecular, pathological and clinical phenotype of lung adenocarcinoma. Three independent lung adenocarcinoma data sets were analysed for expression of genes that constitute the airway BC signature. Expression of the BC signature in lung adenocarcinoma was then correlated to clinical and biological parameters. Remarkable enrichment of airway BC signature genes was found in lung adenocarcinomas. A subset of lung adenocarcinomas (BC-high adenocarcinoma) exhibited high expression of BC signature genes in association with poorer tumour grade, higher frequency of vascular invasion and shorter survival than adenocarcinomas with lower expression of these genes. At the molecular level, BC-high adenocarcinomas displayed a higher frequency of KRAS mutations, activation of transcriptional networks and pathways related to cell cycle, extracellular matrix organisation, and a distinct differentiation pattern with suppression of ciliated and exocrine bronchiolar cell (Clara cell)-related genes. Activation of the airway BC programme is a molecular feature of a distinct, aggressive subtype of lung adenocarcinoma.


Subject(s)
Adenocarcinoma/classification , Adenocarcinoma/genetics , Lung Neoplasms/classification , Lung Neoplasms/genetics , Neoplasms, Basal Cell/metabolism , Adenocarcinoma/mortality , Adenocarcinoma of Lung , Aged , Biopsy , Cell Differentiation , Cohort Studies , Epithelial Cells/pathology , ErbB Receptors/genetics , Female , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Humans , Lung Neoplasms/mortality , Male , Middle Aged , Phenotype , Principal Component Analysis , Proportional Hazards Models , Transcriptome , ras Proteins/genetics
7.
Nucleic Acids Res ; 39(Database issue): D92-7, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21097880

ABSTRACT

MPromDb (Mammalian Promoter Database) is a curated database that strives to annotate gene promoters identified from ChIP-seq results with the goal of providing an integrated resource for mammalian transcriptional regulation and epigenetics. We analyzed 507 million uniquely aligned RNAP-II ChIP-seq reads from 26 different data sets that include six human cell-types and 10 distinct mouse cell/tissues. The updated MPromDb version consists of computationally predicted (novel) and known active RNAP-II promoters (42,893 human and 48,366 mouse promoters) from various data sets freely available at NCBI GEO database. We found that 36% and 40% of protein-coding genes have alternative promoters in human and mouse genomes and ∼40% of promoters are tissue/cell specific. The identified RNAP-II promoters were annotated using various known and novel gene models. Additionally, for novel promoters we looked into other evidences-GenBank mRNAs, spliced ESTs, CAGE promoter tags and mRNA-seq reads. Users can search the database based on gene id/symbol, or by specific tissue/cell type and filter results based on any combination of tissue/cell specificity, Known/Novel, CpG/NonCpG, and protein-coding/non-coding gene promoters. We have also integrated GBrowse genome browser with MPromDb for visualization of ChIP-seq profiles and to display the annotations. The current release of MPromDb can be accessed at http://bioinformatics.wistar.upenn.edu/MPromDb/.


Subject(s)
Chromatin Immunoprecipitation , Databases, Nucleic Acid , Promoter Regions, Genetic , Animals , Computer Graphics , Humans , Mice , RNA Polymerase II/metabolism , Sequence Analysis, DNA , Systems Integration
8.
Nucleic Acids Res ; 39(1): 190-201, 2011 Jan.
Article in English | MEDLINE | ID: mdl-20843783

ABSTRACT

Alternative promoters that are differentially used in various cellular contexts and tissue types add to the transcriptional complexity in mammalian genome. Identification of alternative promoters and the annotation of their activity in different tissues is one of the major challenges in understanding the transcriptional regulation of the mammalian genes and their isoforms. To determine the use of alternative promoters in different tissues, we performed ChIP-seq experiments using antibody against RNA Pol-II, in five adult mouse tissues (brain, liver, lung, spleen and kidney). Our analysis identified 38 639 Pol-II promoters, including 12 270 novel promoters, for both protein coding and non-coding mouse genes. Of these, 6384 promoters are tissue specific which are CpG poor and we find that only 34% of the novel promoters are located in CpG-rich regions, suggesting that novel promoters are mostly tissue specific. By identifying the Pol-II bound promoter(s) of each annotated gene in a given tissue, we found that 37% of the protein coding genes use alternative promoters in the five mouse tissues. The promoter annotations and ChIP-seq data presented here will aid ongoing efforts of characterizing gene regulatory regions in mammalian genomes.


Subject(s)
Promoter Regions, Genetic , RNA Polymerase II/metabolism , Animals , Chromatin Immunoprecipitation , Chromosome Mapping , Genome , Mice , Sequence Analysis, DNA/standards , Transcription, Genetic
10.
Mol Cell Biol ; 27(5): 1859-67, 2007 Mar.
Article in English | MEDLINE | ID: mdl-17194750

ABSTRACT

Recent research has identified critical roles for microRNAs in a large number of cellular processes, including tumorigenic transformation. While significant progress has been made towards understanding the mechanisms of gene regulation by microRNAs, much less is known about factors affecting the expression of these noncoding transcripts. Here, we demonstrate for the first time a functional link between hypoxia, a well-documented tumor microenvironment factor, and microRNA expression. Microarray-based expression profiles revealed that a specific spectrum of microRNAs (including miR-23, -24, -26, -27, -103, -107, -181, -210, and -213) is induced in response to low oxygen, at least some via a hypoxia-inducible-factor-dependent mechanism. Select members of this group (miR-26, -107, and -210) decrease proapoptotic signaling in a hypoxic environment, suggesting an impact of these transcripts on tumor formation. Interestingly, the vast majority of hypoxia-induced microRNAs are also overexpressed in a variety of human tumors.


Subject(s)
Cell Hypoxia , MicroRNAs/metabolism , Apoptosis , Breast Neoplasms/pathology , Cell Line, Tumor , Chromatin Immunoprecipitation , Colonic Neoplasms/pathology , Female , Gene Expression Profiling , Genes, Reporter , HCT116 Cells , HT29 Cells , Humans , Luciferases/metabolism , Oligonucleotide Array Sequence Analysis , Plasmids , Reverse Transcriptase Polymerase Chain Reaction , Transfection
11.
Front Plant Sci ; 11: 604938, 2020.
Article in English | MEDLINE | ID: mdl-33584748

ABSTRACT

Understanding the genetics of field-based tolerance to high iron-associated (HIA) stress in rice can accelerate the development of new varieties with enhanced yield performance in West African lowland ecosystems. To date, few field-based studies have been undertaken to rigorously evaluate rice yield performance under HIA stress conditions. In this study, two NERICA × O. sativa bi-parental rice populations and one O.sativa diversity panel consisting of 296 rice accessions were evaluated for grain yield and leaf bronzing symptoms over multiple years in four West African HIA stress and control sites. Mapping of these traits identified a large number of QTLs and single nucleotide polymorphisms (SNPs) associated with stress tolerance in the field. Favorable alleles associated with tolerance to high levels of iron in anaerobic rice soils were rare and almost exclusively derived from the indica subpopulation, including the most favorable alleles identified in NERICA varieties. These findings highlight the complex genetic architecture underlying rice response to HIA stress and suggest that a recurrent selection program focusing on an expanded indica genepool could be productively used in combination with genomic selection to increase the efficiency of selection in breeding programs designed to enhance tolerance to this prevalent abiotic stress in West Africa.

12.
Carcinogenesis ; 29(7): 1459-65, 2008 Jul.
Article in English | MEDLINE | ID: mdl-18499701

ABSTRACT

Several studies have reported that a high expression ratio of HOXB13 to IL17BR predicts tumor recurrence in node-negative, estrogen receptor (ER) alpha-positive breast cancer patients treated with tamoxifen. The molecular mechanisms underlying this dysregulation of gene expression remain to be explored. Our epigenetic analysis has found that increased promoter methylation of one of these genes, HOXB13, correlate with the decreased expression of its transcript in breast cancer cell lines (P < 0.005). Transcriptional silencing of this gene can be reversed by a demethylation treatment. HOXB13 is suppressed by the activation of estrogen signaling in ERalpha-positive breast cancer cells. However, treatment with 4-hydroxytamoxifen (4-OHT), an antiestrogen, abrogates the ERalpha-mediated suppression in cancer cells. The notion that this transcriptional induction of HOXB13 occurs in vitro with simultaneous exposure to both estrogen and 4-OHT may provide a biological explanation for its aberrant expression in many node-negative patients undergoing tamoxifen therapy. Interestingly, promoter hypermethylation of HOXB13 is more frequently observed in ERalpha-positive patients with increased lymph node metastasis (P = 0.031) and large tumor sizes (>5 cm) (P = 0.008). In addition, this aberrant epigenetic event is associated with shorter disease-free survival (P = 0.029) in cancer patients. These results suggest that hypermethylation of HOXB13 is a late event of breast tumorigenesis and a poor prognostic indicator of node-positive cancer patients.


Subject(s)
Breast Neoplasms/genetics , DNA Methylation , Estradiol/pharmacology , Gene Expression Regulation, Neoplastic , Homeodomain Proteins/genetics , Breast Neoplasms/metabolism , Cell Line, Tumor , CpG Islands , Estrogen Receptor alpha/biosynthesis , Estrogens/metabolism , Female , Gene Silencing , Homeodomain Proteins/biosynthesis , Humans , Promoter Regions, Genetic , Signal Transduction , Tamoxifen/analogs & derivatives , Tamoxifen/pharmacology
13.
Nat Commun ; 9(1): 3519, 2018 08 29.
Article in English | MEDLINE | ID: mdl-30158584

ABSTRACT

As sequencing and genotyping technologies evolve, crop genetics researchers accumulate increasing numbers of genomic data sets from various genotyping platforms on different germplasm panels. Imputation is an effective approach to increase marker density of existing data sets toward the goal of integrating resources for downstream applications. While a number of imputation software packages are available, the limitations to utilization for the rice community include high computational demand and lack of a reference panel. To address these challenges, we develop the Rice Imputation Server, a publicly available web application leveraging genetic information from a globally diverse rice reference panel assembled here. This resource allows researchers to benefit from increased marker density without needing to perform imputation on their own machines. We demonstrate improvements that imputed data provide to rice genome-wide association (GWA) results of grain amylose content and show that the major functional nucleotide polymorphism is tagged only in the imputed data set.


Subject(s)
Oryza/genetics , Gene Frequency/genetics , Genome-Wide Association Study , Genotype , Polymorphism, Single Nucleotide/genetics
14.
BMC Bioinformatics ; 7: 114, 2006 Mar 07.
Article in English | MEDLINE | ID: mdl-16522199

ABSTRACT

BACKGROUND: The canonical core promoter elements consist of the TATA box, initiator (Inr), downstream core promoter element (DPE), TFIIB recognition element (BRE) and the newly-discovered motif 10 element (MTE). The motifs for these core promoter elements are highly degenerate, which tends to lead to a high false discovery rate when attempting to detect them in promoter sequences. RESULTS: In this study, we have performed the first analysis of these core promoter elements in orthologous mouse and human promoters with experimentally-supported transcription start sites. We have identified these various elements using a combination of positional weight matrices (PWMs) and the degree of conservation of orthologous mouse and human sequences--a procedure that significantly reduces the false positive rate of motif discovery. Our analysis of 9,010 orthologous mouse-human promoter pairs revealed two combinations of three-way synergistic effects, TATA-Inr-MTE and BRE-Inr-MTE. The former has previously been putatively identified in human, but the latter represents a novel synergistic relationship. CONCLUSION: Our results demonstrate that DNA sequence conservation can greatly improve the identification of functional core promoter elements in the human genome. The data also underscores the importance of synergistic occurrence of two or more core promoter elements. Furthermore, the sequence data and results presented here can help build better computational models for predicting the transcription start sites in the promoter regions, which remains one of the most challenging problems.


Subject(s)
Chromosome Mapping/methods , Conserved Sequence/genetics , Evolution, Molecular , Promoter Regions, Genetic/genetics , Sequence Analysis, DNA/methods , Animals , Base Sequence , Genome, Human , Humans , Mice , Molecular Sequence Data , Sequence Alignment , Sequence Homology, Nucleic Acid
15.
Nat Commun ; 7: 10532, 2016 Feb 04.
Article in English | MEDLINE | ID: mdl-26842267

ABSTRACT

Increasing food production is essential to meet the demands of a growing human population, with its rising income levels and nutritional expectations. To address the demand, plant breeders seek new sources of genetic variation to enhance the productivity, sustainability and resilience of crop varieties. Here we launch a high-resolution, open-access research platform to facilitate genome-wide association mapping in rice, a staple food crop. The platform provides an immortal collection of diverse germplasm, a high-density single-nucleotide polymorphism data set tailored for gene discovery, well-documented analytical strategies, and a suite of bioinformatics resources to facilitate biological interpretation. Using grain length, we demonstrate the power and resolution of our new high-density rice array, the accompanying genotypic data set, and an expanded diversity panel for detecting major and minor effect QTLs and subpopulation-specific alleles, with immediate implications for rice improvement.


Subject(s)
Access to Information , Chromosome Mapping , Databases, Genetic , Edible Grain/genetics , Genome, Plant/genetics , Oryza/genetics , Quantitative Trait Loci/genetics , Agriculture , Alleles , Computational Biology , Edible Grain/anatomy & histology , Epistasis, Genetic , Genetic Variation , Genome-Wide Association Study , Genotype , Phenotype , Plant Breeding , Polymorphism, Single Nucleotide , Principal Component Analysis
16.
PLoS One ; 9(2): e88051, 2014.
Article in English | MEDLINE | ID: mdl-24498427

ABSTRACT

Genome-wide association studies (GWAS) and candidate gene studies have identified a number of risk loci associated with the smoking-related disease COPD, a disorder that originates in the airway epithelium. Since airway basal cell (BC) stem/progenitor cells exhibit the earliest abnormalities associated with smoking (hyperplasia, squamous metaplasia), we hypothesized that smoker BC have a dysregulated transcriptome, enriched, in part, at known GWAS/candidate gene loci. Massive parallel RNA sequencing was used to compare the transcriptome of BC purified from the airway epithelium of healthy nonsmokers (n = 10) and healthy smokers (n = 7). The chromosomal location of the differentially expressed genes was compared to loci identified by GWAS to confer risk for COPD. Smoker BC have 676 genes differentially expressed compared to nonsmoker BC, dominated by smoking up-regulation. Strikingly, 166 (25%) of these genes are located on chromosome 19, with 13 localized to 19q13.2 (p<10⁻4 compared to chance), including 4 genes (NFKBIB, LTBP4, EGLN2 and TGFB1) associated with risk for COPD. These observations provide the first direct connection between known genetic risks for smoking-related lung disease and airway BC, the population of lung cells that undergo the earliest changes associated with smoking.


Subject(s)
Biomarkers/metabolism , Chromosomes, Human, Pair 19/genetics , Endothelial Cells/metabolism , Gene Expression Profiling , Genetic Loci , Pulmonary Disease, Chronic Obstructive/genetics , Respiratory Mucosa/metabolism , Smoking/adverse effects , Adult , DNA Copy Number Variations/genetics , DNA Methylation , Genome-Wide Association Study , Humans , Oligonucleotide Array Sequence Analysis , Pulmonary Disease, Chronic Obstructive/chemically induced , Pulmonary Disease, Chronic Obstructive/pathology , Respiratory Mucosa/drug effects , Respiratory Mucosa/pathology , Smoking/genetics
17.
BMC Syst Biol ; 3: 73, 2009 Jul 17.
Article in English | MEDLINE | ID: mdl-19615063

ABSTRACT

BACKGROUND: The TGF-beta/SMAD pathway is part of a broader signaling network in which crosstalk between pathways occurs. While the molecular mechanisms of TGF-beta/SMAD signaling pathway have been studied in detail, the global networks downstream of SMAD remain largely unknown. The regulatory effect of SMAD complex likely depends on transcriptional modules, in which the SMAD binding elements and partner transcription factor binding sites (SMAD modules) are present in specific context. RESULTS: To address this question and develop a computational model for SMAD modules, we simultaneously performed chromatin immunoprecipitation followed by microarray analysis (ChIP-chip) and mRNA expression profiling to identify TGF-beta/SMAD regulated and synchronously coexpressed gene sets in ovarian surface epithelium. Intersecting the ChIP-chip and gene expression data yielded 150 direct targets, of which 141 were grouped into 3 co-expressed gene sets (sustained up-regulated, transient up-regulated and down-regulated), based on their temporal changes in expression after TGF-beta activation. We developed a data-mining method driven by the Random Forest algorithm to model SMAD transcriptional modules in the target sequences. The predicted SMAD modules contain SMAD binding element and up to 2 of 7 other transcription factor binding sites (E2F, P53, LEF1, ELK1, COUPTF, PAX4 and DR1). CONCLUSION: Together, the computational results further the understanding of the interactions between SMAD and other transcription factors at specific target promoters, and provide the basis for more targeted experimental verification of the co-regulatory modules.


Subject(s)
Chromatin/metabolism , Smad Proteins/metabolism , Algorithms , Animals , Base Sequence , Cell Line , Gene Expression Profiling , Genome , Humans , Immunoprecipitation , Mice , Models, Biological , Oligonucleotide Array Sequence Analysis , Promoter Regions, Genetic/genetics , RNA, Messenger/genetics , Rats , Reproducibility of Results , Sequence Analysis, DNA , Smad Proteins/genetics , Transforming Growth Factor beta/genetics , Transforming Growth Factor beta/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL