RESUMEN
Ancient human migrations led to the settlement of population groups in varied environmental contexts worldwide. The extent to which adaptation to local environments has shaped human genetic diversity is a longstanding question in human evolution. Recent studies have suggested that introgression of archaic alleles in the genome of modern humans may have contributed to adaptation to environmental pressures such as pathogen exposure. Functional genomic studies have demonstrated that variation in gene expression across individuals and in response to environmental perturbations is a main mechanism underlying complex trait variation. We considered gene expression response to in vitro treatments as a molecular phenotype to identify genes and regulatory variants that may have played an important role in adaptations to local environments. We investigated if Neanderthal introgression in the human genome may contribute to the transcriptional response to environmental perturbations. To this end we used eQTLs for genes differentially expressed in a panel of 52 cellular environments, resulting from 5 cell types and 26 treatments, including hormones, vitamins, drugs, and environmental contaminants. We found that SNPs with introgressed Neanderthal alleles (N-SNPs) disrupt binding of transcription factors important for environmental responses, including ionizing radiation and hypoxia, and for glucose metabolism. We identified an enrichment for N-SNPs among eQTLs for genes differentially expressed in response to 8 treatments, including glucocorticoids, caffeine, and vitamin D. Using Massively Parallel Reporter Assays (MPRA) data, we validated the regulatory function of 21 introgressed Neanderthal variants in the human genome, corresponding to 8 eQTLs regulating 15 genes that respond to environmental perturbations. These findings expand the set of environments where archaic introgression may have contributed to adaptations to local environments in modern humans and provide experimental validation for the regulatory function of introgressed variants.
Asunto(s)
Exposición a Riesgos Ambientales , Genoma Humano , Hombre de Neandertal/genética , Adaptación Fisiológica/genética , Alelos , Animales , Regulación de la Expresión Génica , Migración Humana , Humanos , Polimorfismo de Nucleótido Simple , Unión Proteica , Sitios de Carácter Cuantitativo , Factores de Transcripción/metabolismoRESUMEN
Environmental perturbations have large effects on both organismal and cellular traits, including gene expression, but the extent to which the environment affects RNA processing remains largely uncharacterized. Recent studies have identified a large number of genetic variants associated with variation in RNA processing that also have an important role in complex traits; yet we do not know in which contexts the different underlying isoforms are used. Here, we comprehensively characterized changes in RNA processing events across 89 environments in five human cell types and identified 15,300 event shifts (FDR = 15%) comprised of eight event types in over 4,000 genes. Many of these changes occur consistently in the same direction across conditions, indicative of global regulation by trans factors. Accordingly, we demonstrate that environmental modulation of splicing factor binding predicts shifts in intron retention, and that binding of transcription factors predicts shifts in alternative first exon (AFE) usage in response to specific treatments. We validated the mechanism hypothesized for AFE in two independent datasets. Using ATAC-seq, we found altered binding of 64 factors in response to selenium at sites of AFE shift, including ELF2 and other factors in the ETS family. We also performed AFE QTL mapping in 373 individuals and found an enrichment for SNPs predicted to disrupt binding of the ELF2 factor. Together, these results demonstrate that RNA processing is dramatically changed in response to environmental perturbations through specific mechanisms regulated by trans factors.
Asunto(s)
Ambiente , Procesamiento Postranscripcional del ARN , Línea Celular , Exones , Regulación de la Expresión Génica , Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Intrones , Sitios de Carácter Cuantitativo , Alineación de Secuencia , Análisis de Secuencia de ARN , Factores de Transcripción/genética , Factores de Transcripción/metabolismoRESUMEN
Genetic variants in gene regulatory sequences can modify gene expression and mediate the molecular response to environmental stimuli. In addition, genotype-environment interactions (GxE) contribute to complex traits such as cardiovascular disease. Caffeine is the most widely consumed stimulant and is known to produce a vascular response. To investigate GxE for caffeine, we treated vascular endothelial cells with caffeine and used a massively parallel reporter assay to measure allelic effects on gene regulation for over 43,000 genetic variants. We identified 665 variants with allelic effects on gene regulation and 6 variants that regulate the gene expression response to caffeine (GxE, false discovery rate [FDR] < 5%). When overlapping our GxE results with expression quantitative trait loci colocalized with coronary artery disease and hypertension, we dissected their regulatory mechanisms and showed a modulatory role for caffeine. Our results demonstrate that massively parallel reporter assay is a powerful approach to identify and molecularly characterize GxE in the specific context of caffeine consumption.
Asunto(s)
Células Endoteliales , Interacción Gen-Ambiente , Cafeína/farmacología , Regulación de la Expresión Génica , Sitios de Carácter CuantitativoRESUMEN
Genetic effects on gene expression and splicing can be modulated by cellular and environmental factors; yet interactions between genotypes, cell type, and treatment have not been comprehensively studied together. We used an induced pluripotent stem cell system to study multiple cell types derived from the same individuals and exposed them to a large panel of treatments. Cellular responses involved different genes and pathways for gene expression and splicing and were highly variable across contexts. For thousands of genes, we identified variable allelic expression across contexts and characterized different types of gene-environment interactions, many of which are associated with complex traits. Promoter functional and evolutionary features distinguished genes with elevated allelic imbalance mean and variance. On average, half of the genes with dynamic regulatory interactions were missed by large eQTL mapping studies, indicating the importance of exploring multiple treatments to reveal previously unrecognized regulatory loci that may be important for disease.
The activity of the genes in a cell depends on the type of cell they are in, the interactions with other genes, the environment and genetics. Active genes produce a greater number of mRNA molecules, which act as messenger molecules to instruct the cell to produce proteins. The amount of mRNA molecules in cells can be measured to assess the levels of gene activity. Genes produce mRNAs through a process called transcription, and the collection of all the mRNA molecules in a cell is called the transcriptome. Cells obtained from human samples can be grown in the lab under different conditions, and this can be used to transform them into different types of cells. These cells can then be exposed to different treatments such as specific chemicals to understand how the environment affects them. Cells derived from different people may respond differently to the same treatment based on their unique genetics. Exposing different types of cells from many people to different treatments can help explain how genetics, the environment and cell type affect gene activity. Findley et al. grew three different types of cells from six different people in the lab. The cells were exposed to 28 different treatments, which reflect different environmental changes. Studying all these different factors together allowed Findley et al. to understand how genetics, cell type and environment affect the activity of over 53,000 genes. Around half of the effects due to an interaction between genetics and the environment and had not been seen in other larger studies of the transcriptome. Many of these newly observed changes are in genes that have connections to different diseases, including heart disease. The results of Findley et al. provide evidence indicating to which extent lifestyle and the environment can interact with an individual's genetic makeup to impact gene activity and long-term health. The more researchers can understand these factors, the more useful they can be in helping to predict, detect and treat illnesses. The findings also show how genes and the environment interact, which may be relevant to understanding disease development. There is more work to be done to understand a wider range of environmental factors across more cell types. It will also be important to establish how this work on cells grown in the lab translates to human health.
Asunto(s)
Regulación de la Expresión Génica/genética , Células Madre Pluripotentes Inducidas/metabolismo , Linfocitos/metabolismo , Miocitos Cardíacos/metabolismo , Empalme Alternativo , Diferenciación Celular/genética , Línea Celular , Femenino , Humanos , Células Madre Pluripotentes Inducidas/citología , Linfocitos/citología , Miocitos Cardíacos/citología , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ARNRESUMEN
GWAS and eQTL studies identified thousands of genetic variants associated with complex traits and gene expression. Despite the important role of environmental exposures in complex traits, only a limited number of environmental factors were measured in these studies. Measuring molecular phenotypes in tightly controlled cellular environments provides a more tractable setting to study gene-environment interactions in the absence of other confounding variables. We performed RNA-seq and ATAC-seq in endothelial cells exposed to retinoic acid, dexamethasone, caffeine, and selenium to model genetic and environmental effects on gene regulation in the vascular endothelium-a common site of pathology in cardiovascular disease. We found that genes near regions of differentially accessible chromatin were more likely to be differentially expressed [OR = (3.41, 6.52), [Formula: see text]]. Furthermore, we confirmed that environment-specific changes in transcription factor binding are a key mechanism for cellular response to environmental stimuli. Single nucleotide polymorphisms (SNPs) in these transcription response factor footprints for dexamethasone, caffeine, and retinoic acid were enriched in GTEx eQTLs from artery tissues, indicating that these environmental conditions are latently present in GTEx samples. Additionally, SNPs in footprints for response factors in caffeine are enriched in colocalized eQTLs for coronary artery disease (CAD), suggesting a role for caffeine in CAD risk. By combining GWAS, eQTLs, and response genes, we annotated environmental components that can increase or decrease disease risk through changes in gene expression in 43 genes. Interestingly, each treatment may amplify or buffer genetic risk for CAD, depending on the particular SNP or gene considered.
Asunto(s)
Enfermedad de la Arteria Coronaria/genética , Interacción Gen-Ambiente , Predisposición Genética a la Enfermedad , Sitios de Carácter Cuantitativo/genética , Cafeína/farmacología , Células Endoteliales/efectos de los fármacos , Regulación de la Expresión Génica/efectos de los fármacos , Humanos , Fenotipo , RNA-Seq , Factores de Riesgo , Selenio/farmacología , Tretinoina/farmacologíaRESUMEN
Variation in gut microbiome is associated with wellness and disease in humans, and yet the molecular mechanisms by which this variation affects the host are not well understood. A likely mechanism is that of changing gene regulation in interfacing host epithelial cells. Here, we treated colonic epithelial cells with live microbiota from five healthy individuals and quantified induced changes in transcriptional regulation and chromatin accessibility in host cells. We identified over 5,000 host genes that change expression, including 588 distinct associations between specific taxa and host genes. The taxa with the strongest influence on gene expression alter the response of genes associated with complex traits. Using ATAC-seq, we showed that a subset of these changes in gene expression are associated with changes in host chromatin accessibility and transcription factor binding induced by exposure to gut microbiota. We then created a manipulated microbial community with titrated doses of Collinsella, demonstrating that manipulation of the composition of the microbiome under both natural and controlled conditions leads to distinct and predictable gene expression profiles in host cells. Taken together, our results suggest that specific microbes play an important role in regulating expression of individual host genes involved in human complex traits. The ability to fine-tune the expression of host genes by manipulating the microbiome suggests future therapeutic routes.IMPORTANCE The composition of the gut microbiome has been associated with various aspects of human health, but the mechanism of this interaction is still unclear. We utilized a cellular system to characterize the effect of the microbiome on human gene expression. We showed that some of these changes in expression may be mediated by changes in chromatin accessibility. Furthermore, we validate the role of a specific microbe and show that changes in its abundance can modify the host gene expression response. These results show an important role of gut microbiota in regulating host gene expression and suggest that manipulation of microbiome composition could be useful in future therapies.
RESUMEN
Single-stranded DNA (ssDNA) recombineering is a technology which is used to make subtle changes in the chromosome of several bacterial genera. Cells which express a single-stranded DNA binding protein (RecT or Bet) are transformed with an oligonucleotide which is incorporated via an annealing and replication-dependent mechanism. By in silico analysis we identified ssDNA binding protein homologs in the genus Lactobacillus and Lactococcus lactis. To assess whether we could further improve the recombineering efficiency in Lactobacillus reuteri ATCC PTA 6475 we expressed several RecT homologs in this strain. RecT derived from Enterococcus faecalis CRMEN 19 yielded comparable efficiencies compared with a native RecT protein, but none of the other proteins further increased the recombineering efficiency. We successfully improved recombineering efficiency 10-fold in L. lactis by increasing oligonucleotide concentration combined with the use of oligonucleotides containing phosphorothioate-linkages (PTOs). Surprisingly, neither increased oligonucleotide concentration nor PTO linkages enhanced recombineering in L. reuteri 6475. To emphasize the utility of this technology in improving probiotic features we modified six bases in a transcriptional regulatory element region of the pdu-operon of L. reuteri 6475, yielding a 3-fold increase in the production of the antimicrobial compound reuterin. Directed genetic modification of lactic acid bacteria through ssDNA recombineering will simplify strain improvement in a way that, when mutating a single base, is genetically indistinguishable from strains obtained through directed evolution.