RESUMEN
While Mediator plays a key role in eukaryotic transcription, little is known about its mechanism of action. This study combines CRISPR-Cas9 genetic screens, degron assays, Hi-C, and cryoelectron microscopy (cryo-EM) to dissect the function and structure of mammalian Mediator (mMED). Deletion analyses in B, T, and embryonic stem cells (ESC) identified a core of essential subunits required for Pol II recruitment genome-wide. Conversely, loss of non-essential subunits mostly affects promoters linked to multiple enhancers. Contrary to current models, however, mMED and Pol II are dispensable to physically tether regulatory DNA, a topological activity requiring architectural proteins. Cryo-EM analysis revealed a conserved core, with non-essential subunits increasing structural complexity of the tail module, a primary transcription factor target. Changes in tail structure markedly increase Pol II and kinase module interactions. We propose that Mediator's structural pliability enables it to integrate and transmit regulatory signals and act as a functional, rather than an architectural bridge, between promoters and enhancers.
Asunto(s)
Complejo Mediador/metabolismo , ARN Polimerasa II/metabolismo , Animales , Linfocitos T CD4-Positivos/citología , Linfocitos T CD4-Positivos/metabolismo , Sistemas CRISPR-Cas/genética , Proteínas de Ciclo Celular/metabolismo , Células Cultivadas , Proteínas Cromosómicas no Histona/metabolismo , Microscopía por Crioelectrón , Elementos de Facilitación Genéticos , Edición Génica , Humanos , Masculino , Complejo Mediador/química , Complejo Mediador/genética , Ratones , Ratones Endogámicos C57BL , Células Madre Embrionarias de Ratones/citología , Células Madre Embrionarias de Ratones/metabolismo , Regiones Promotoras Genéticas , Estructura Cuaternaria de Proteína , ARN Polimerasa II/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , CohesinasRESUMEN
RATIONALE: Mutations in the transcription factor TBX20 (T-box 20) are associated with congenital heart disease. Germline ablation of Tbx20 results in abnormal heart development and embryonic lethality by embryonic day 9.5. Because Tbx20 is expressed in multiple cell lineages required for myocardial development, including pharyngeal endoderm, cardiogenic mesoderm, endocardium, and myocardium, the cell type-specific requirement for TBX20 in early myocardial development remains to be explored. OBJECTIVE: Here, we investigated roles of TBX20 in midgestation cardiomyocytes for heart development. METHODS AND RESULTS: Ablation of Tbx20 from developing cardiomyocytes using a doxycycline inducible cTnTCre transgene led to embryonic lethality. The circumference of developing ventricular and atrial chambers, and in particular that of prospective left atrium, was significantly reduced in Tbx20 conditional knockout mutants. Cell cycle analysis demonstrated reduced proliferation of Tbx20 mutant cardiomyocytes and their arrest at the G1-S phase transition. Genome-wide transcriptome analysis of mutant cardiomyocytes revealed differential expression of multiple genes critical for cell cycle regulation. Moreover, atrial and ventricular gene programs seemed to be aberrantly regulated. Putative direct TBX20 targets were identified using TBX20 ChIP-Seq (chromatin immunoprecipitation with high throughput sequencing) from embryonic heart and included key cell cycle genes and atrial and ventricular specific genes. Notably, TBX20 bound a conserved enhancer for a gene key to atrial development and identity, COUP-TFII/Nr2f2 (chicken ovalbumin upstream promoter transcription factor 2/nuclear receptor subfamily 2, group F, member 2). This enhancer interacted with the NR2F2 promoter in human cardiomyocytes and conferred atrial specific gene expression in a transgenic mouse in a TBX20-dependent manner. CONCLUSIONS: Myocardial TBX20 directly regulates a subset of genes required for fetal cardiomyocyte proliferation, including those required for the G1-S transition. TBX20 also directly downregulates progenitor-specific genes and, in addition to regulating genes that specify chamber versus nonchamber myocardium, directly activates genes required for establishment or maintenance of atrial and ventricular identity. TBX20 plays a previously unappreciated key role in atrial development through direct regulation of an evolutionarily conserved COUPT-FII enhancer.
Asunto(s)
Regulación del Desarrollo de la Expresión Génica , Atrios Cardíacos/embriología , Miocitos Cardíacos/metabolismo , Proteínas de Dominio T Box/genética , Animales , Línea Celular , Proliferación Celular , Células Cultivadas , Fase G1 , Atrios Cardíacos/citología , Atrios Cardíacos/metabolismo , Ratones , Ratones Endogámicos C57BL , Mutación , Miocitos Cardíacos/citología , Miocitos Cardíacos/fisiología , Fase S , Proteínas de Dominio T Box/metabolismoRESUMEN
Genome-wide association studies (GWAS) have reproducibly associated variants within introns of FTO with increased risk for obesity and type 2 diabetes (T2D). Although the molecular mechanisms linking these noncoding variants with obesity are not immediately obvious, subsequent studies in mice demonstrated that FTO expression levels influence body mass and composition phenotypes. However, no direct connection between the obesity-associated variants and FTO expression or function has been made. Here we show that the obesity-associated noncoding sequences within FTO are functionally connected, at megabase distances, with the homeobox gene IRX3. The obesity-associated FTO region directly interacts with the promoters of IRX3 as well as FTO in the human, mouse and zebrafish genomes. Furthermore, long-range enhancers within this region recapitulate aspects of IRX3 expression, suggesting that the obesity-associated interval belongs to the regulatory landscape of IRX3. Consistent with this, obesity-associated single nucleotide polymorphisms are associated with expression of IRX3, but not FTO, in human brains. A direct link between IRX3 expression and regulation of body mass and composition is demonstrated by a reduction in body weight of 25 to 30% in Irx3-deficient mice, primarily through the loss of fat mass and increase in basal metabolic rate with browning of white adipose tissue. Finally, hypothalamic expression of a dominant-negative form of Irx3 reproduces the metabolic phenotypes of Irx3-deficient mice. Our data suggest that IRX3 is a functional long-range target of obesity-associated variants within FTO and represents a novel determinant of body mass and composition.
Asunto(s)
Proteínas de Homeodominio/genética , Intrones/genética , Oxigenasas de Función Mixta/genética , Obesidad/genética , Oxo-Ácido-Liasas/genética , Proteínas/genética , Factores de Transcripción/genética , Tejido Adiposo/metabolismo , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato , Animales , Metabolismo Basal/genética , Índice de Masa Corporal , Peso Corporal/genética , Encéfalo/metabolismo , Diabetes Mellitus Tipo 2/genética , Dieta , Genes Dominantes/genética , Proteínas de Homeodominio/metabolismo , Humanos , Hipotálamo/metabolismo , Masculino , Ratones , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Regiones Promotoras Genéticas/genética , Delgadez/genética , Factores de Transcripción/deficiencia , Factores de Transcripción/metabolismo , Pez Cebra/embriología , Pez Cebra/genéticaRESUMEN
The ongoing requirement in adult heart for transcription factors with key roles in cardiac development is not well understood. We recently demonstrated that TBX20, a transcriptional regulator required for cardiac development, has key roles in the maintenance of functional and structural phenotypes in adult mouse heart. Conditional ablation of Tbx20 in adult cardiomyocytes leads to a rapid onset and progression of heart failure, with prominent conduction and contractility phenotypes that lead to death. Here we describe a more comprehensive molecular characterization of the functions of TBX20 in adult mouse heart. Coupling genome-wide chromatin immunoprecipitation and transcriptome analyses (RNA-Seq), we identified a subset of genes that change expression in Tbx20 adult cardiomyocyte-specific knockout hearts which are direct downstream targets of TBX20. This analysis revealed a dual role for TBX20 as both a transcriptional activator and a repressor, and that each of these functions regulates genes with very specialized and distinct molecular roles. We also show how TBX20 binds to its targets genome-wide in a context-dependent manner, using various cohorts of co-factors to either promote or repress distinct genetic programs within adult heart. Our integrative approach has uncovered several novel aspects of TBX20 and T-box protein function within adult heart. Sequencing data accession number (http://www.ncbi.nlm.nih.gov/geo): GSE30943.
Asunto(s)
Miocardio/metabolismo , Miocitos Cardíacos/metabolismo , Proteínas de Dominio T Box/genética , Proteínas de Dominio T Box/metabolismo , Factores de Transcripción/genética , Animales , Cardiomiopatías/genética , Cardiomiopatías/patología , Cardiomiopatías/fisiopatología , Inmunoprecipitación de Cromatina , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Ratones , Ratones Noqueados , Contracción Miocárdica , Factores de Transcripción/metabolismoRESUMEN
Vascular homeostasis and pathophysiology are tightly regulated by mechanical forces generated by hemodynamics. Vascular disorders such as atherosclerotic diseases largely occur at curvatures and bifurcations where disturbed blood flow activates endothelial cells while unidirectional flow at the straight part of vessels promotes endothelial health. Integrated analysis of the endothelial transcriptome, the 3D epigenome, and human genetics systematically identified the SNP-enriched cistrome in vascular endothelium subjected to well-defined atherosclerosis-prone disturbed flow or atherosclerosis-protective unidirectional flow. Our results characterized the endothelial typical- and super-enhancers and underscored the critical regulatory role of flow-sensitive endothelial super-enhancers. CRISPR interference and activation validated the function of a previously unrecognized unidirectional flow-induced super-enhancer that upregulates antioxidant genes NQO1, CYB5B, and WWP2, and a disturbed flow-induced super-enhancer in endothelium which drives prothrombotic genes EDN1 and HIVEP in vascular endothelium. Our results employing multiomics identify the cis-regulatory architecture of the flow-sensitive endothelial epigenome related to atherosclerosis and highlight the regulatory role of super-enhancers in mechanotransduction mechanisms.
Asunto(s)
Aterosclerosis , Células Endoteliales , Mecanotransducción Celular , Humanos , Aterosclerosis/genética , Endotelio VascularRESUMEN
The various organogenic programs deployed during embryonic development rely on the precise expression of a multitude of genes in time and space. Identifying the cis-regulatory elements responsible for this tightly orchestrated regulation of gene expression is an essential step in understanding the genetic pathways involved in development. We describe a strategy to systematically identify tissue-specific cis-regulatory elements that share combinations of sequence motifs. Using heart development as an experimental framework, we employed a combination of Gibbs sampling and linear regression to build a classifier that identifies heart enhancers based on the presence and/or absence of various sequence features, including known and putative transcription factor (TF) binding specificities. In distinguishing heart enhancers from a large pool of random noncoding sequences, the performance of our classifier is vastly superior to four commonly used methods, with an accuracy reaching 92% in cross-validation. Furthermore, most of the binding specificities learned by our method resemble the specificities of TFs widely recognized as key players in heart development and differentiation, such as SRF, MEF2, ETS1, SMAD, and GATA. Using our classifier as a predictor, a genome-wide scan identified over 40,000 novel human heart enhancers. Although the classifier used no gene expression information, these novel enhancers are strongly associated with genes expressed in the heart. Finally, in vivo tests of our predictions in mouse and zebrafish achieved a validation rate of 62%, significantly higher than what is expected by chance. These results support the existence of underlying cis-regulatory codes dictating tissue-specific transcription in mammalian genomes and validate our enhancer classifier strategy as a method to uncover these regulatory codes.
Asunto(s)
Genoma , Corazón/embriología , Secuencias de Aminoácidos/genética , Animales , Secuencia de Bases , Femenino , Humanos , Mamíferos/genética , Ratones/embriología , Embarazo , Unión Proteica/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Reproducibilidad de los ResultadosRESUMEN
Obesity-associated morbidity is exacerbated by abdominal obesity, which can be measured as the waist-to-hip ratio adjusted for the body mass index (WHRadjBMI). Here we identify genes associated with obesity and WHRadjBMI and characterize allele-sensitive enhancers that are predicted to regulate WHRadjBMI genes in women. We found that several waist-to-hip ratio-associated variants map within primate-specific Alu retrotransposons harboring a DNA motif associated with adipocyte differentiation. This suggests that a genetic component of adipose distribution in humans may involve co-option of retrotransposons as adipose enhancers. We evaluated the role of the strongest female WHRadjBMI-associated gene, SNX10, in adipose biology. We determined that it is required for human adipocyte differentiation and function and participates in diet-induced adipose expansion in female mice, but not males. Our data identify genes and regulatory mechanisms that underlie female-specific adipose distribution and mediate metabolic dysfunction in women.
Asunto(s)
Obesidad , Retroelementos , Humanos , Femenino , Animales , Ratones , Obesidad/genética , Obesidad/metabolismo , Adiposidad/genética , Índice de Masa Corporal , Relación Cintura-Cadera , Tejido Adiposo/metabolismo , Nexinas de Clasificación/genética , Nexinas de Clasificación/metabolismoRESUMEN
The mechanisms that underlie the timing of labor in humans are largely unknown. In most pregnancies, labor is initiated at term (≥ 37 weeks gestation), but in a signifiicant number of women spontaneous labor occurs preterm and is associated with increased perinatal mortality and morbidity. The objective of this study was to characterize the cells at the maternal-fetal interface (MFI) in term and preterm pregnancies in both the laboring and non-laboring state in Black women, who have among the highest preterm birth rates in the U.S. Using mass cytometry to obtain high-dimensional single-cell resolution, we identified 31 cell populations at the MFI, including 25 immune cell types and six non-immune cell types. Among the immune cells, maternal PD1+ CD8 T cell subsets were less abundant in term laboring compared to term non-laboring women. Among the non-immune cells, PD-L1+ maternal (stromal) and fetal (extravillous trophoblast) cells were less abundant in preterm laboring compared to term laboring women. Consistent with these observations, the expression of CD274, the gene encoding PD-L1, was significantly depressed and less responsive to fetal signaling molecules in cultured mesenchymal stromal cells from the decidua of preterm compared to term women. Overall, these results suggest that the PD1/PD-L1 pathway at the MFI may perturb the delicate balance between immune tolerance and rejection and contribute to the onset of spontaneous preterm labor.
Asunto(s)
Trabajo de Parto , Trabajo de Parto Prematuro , Nacimiento Prematuro , Embarazo , Humanos , Femenino , Recién Nacido , Antígeno B7-H1/genética , Trabajo de Parto Prematuro/metabolismo , Subgrupos de Linfocitos TRESUMEN
In Tibetans, noncoding alleles in EPAS1-whose protein product hypoxia-inducible factor 2α (HIF-2α) drives the response to hypoxia-carry strong signatures of positive selection; however, their functional mechanism has not been systematically examined. Here, we report that high-altitude alleles disrupt the activity of four EPAS1 enhancers in one or more cell types. We further characterize one enhancer (ENH5) whose activity is both allele specific and hypoxia dependent. Deletion of ENH5 results in down-regulation of EPAS1 and HIF-2α targets in acute hypoxia and in a blunting of the transcriptional response to sustained hypoxia. Deletion of ENH5 in mice results in dysregulation of gene expression across multiple tissues. We propose that pleiotropic adaptive effects of the Tibetan alleles in EPAS1 underlie the strong selective signal at this gene.
RESUMEN
The potency of the immune response has still to be harnessed effectively to combat human cancers. However, the discovery of T-cell targets in melanomas and other tumors has raised the possibility that cancer vaccines can be used to induce a therapeutically effective immune response against cancer. The targets, cancer-testis (CT) antigens, are immunogenic proteins preferentially expressed in normal gametogenic tissues and different histological types of tumors. Therapeutic cancer vaccines directed against CT antigens are currently in late-stage clinical trials testing whether they can delay or prevent recurrence of lung cancer and melanoma following surgical removal of primary tumors. CT antigens constitute a large, but ill-defined, family of proteins that exhibit a remarkably restricted expression. Currently, there is a considerable amount of information about these proteins, but the data are scattered through the literature and in several bioinformatic databases. The database presented here, CTdatabase (http://www.cta.lncc.br), unifies this knowledge to facilitate both the mining of the existing deluge of data, and the identification of proteins alleged to be CT antigens, but that do not have their characteristic restricted expression pattern. CTdatabase is more than a repository of CT antigen data, since all the available information was carefully curated and annotated with most data being specifically processed for CT antigens and stored locally. Starting from a compilation of known CT antigens, CTdatabase provides basic information including gene names and aliases, RefSeq accession numbers, genomic location, known splicing variants, gene duplications and additional family members. Gene expression at the mRNA level in normal and tumor tissues has been collated from publicly available data obtained by several different technologies. Manually curated data related to mRNA and protein expression, and antigen-specific immune responses in cancer patients are also available, together with links to PubMed for relevant CT antigen articles.
Asunto(s)
Antígenos de Neoplasias/metabolismo , Bases de Datos de Proteínas , Proteínas de Neoplasias/metabolismo , Testículo/metabolismo , Antígenos de Neoplasias/genética , Antígenos de Neoplasias/inmunología , Etiquetas de Secuencia Expresada , Humanos , Inmunidad , Masculino , Proteínas de Neoplasias/genética , Proteínas de Neoplasias/inmunología , Reacción en Cadena de la Polimerasa , PubMed , ARN Mensajero/metabolismoRESUMEN
Various human diseases and pregnancy-related disorders reflect endometrial dysfunction. However, rodent models do not share fundamental biological processes with the human endometrium, such as spontaneous decidualization, and no existing human cell cultures recapitulate the cyclic interactions between endometrial stromal and epithelial compartments necessary for decidualization and implantation. Here we report a protocol differentiating human pluripotent stem cells into endometrial stromal fibroblasts (PSC-ESFs) that are highly pure and able to decidualize. Coculture of PSC-ESFs with placenta-derived endometrial epithelial cells generated organoids used to examine stromal-epithelial interactions. Cocultures exhibited specific endometrial markers in the appropriate compartments, organization with cell polarity, and hormone responsiveness of both cell types. Furthermore, cocultures recapitulate a central feature of the human decidua by cyclically responding to hormone withdrawal followed by hormone retreatment. This advance enables mechanistic studies of the cyclic responses that characterize the human endometrium.
Asunto(s)
Técnicas de Cocultivo/métodos , Decidua/metabolismo , Endometrio/metabolismo , Fibroblastos/metabolismo , Células Madre Pluripotentes/metabolismo , Células del Estroma/metabolismo , Femenino , HumanosRESUMEN
Human trophoblast stem cells (hTSC) can be isolated from first trimester placenta but not from term placenta. Here we demonstrate that villous cytotrophoblasts (vCTB) from term placenta can be reprogrammed into induced trophoblastic stem-like cells (iTSC) by introducing sets of transcription factors. The iTSCs express TSC markers such as GATA3, TEAD4 and ELF5, and are multipotent, validated by their differentiation into both extravillous trophoblasts (EVT) and syncytiotrophoblasts (STB) in vitro and in vivo. The iTSC can be passaged indefinitely in vitro without slowing of growth. The transcriptome profile of these cells closely resembles the profile of hTSC isolated from first trimester placentae but different from the term placental vCTB from which they originated. The ability to reprogram cells from term placenta into iTSC will allow study of early gestation events which impact placental function later in gestation, including preeclampsia and spontaneous preterm birth.
Asunto(s)
Nacimiento Prematuro , Trofoblastos , Diferenciación Celular , Proteínas de Unión al ADN , Femenino , Humanos , Recién Nacido , Proteínas Musculares , Placenta , Embarazo , Células Madre , Factores de Transcripción de Dominio TEA , Factores de Transcripción/genéticaRESUMEN
Genome-wide association studies (GWAS) have identified many disease-associated variants, yet mechanisms underlying these associations remain unclear. To understand obesity-associated variants, we generate gene regulatory annotations in adipocytes and hypothalamic neurons across cellular differentiation stages. We then test variants in 97 obesity-associated loci using a massively parallel reporter assay and identify putatively causal variants that display cell type specific or cross-tissue enhancer-modulating properties. Integrating these variants with gene regulatory information suggests genes that underlie obesity GWAS associations. We also investigate a complex genomic interval on 16p11.2 where two independent loci exhibit megabase-range, cross-locus chromatin interactions. We demonstrate that variants within these two loci regulate a shared gene set. Together, our data support a model where GWAS loci contain variants that alter enhancer activity across tissues, potentially with temporally restricted effects, to impact the expression of multiple genes. This complex model has broad implications for ongoing efforts to understand GWAS.
Asunto(s)
Adipocitos/fisiología , Elementos de Facilitación Genéticos , Pleiotropía Genética , Obesidad/genética , Adipocitos/citología , Arritmias Cardíacas/genética , Arritmias Cardíacas/patología , Enfermedades Genéticas Ligadas al Cromosoma X/genética , Enfermedades Genéticas Ligadas al Cromosoma X/patología , Estudio de Asociación del Genoma Completo , Gigantismo/genética , Gigantismo/patología , Cardiopatías Congénitas/genética , Cardiopatías Congénitas/patología , Humanos , Hipotálamo/fisiología , Discapacidad Intelectual/genética , Discapacidad Intelectual/patología , MAP Quinasa Quinasa 5/genética , Neuronas/citología , Neuronas/fisiología , Polimorfismo de Nucleótido Simple , Proteínas Quinasas/genética , Sitios de Carácter Cuantitativo , ATPasas Transportadoras de Calcio del Retículo Sarcoplásmico/genética , Factores de Transcripción/genética , TranscriptomaRESUMEN
Whereas coding variants often have pleiotropic effects across multiple tissues, noncoding variants are thought to mediate their phenotypic effects by specific tissue and temporal regulation of gene expression. Here, we investigated the genetic and functional architecture of a genomic region within the FTO gene that is strongly associated with obesity risk. We show that multiple variants on a common haplotype modify the regulatory properties of several enhancers targeting IRX3 and IRX5 from megabase distances. We demonstrate that these enhancers affect gene expression in multiple tissues, including adipose and brain, and impart regulatory effects during a restricted temporal window. Our data indicate that the genetic architecture of disease-associated loci may involve extensive pleiotropy, allelic heterogeneity, shared allelic effects across tissues, and temporally restricted effects.
Asunto(s)
Tejido Adiposo/metabolismo , Encéfalo/metabolismo , Proteínas de Homeodominio/genética , Obesidad/genética , Factores de Transcripción/genética , Alelos , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato/genética , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato/metabolismo , Animales , Encéfalo/embriología , Línea Celular , Cromatina/química , Cromatina/metabolismo , Desarrollo Embrionario , Elementos de Facilitación Genéticos , Conducta Alimentaria , Preferencias Alimentarias , Regulación de la Expresión Génica , Haplotipos , Proteínas de Homeodominio/metabolismo , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Neuronas/metabolismo , Obesidad/fisiopatología , Polimorfismo de Nucleótido Simple , Factores de Transcripción/metabolismoRESUMEN
Genome-wide association studies (GWAS) have implicated the IL33 locus in asthma, but the underlying mechanisms remain unclear. Here, we identify a 5 kb region within the GWAS-defined segment that acts as an enhancer-blocking element in vivo and in vitro. Chromatin conformation capture showed that this 5 kb region loops to the IL33 promoter, potentially regulating its expression. We show that the asthma-associated single nucleotide polymorphism (SNP) rs1888909, located within the 5 kb region, is associated with IL33 gene expression in human airway epithelial cells and IL-33 protein expression in human plasma, potentially through differential binding of OCT-1 (POU2F1) to the asthma-risk allele. Our data demonstrate that asthma-associated variants at the IL33 locus mediate allele-specific regulatory activity and IL33 expression, providing a mechanism through which a regulatory SNP contributes to genetic risk of asthma.
Asunto(s)
Asma/genética , Elementos de Facilitación Genéticos , Interleucina-33/genética , Alelos , Animales , Asma/metabolismo , Cromatina/genética , Cromatina/metabolismo , Femenino , Predisposición Genética a la Enfermedad , Humanos , Interleucina-33/metabolismo , Masculino , Ratones Transgénicos , Factor 1 de Transcripción de Unión a Octámeros/genética , Factor 1 de Transcripción de Unión a Octámeros/metabolismo , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas , Pez CebraRESUMEN
There is a life-long relationship between rhinovirus (RV) infection and the development and clinical manifestations of asthma. In this study we demonstrate that cultured primary bronchial epithelial cells from adults with asthma (n = 9) show different transcriptional and chromatin responses to RV infection compared to those without asthma (n = 9). Both the number and magnitude of transcriptional and chromatin responses to RV were muted in cells from asthma cases compared to controls. Pathway analysis of the transcriptionally responsive genes revealed enrichments of apoptotic pathways in controls but inflammatory pathways in asthma cases. Using promoter capture Hi-C we tethered regions of RV-responsive chromatin to RV-responsive genes and showed enrichment of these regions and genes at asthma GWAS loci. Taken together, our studies indicate a delayed or prolonged inflammatory state in cells from asthma cases and highlight genes that may contribute to genetic risk for asthma.
Asunto(s)
Asma/metabolismo , Cromatina/metabolismo , Células Epiteliales/fisiología , Mucosa Respiratoria/citología , Rhinovirus/fisiología , Adulto , Asma/genética , Células Cultivadas , Humanos , Transcripción GenéticaRESUMEN
While a genetic component of preterm birth (PTB) has long been recognized and recently mapped by genome-wide association studies (GWASs), the molecular determinants underlying PTB remain elusive. This stems in part from an incomplete availability of functional genomic annotations in human cell types relevant to pregnancy and PTB. We generated transcriptome (RNA-seq), epigenome (ChIP-seq of H3K27ac, H3K4me1, and H3K4me3 histone modifications), open chromatin (ATAC-seq), and chromatin interaction (promoter capture Hi-C) annotations of cultured primary decidua-derived mesenchymal stromal/stem cells and in vitro differentiated decidual stromal cells and developed a computational framework to integrate these functional annotations with results from a GWAS of gestational duration in 56,384 women. Using these resources, we uncovered additional loci associated with gestational duration and target genes of associated loci. Our strategy illustrates how functional annotations in pregnancy-relevant cell types aid in the experimental follow-up of GWAS for PTB and, likely, other pregnancy-related conditions.
Asunto(s)
Nacimiento Prematuro , Transcriptoma , Cromatina/genética , Cromatina/metabolismo , Decidua , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Recién Nacido , Masculino , Embarazo , Nacimiento Prematuro/genética , Nacimiento Prematuro/metabolismo , Células del EstromaRESUMEN
Over 500 genetic loci have been associated with risk of cardiovascular diseases (CVDs); however, most loci are located in gene-distal non-coding regions and their target genes are not known. Here, we generated high-resolution promoter capture Hi-C (PCHi-C) maps in human induced pluripotent stem cells (iPSCs) and iPSC-derived cardiomyocytes (CMs) to provide a resource for identifying and prioritizing the functional targets of CVD associations. We validate these maps by demonstrating that promoters preferentially contact distal sequences enriched for tissue-specific transcription factor motifs and are enriched for chromatin marks that correlate with dynamic changes in gene expression. Using the CM PCHi-C map, we linked 1999 CVD-associated SNPs to 347 target genes. Remarkably, more than 90% of SNP-target gene interactions did not involve the nearest gene, while 40% of SNPs interacted with at least two genes, demonstrating the importance of considering long-range chromatin interactions when interpreting functional targets of disease loci.
Asunto(s)
Enfermedades Cardiovasculares/genética , Genoma Humano , Células Madre Pluripotentes Inducidas/metabolismo , Miocitos Cardíacos/metabolismo , Regiones Promotoras Genéticas , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Sitios Genéticos , Estudio de Asociación del Genoma Completo , Genómica , Humanos , Células Madre Pluripotentes Inducidas/citología , Miocitos Cardíacos/citología , Polimorfismo de Nucleótido Simple , Elementos Reguladores de la TranscripciónRESUMEN
BACKGROUND: One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR) is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR. RESULTS: TTo understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs) and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites. CONCLUSION: Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.
Asunto(s)
Empalme Alternativo/genética , Intrones/genética , Animales , Secuencia de Bases , ADN Complementario , Elementos de Facilitación Genéticos , Humanos , Ratones , Sitios de Empalme de ARNRESUMEN
ATAC-seq is a high-throughput sequencing technique that identifies open chromatin. Depending on the cell type, ATAC-seq samples may contain ~20-80% of mitochondrial sequencing reads. As the regions of open chromatin of interest are usually located in the nuclear genome, mitochondrial reads are typically discarded from the analysis. We tested two approaches to decrease wasted sequencing in ATAC-seq libraries generated from lymphoblastoid cell lines: targeted cleavage of mitochondrial DNA fragments using CRISPR technology and removal of detergent from the cell lysis buffer. We analyzed the effects of these treatments on the number of usable (unique, non-mitochondrial) reads and the number and quality of peaks called, including peaks identified in enhancers and transcription start sites. Both treatments resulted in considerable reduction of mitochondrial reads (1.7 and 3-fold, respectively). The removal of detergent, however, resulted in increased background and fewer peaks. The highest number of peaks and highest quality data was obtained by preparing samples with the original ATAC-seq protocol (using detergent) and treating them with CRISPR. This strategy reduced the amount of sequencing required to call a high number of peaks, which could lead to cost reduction when performing ATAC-seq on large numbers of samples and in cell types that contain a large amount of mitochondria.