RESUMEN
Identification of somatic driver mutations in the noncoding genome remains challenging. To comprehensively characterize noncoding driver mutations for pancreatic ductal adenocarcinoma (PDAC), we first created genome-scale maps of accessible chromatin regions (ACRs) and histone modification marks (HMMs) in pancreatic cell lines and purified pancreatic acinar and duct cells. Integration with whole-genome mutation calls from 506 PDACs revealed 314 ACRs/HMMs significantly enriched with 3,614 noncoding somatic mutations (NCSMs). Functional assessment using massively parallel reporter assays (MPRA) identified 178 NCSMs impacting reporter activity (19.45% of those tested). Focused luciferase validation confirmed negative effects on gene regulatory activity for NCSMs near CDKN2A and ZFP36L2. For the latter, CRISPR interference (CRISPRi) further identified ZFP36L2 as a target gene (16.0 - 24.0% reduced expression, P = 0.023-0.0047) with disrupted KLF9 binding likely mediating the effect. Our integrative approach provides a catalog of potentially functional noncoding driver mutations and nominates ZFP36L2 as a PDAC driver gene.
RESUMEN
AIMS/HYPOTHESIS: Genome-wide association studies (GWAS) have identified hundreds of type 2 diabetes loci, with the vast majority of signals located in non-coding regions; as a consequence, it remains largely unclear which 'effector' genes these variants influence. Determining these effector genes has been hampered by the relatively challenging cellular settings in which they are hypothesised to confer their effects. METHODS: To implicate such effector genes, we elected to generate and integrate high-resolution promoter-focused Capture-C, assay for transposase-accessible chromatin with sequencing (ATAC-seq) and RNA-seq datasets to characterise chromatin and expression profiles in multiple cell lines relevant to type 2 diabetes for subsequent functional follow-up analyses: EndoC-BH1 (pancreatic beta cell), HepG2 (hepatocyte) and Simpson-Golabi-Behmel syndrome (SGBS; adipocyte). RESULTS: The subsequent variant-to-gene analysis implicated 810 candidate effector genes at 370 type 2 diabetes risk loci. Using partitioned linkage disequilibrium score regression, we observed enrichment for type 2 diabetes and fasting glucose GWAS loci in promoter-connected putative cis-regulatory elements in EndoC-BH1 cells as well as fasting insulin GWAS loci in SGBS cells. Moreover, as a proof of principle, when we knocked down expression of the SMCO4 gene in EndoC-BH1 cells, we observed a statistically significant increase in insulin secretion. CONCLUSIONS/INTERPRETATION: These results provide a resource for comparing tissue-specific data in tractable cellular models as opposed to relatively challenging primary cell settings. DATA AVAILABILITY: Raw and processed next-generation sequencing data for EndoC-BH1, HepG2, SGBS_undiff and SGBS_diff cells are deposited in GEO under the Superseries accession GSE262484. Promoter-focused Capture-C data are deposited under accession GSE262496. Hi-C data are deposited under accession GSE262481. Bulk ATAC-seq data are deposited under accession GSE262479. Bulk RNA-seq data are deposited under accession GSE262480.
RESUMEN
Genome-wide association studies (GWAS) have identified hundreds of genetic signals associated with autoimmune disease. The majority of these signals are located in non-coding regions and likely impact cis-regulatory elements (cRE). Because cRE function is dynamic across cell types and states, profiling the epigenetic status of cRE across physiological processes is necessary to characterize the molecular mechanisms by which autoimmune variants contribute to disease risk. We localized risk variants from 15 autoimmune GWAS to cRE active during TCR-CD28 co-stimulation of naïve human CD4+ T cells. To characterize how dynamic changes in gene expression correlate with cRE activity, we measured transcript levels, chromatin accessibility, and promoter-cRE contacts across three phases of naive CD4+ T cell activation using RNA-seq, ATAC-seq, and HiC. We identified ~1200 protein-coding genes physically connected to accessible disease-associated variants at 423 GWAS signals, at least one-third of which are dynamically regulated by activation. From these maps, we functionally validated a novel stretch of evolutionarily conserved intergenic enhancers whose activity is required for activation-induced IL2 gene expression in human and mouse, and is influenced by autoimmune-associated genetic variation. The set of genes implicated by this approach are enriched for genes controlling CD4+ T cell function and genes involved in human inborn errors of immunity, and we pharmacologically validated eight implicated genes as novel regulators of T cell activation. These studies directly show how autoimmune variants and the genes they regulate influence processes involved in CD4+ T cell proliferation and activation.
Asunto(s)
Linfocitos T CD4-Positivos , Cromatina , Estudio de Asociación del Genoma Completo , Interleucina-2 , Activación de Linfocitos , Humanos , Linfocitos T CD4-Positivos/inmunología , Linfocitos T CD4-Positivos/metabolismo , Cromatina/metabolismo , Cromatina/genética , Activación de Linfocitos/genética , Interleucina-2/genética , Interleucina-2/metabolismo , Animales , Ratones , Elementos de Facilitación Genéticos/genética , Enfermedades Autoinmunes/genética , Enfermedades Autoinmunes/inmunología , Regulación de la Expresión Génica , Autoinmunidad/genéticaRESUMEN
Assessing inter-muscular coordination in older adults is crucial, as it directly impacts an individual's ability for independent functioning, injury prevention, and active engagement in daily activities. However, the precise mechanisms by which distinct muscle fiber types synchronize their activity across muscles to generate coordinated movements in older adults remain unknown. Our objective is to investigate how distinct muscle groups dynamically synchronize with each other in young and older adults during exercise. Thirty-five young adults and nine older adults performed one bodyweight squat set until exhaustion. Simultaneous surface electromyography (sEMG) recordings were taken from the left and right vastus lateralis, and left and right erector spinae. To quantify inter-muscular coordination, we first obtained ten time series of sEMG band power for each muscle, representing the dynamics of different muscle fiber types. Next, we calculated the bivariate equal-time Pearson's cross-correlation for each pair of sEMG band power time series across all leg and back muscles. The main results show (i) an overall reduction in the degree of inter-muscular coordination, and (ii) increased stratification of the inter-muscular network in older adults compared to young adults. These findings suggest that as individuals age, the global inter-muscular network becomes less flexible and adaptable, hindering its ability to reorganize effectively in response to fatigue or other stimuli. This network approach opens new avenues for developing novel network-based markers to characterize multilevel inter-muscular interactions, which can help target functional deficits and potentially reduce the risk of falls and neuro-muscular injuries in older adults.
RESUMEN
Late-onset Alzheimer's disease (LOAD) research has principally focused on neurons over the years due to their known role in the production of amyloid beta plaques and neurofibrillary tangles. In contrast, recent genomic studies of LOAD have implicated microglia as culprits of the prolonged inflammation exacerbating the neurodegeneration observed in patient brains. Indeed, recent LOAD genome-wide association studies (GWAS) have reported multiple loci near genes related to microglial function, including TREM2, ABI3, and CR1. However, GWAS alone cannot pinpoint underlying causal variants or effector genes at such loci, as most signals reside in non-coding regions of the genome and could presumably confer their influence frequently via long-range regulatory interactions. We elected to carry out a combination of ATAC-seq and high-resolution promoter-focused Capture-C in two human microglial cell models (iPSC-derived microglia and HMC3) in order to physically map interactions between LOAD GWAS-implicated candidate causal variants and their corresponding putative effector genes. Notably, we observed consistent evidence that rs6024870 at the GWAS CASS4 locus contacted the promoter of nearby gene, RTFDC1. We subsequently observed a directionallly consistent decrease in RTFDC1 expression with the the protective minor A allele of rs6024870 via both luciferase assays in HMC3 cells and expression studies in primary human microglia. Through CRISPR-Cas9-mediated deletion of the putative regulatory region harboring rs6024870 in HMC3 cells, we observed increased pro-inflammatory cytokine secretion and decreased DNA double strand break repair related, at least in part, to RTFDC1 expression levels. Our variant-to-function approach therefore reveals that the rs6024870-harboring regulatory element at the LOAD 'CASS4' GWAS locus influences both microglial inflammatory capacity and DNA damage resolution, along with cumulative evidence implicating RTFDC1 as a novel candidate effector gene.
RESUMEN
A portion of the genetic basis for many common autoimmune disorders has been uncovered by genome-wide association studies (GWAS), but GWAS do not reveal causal variants, effector genes, or the cell types impacted by disease-associated variation. We have generated 3D genomic datasets consisting of promoter-focused Capture-C, Hi-C, ATAC-seq, and RNA-seq and integrated these data with GWAS of 16 autoimmune traits to physically map disease-associated variants to the effector genes they likely regulate in 57 human cell types. These 3D maps of gene cis-regulatory architecture are highly powered to identify the cell types most likely impacted by disease-associated genetic variation compared to 1D genomic features, and tend to implicate different effector genes than eQTL approaches in the same cell types. Most of the variants implicated by these cis-regulatory architectures are highly trait-specific, but nearly half of the target genes connected to these variants are shared across multiple autoimmune disorders in multiple cell types, suggesting a high level of genetic diversity and complexity among autoimmune diseases that nonetheless converge at the level of target gene and cell type. Substantial effector gene sharing led to the common enrichment of similar biological networks across disease and cell types. However, trait-specific pathways representing potential areas for disease-specific intervention were identified. To test this, we pharmacologically validated squalene synthase, a cholesterol biosynthetic enzyme encoded by the FDFT1 gene implicated by our approach in MS and SLE, as a novel immunomodulatory drug target controlling inflammatory cytokine production by human T cells. These data represent a comprehensive resource for basic discovery of gene cis-regulatory mechanisms, and the analyses reported reveal mechanisms by which autoimmune-associated variants act to regulate gene expression, function, and pathology across multiple, distinct tissues and cell types.
RESUMEN
Recent genome-wide association studies (GWAS) have revealed shared genetic components among alcohol, opioid, tobacco and cannabis use disorders. However, the extent of the underlying shared causal variants and effector genes, along with their cellular context, remain unclear. We leveraged our existing 3D genomic datasets comprising high-resolution promoter-focused Capture-C/Hi-C, ATAC-seq and RNA-seq across >50 diverse human cell types to focus on genomic regions that coincide with GWAS loci. Using stratified LD regression, we determined the proportion of genomewide SNP heritability attributable to the features assayed across our cell types by integrating recent GWAS summary statistics for the relevant traits: alcohol use disorder (AUD), tobacco use disorder (TUD), opioid use disorder (OUD) and cannabis use disorder (CanUD). Statistically significant enrichments (P<0.05) were observed in 14 specific cell types, with heritability reaching 9.2-fold for iPSC-derived cortical neurons and neural progenitors, confirming that they are crucial cell types for further functional exploration. Additionally, several pancreatic cell types, notably pancreatic beta cells, showed enrichment for TUD, with heritability enrichments up to 4.8-fold, suggesting genomic overlap with metabolic processes. Further investigation revealed significant positive genetic correlations between T2D with both TUD and CanUD (FDR<0.05) and a significant negative genetic correlation with AUD. Interestingly, after partitioning the heritability for each cell type's cis-regulatory elements, the correlation between T2D and TUD for pancreatic beta cells was greater (r=0.2) than the global genetic correlation value. Our study provides new genomic insights into substance use disorders and implicates cell types where functional follow-up studies could reveal causal variant-gene mechanisms underpinning these disorders.
RESUMEN
BACKGROUND: Autoimmune cytopenias (AICs) regularly occur in profoundly IgG-deficient patients with common variable immunodeficiency (CVID). The isotypes, antigenic targets, and origin(s) of their disease-causing autoantibodies are unclear. OBJECTIVE: We sought to determine reactivity, clonality, and provenance of AIC-associated IgM autoantibodies in patients with CVID. METHODS: We used glycan arrays, patient erythrocytes, and platelets to determine targets of CVID IgM autoantibodies. Glycan-binding profiles were used to identify autoreactive clones across B-cell subsets, specifically circulating marginal zone (MZ) B cells, for sorting and IGH sequencing. The locations, transcriptomes, and responses of tonsillar MZ B cells to different TH- cell subsets were determined by confocal microscopy, RNA-sequencing, and cocultures, respectively. RESULTS: Autoreactive IgM coated erythrocytes and platelets from many CVID patients with AICs (CVID+AIC). On glycan arrays, CVID+AIC plasma IgM narrowly recognized erythrocytic i antigens and platelet i-related antigens and failed to bind hundreds of pathogen- and tumor-associated carbohydrates. Polyclonal i antigen-recognizing B-cell receptors were highly enriched among CVID+AIC circulating MZ B cells. Within tonsillar tissues, MZ B cells secreted copious IgM when activated by the combination of IL-10 and IL-21 or when cultured with IL-10/IL-21-secreting FOXP3-CD25hi T follicular helper (Tfh) cells. In lymph nodes from immunocompetent controls, MZ B cells, plentiful FOXP3+ regulatory T cells, and rare FOXP3-CD25+ cells that represented likely CD25hi Tfh cells all localized outside of germinal centers. In CVID+AIC lymph nodes, cellular positions were similar but CD25hi Tfh cells greatly outnumbered regulatory cells. CONCLUSIONS: Our findings indicate that glycan-reactive IgM autoantibodies produced outside of germinal centers may contribute to the autoimmune pathogenesis of CVID.
Asunto(s)
Autoanticuerpos , Plaquetas , Inmunodeficiencia Variable Común , Eritrocitos , Inmunoglobulina M , Polisacáridos , Humanos , Inmunoglobulina M/inmunología , Inmunoglobulina M/sangre , Eritrocitos/inmunología , Inmunodeficiencia Variable Común/inmunología , Polisacáridos/inmunología , Plaquetas/inmunología , Autoanticuerpos/inmunología , Autoanticuerpos/sangre , Masculino , Femenino , Subgrupos de Linfocitos B/inmunología , AdultoRESUMEN
The ch12q13 locus is among the most significant childhood obesity loci identified in genome-wide association studies. This locus resides in a non-coding region within FAIM2; thus, the underlying causal variant(s) presumably influence disease susceptibility via cis-regulation. We implicated rs7132908 as a putative causal variant by leveraging our in-house 3D genomic data and public domain datasets. Using a luciferase reporter assay, we observed allele-specific cis-regulatory activity of the immediate region harboring rs7132908. We generated isogenic human embryonic stem cell lines homozygous for either rs7132908 allele to assess changes in gene expression and chromatin accessibility throughout a differentiation to hypothalamic neurons, a key cell type known to regulate feeding behavior. The rs7132908 obesity risk allele influenced expression of FAIM2 and other genes and decreased the proportion of neurons produced by differentiation. We have functionally validated rs7132908 as a causal obesity variant that temporally regulates nearby effector genes and influences neurodevelopment and survival.
Asunto(s)
Regiones no Traducidas 3' , Proteínas Reguladoras de la Apoptosis , Proteínas de la Membrana , Obesidad Infantil , Niño , Humanos , Regiones no Traducidas 3'/genética , Alelos , Diferenciación Celular/genética , Cromosomas Humanos Par 12/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Células Madre Embrionarias Humanas/metabolismo , Neuronas/metabolismo , Obesidad Infantil/genética , Polimorfismo de Nucleótido Simple/genética , Proteínas de la Membrana/genética , Proteínas Reguladoras de la Apoptosis/genéticaRESUMEN
Although genome-wide association studies (GWAS) have identified loci for sleep-related traits, they do not directly uncover the underlying causal variants and corresponding effector genes. The majority of such variants reside in non-coding regions and are therefore presumed to impact cis-regulatory elements. Our previously reported 'variant-to-gene mapping' effort in human induced pluripotent stem cell (iPSC)-derived neural progenitor cells (NPCs), combined with validation in both Drosophila and zebrafish, implicated phosphatidyl inositol glycan (PIG)-Q as a functionally relevant gene at the insomnia "WDR90" GWAS locus. However, importantly that effort did not characterize the corresponding underlying causal variant. Specifically, our previous 3D genomic datasets nominated a shortlist of three neighboring single nucleotide polymorphisms (SNPs) in strong linkage disequilibrium within an intronic enhancer region of WDR90 that contacted the open PIG-Q promoter. We sought to investigate the influence of these SNPs collectively and then individually on PIG-Q modulation to pinpoint the causal "regulatory" variant. Starting with gross level perturbation, deletion of the entire region in NPCs via CRISPR-Cas9 editing and subsequent RNA sequencing revealed expression changes in specific PIG-Q transcripts. Results from individual luciferase reporter assays for each SNP in iPSCs revealed that the region with the rs3752495 risk allele (RA) induced a ~2.5-fold increase in luciferase expression. Importantly, rs3752495 also exhibited an allele-specific effect, with the RA increasing the luciferase expression by ~2-fold versus the non-RA. In conclusion, our variant-to-function approach and in vitro validation implicate rs3752495 as a causal insomnia variant embedded within WDR90 while modulating the expression of the distally located PIG-Q.
Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Trastornos del Inicio y del Mantenimiento del Sueño , Polimorfismo de Nucleótido Simple/genética , Humanos , Trastornos del Inicio y del Mantenimiento del Sueño/genética , Células Madre Pluripotentes Inducidas , Animales , Células-Madre Neurales , Desequilibrio de Ligamiento/genéticaRESUMEN
Ikaros is a transcriptional factor required for conventional T cell development, differentiation, and anergy. While the related factors Helios and Eos have defined roles in regulatory T cells (Treg), a role for Ikaros has not been established. To determine the function of Ikaros in the Treg lineage, we generated mice with Treg-specific deletion of the Ikaros gene (Ikzf1). We find that Ikaros cooperates with Foxp3 to establish a major portion of the Treg epigenome and transcriptome. Ikaros-deficient Treg exhibit Th1-like gene expression with abnormal production of IL-2, IFNg, TNFa, and factors involved in Wnt and Notch signaling. While Ikzf1-Treg-cko mice do not develop spontaneous autoimmunity, Ikaros-deficient Treg are unable to control conventional T cell-mediated immune pathology in response to TCR and inflammatory stimuli in models of IBD and organ transplantation. These studies establish Ikaros as a core factor required in Treg for tolerance and the control of inflammatory immune responses.
Asunto(s)
Factores de Transcripción Forkhead , Regulación de la Expresión Génica , Factor de Transcripción Ikaros , Linfocitos T Reguladores , Animales , Factor de Transcripción Ikaros/metabolismo , Factor de Transcripción Ikaros/genética , Linfocitos T Reguladores/inmunología , Linfocitos T Reguladores/metabolismo , Factores de Transcripción Forkhead/metabolismo , Factores de Transcripción Forkhead/genética , Ratones , Ratones NoqueadosRESUMEN
Genome wide association study (GWAS)-implicated bone mineral density (BMD) signals have been shown to localize in cis-regulatory regions of distal effector genes using 3D genomic methods. Detailed characterization of such genes can reveal novel causal genes for BMD determination. Here, we elected to characterize the "DNM3" locus on chr1q24, where the long non-coding RNA DNM3OS and the embedded microRNA MIR199A2 (miR-199a-5p) are implicated as effector genes contacted by the region harboring variation in linkage disequilibrium with BMD-associated sentinel single nucleotide polymorphism, rs12041600. During osteoblast differentiation of human mesenchymal stem/progenitor cells (hMSC), miR-199a-5p expression was temporally decreased and correlated with the induction of osteoblastic transcription factors RUNX2 and Osterix. Functional relevance of miR-199a-5p downregulation in osteoblastogenesis was investigated by introducing miR-199a-5p mimic into hMSC. Cells overexpressing miR-199a-5p depicted a cobblestone-like morphological change and failed to produce BMP2-dependent extracellular matrix mineralization. Mechanistically, a miR-199a-5p mimic modified hMSC propagated normal SMAD1/5/9 signaling and expressed osteoblastic transcription factors RUNX2 and Osterix but depicted pronounced upregulation of SOX9 and enhanced expression of essential chondrogenic genes ACAN, COMP, and COL10A1. Mineralization defects, morphological changes, and enhanced chondrogenic gene expression associated with miR-199a-5p mimic over-expression were restored with miR-199a-5p inhibitor suggesting specificity of miR-199a-5p in chondrogenic fate specification. The expression of both the DNM3OS and miR-199a-5p temporally increased and correlated with hMSC chondrogenic differentiation. Although miR-199a-5p overexpression failed to further enhance chondrogenesis, blocking miR-199a-5p activity significantly reduced chondrogenic pellet size, extracellular matrix deposition, and chondrogenic gene expression. Taken together, our results indicate that oscillating miR-199a-5p levels dictate hMSC osteoblast or chondrocyte terminal fate. Our study highlights a functional role of miR-199a-5p as a BMD effector gene at the DNM3 BMD GWAS locus, where patients with cis-regulatory genetic variation which increases miR-199a-5p expression could lead to reduced osteoblast activity.
RESUMEN
Over 1,100 independent signals have been identified with genome-wide association studies (GWAS) for bone mineral density (BMD), a key risk factor for mortality-increasing fragility fractures; however, the effector gene(s) for most remain unknown. Informed by a variant-to-gene mapping strategy implicating 89 non-coding elements predicted to regulate osteoblast gene expression at BMD GWAS loci, we executed a single-cell CRISPRi screen in human fetal osteoblast 1.19 cells (hFOBs). The BMD relevance of hFOBs was supported by heritability enrichment from cross-cell type stratified LD-score regression involving 98 cell types grouped into 15 tissues. 24 genes showed perturbation in the screen, with four (ARID5B, CC2D1B, EIF4G2, and NCOA3) exhibiting consistent effects upon siRNA knockdown on three measures of osteoblast maturation and mineralization. Lastly, additional heritability enrichments, genetic correlations, and multi-trait fine-mapping revealed that many BMD GWAS signals are pleiotropic and likely mediate their effects via non-bone tissues that warrant attention in future screens.
RESUMEN
Microbial community structure and function were assessed in the organic and upper mineral soil across a ~4000-year dune-based chronosequence at Big Bay, New Zealand, where total P declined and the proportional contribution of organic soil in the profile increased with time. We hypothesized that the organic and mineral soils would show divergent community evolution over time with a greater dependency on the functionality of phosphatase genes in the organic soil layer as it developed. The structure of bacterial, fungal, and phosphatase-harbouring communities was examined in both horizons across 3 dunes using amplicon sequencing, network analysis, and qPCR. The soils showed a decline in pH and total phosphorus (P) over time with an increase in phosphatase activity. The organic horizon had a wider diversity of Class A (phoN/phoC) and phoD-harbouring communities and a more complex microbiome, with hub taxa that correlated with P. Bacterial diversity declined in both horizons over time, with enrichment of Planctomycetes and Acidobacteria. More complex fungal communities were evident in the youngest dune, transitioning to a dominance of Ascomycota in both soil horizons. Higher phosphatase activity in older dunes was driven by less diverse P-mineralizing communities, especially in the organic horizon.
Asunto(s)
Microbiota , Suelo , Suelo/química , Fósforo/análisis , Bosque Lluvioso , Bacterias/genética , Microbiota/genética , Minerales , Monoéster Fosfórico Hidrolasas/genética , Microbiología del SueloRESUMEN
BACKGROUND: Carpal tunnel syndrome (CTS) is a common disorder caused by compression of the median nerve in the wrist, resulting in pain and numbness throughout the hand and forearm. While multiple behavioural and physiological factors influence CTS risk, a growing body of evidence supports a strong genetic contribution. Recent genome-wide association study (GWAS) efforts have reported 53 independent signals associated with CTS. While GWAS can identify genetic loci conferring risk, it does not determine which cell types drive the genetic aetiology of the trait, which variants are "causal" at a given signal, and which effector genes correspond to these non-coding variants. These obstacles limit interpretation of potential disease mechanisms. METHODS: We analysed CTS GWAS findings in the context of chromatin conformation between gene promoters and accessible chromatin regions across cellular models of bone, skeletal muscle, adipocytes and neurons. We identified proxy variants in high LD with the lead CTS sentinel SNPs residing in promoter connected open chromatin in the skeletal muscle and bone contexts. FINDINGS: We detected significant enrichment for heritability in skeletal muscle myotubes, as well as a weaker correlation in human mesenchymal stem cell-derived osteoblasts. In myotubes, our approach implicated 117 genes contacting 60 proxy variants corresponding to 20 of the 53 GWAS signals. In the osteoblast context we implicated 30 genes contacting 24 proxy variants coinciding with 12 signals, of which 19 genes shared. We subsequently prioritized BZW2 as a candidate effector gene in CTS and implicated it as novel gene that perturbs myocyte differentiation in vitro. INTERPRETATION: Taken together our results suggest that the CTS genetic component influences the size, integrity, and organization of multiple tissues surrounding the carpal tunnel, in particular muscle and bone, to predispose the nerve to being compressed in this disease setting. FUNDING: This work was supported by NIH Grant UM1 DK126194 (SFAG and WY), R01AG072705 (SFAG & KDH) and the Center for Spatial and Functional Genomics at CHOP (SFAG & ADW). SFAG is supported by the Daniel B. Burke Endowed Chair for Diabetes Research. WY is supported by the Perelman School of Medicine of the University of Pennsylvania.
Asunto(s)
Síndrome del Túnel Carpiano , Humanos , Síndrome del Túnel Carpiano/genética , Estudio de Asociación del Genoma Completo , Músculo Esquelético , Mapeo Cromosómico , Cromatina/genética , Proteínas de Unión al ADN/genéticaRESUMEN
The prevalence of childhood obesity is increasing worldwide, along with the associated common comorbidities of type 2 diabetes and cardiovascular disease in later life. Motivated by evidence for a strong genetic component, our prior genome-wide association study (GWAS) efforts for childhood obesity revealed 19 independent signals for the trait; however, the mechanism of action of these loci remains to be elucidated. To molecularly characterize these childhood obesity loci we sought to determine the underlying causal variants and the corresponding effector genes within diverse cellular contexts. Integrating childhood obesity GWAS summary statistics with our existing 3D genomic datasets for 57 human cell types, consisting of high-resolution promoter-focused Capture-C/Hi-C, ATAC-seq, and RNA-seq, we applied stratified LD score regression and calculated the proportion of genome-wide SNP heritability attributable to cell type-specific features, revealing pancreatic alpha cell enrichment as the most statistically significant. Subsequent chromatin contact-based fine-mapping was carried out for genome-wide significant childhood obesity loci and their linkage disequilibrium proxies to implicate effector genes, yielded the most abundant number of candidate variants and target genes at the BDNF, ADCY3, TMEM18 and FTO loci in skeletal muscle myotubes and the pancreatic beta-cell line, EndoC-BH1. One novel implicated effector gene, ALKAL2 - an inflammation-responsive gene in nerve nociceptors - was observed at the key TMEM18 locus across multiple immune cell types. Interestingly, this observation was also supported through colocalization analysis using expression quantitative trait loci (eQTL) derived from the Genotype-Tissue Expression (GTEx) dataset, supporting an inflammatory and neurologic component to the pathogenesis of childhood obesity. Our comprehensive appraisal of 3D genomic datasets generated in a myriad of different cell types provides genomic insights into pediatric obesity pathogenesis.
RESUMEN
The ch12q13 obesity locus is among the most significant childhood obesity loci identified in genome-wide association studies. This locus resides in a non-coding region within FAIM2; thus, the underlying causal variant(s) presumably influence disease susceptibility via an influence on cis-regulation within the genomic region. We implicated rs7132908 as a putative causal variant at this locus leveraging a combination of our inhouse 3D genomic data, public domain datasets, and several computational approaches. Using a luciferase reporter assay in human primary astrocytes, we observed allele-specific cis-regulatory activity of the immediate region harboring rs7132908. Motivated by this finding, we went on to generate isogenic human embryonic stem cell lines homozygous for either rs7132908 allele with CRISPR-Cas9 homology-directed repair to assess changes in gene expression due to genotype and chromatin accessibility throughout a differentiation to hypothalamic neurons, a key cell type known to regulate feeding behavior. We observed that the rs7132908 obesity risk allele influenced the expression of FAIM2 along with other genes, decreased the proportion of neurons produced during differentiation, up-regulated cell death gene sets, and conversely down-regulated neuron differentiation gene sets. We have therefore functionally validated rs7132908 as a causal obesity variant which temporally regulates nearby effector genes at the ch12q13 locus and influences neurodevelopment and survival.
RESUMEN
BACKGROUND AND AIMS: A key histopathological feature of inflammatory bowel disease is damage to the mucosa, including breakdown of the epithelial barrier. Human enteroids and colonoids are a critical bench-to-bedside tool for studying the epithelium in inflammatory bowel disease. The goal of the current study was to define transcriptional differences in healthy versus diseased subjects that are sustained in enteroids and colonoids, including from disease-spared tissue. METHODS: Biopsies and matching enteroid or colonoid cultures from pediatric patients with ileal Crohn disease (N = 6) and control subjects (N = 17) were subjected to RNA sequencing followed by bioinformatic and machine learning analyses. Late passage enteroids were exposed to cytokines to assess durable transcriptional differences. RESULTS: We observed substantial overlap of pathways upregulated in Crohn disease in enteroids and ileal biopsies, as well as colonoids and rectal biopsies. KEGG pathways for cytokine-cytokine receptor interaction, chemokine signaling, protein export, and Toll-like receptor signaling were upregulated in both ileal and rectal biopsies, as well as enteroids and colonoids. In vitro cytokine exposure reactivated genes previously increased in biopsies. Machine learning predicted biopsy location (100% accuracy) and donor disease status (83% accuracy). A random forest classifier generated using ileal enteroids identified rectal colonoids from ileal Crohn disease subjects with 80% accuracy. CONCLUSION: We confirmed transcriptional profiles of Crohn disease biopsies are expressed in enteroids and colonoids. Furthermore, transcriptomic data from disease-spared rectal tissue can identify patients with ileal Crohn disease. Our data support the use of patient enteroids and colonoids as critical translational tools for the study of inflammatory bowel disease.
RESUMEN
Long noncoding RNAs (lncRNA) play an important role in gene regulation and contribute to tumorigenesis. While pan-cancer studies of lncRNA expression have been performed for adult malignancies, the lncRNA landscape across pediatric cancers remains largely uncharted. Here, we curated RNA sequencing data for 1,044 pediatric leukemia and extracranial solid tumors and integrated paired tumor whole genome sequencing and epigenetic data in relevant cell line models to explore lncRNA expression, regulation, and association with cancer. A total of 2,657 lncRNAs were robustly expressed across six pediatric cancers, including 1,142 exhibiting histotype-elevated expression. DNA copy number alterations contributed to lncRNA dysregulation at a proportion comparable to protein coding genes. Application of a multidimensional framework to identify and prioritize lncRNAs impacting gene networks revealed that lncRNAs dysregulated in pediatric cancer are associated with proliferation, metabolism, and DNA damage hallmarks. Analysis of upstream regulation via cell type-specific transcription factors further implicated distinct histotype-elevated and developmental lncRNAs. Integration of these analyses prioritized lncRNAs for experimental validation, and silencing of TBX2-AS1, the top-prioritized neuroblastoma-specific lncRNA, resulted in significant growth inhibition of neuroblastoma cells, confirming the computational predictions. Taken together, these data provide a comprehensive characterization of lncRNA regulation and function in pediatric cancers and pave the way for future mechanistic studies. SIGNIFICANCE: Comprehensive characterization of lncRNAs in pediatric cancer leads to the identification of highly expressed lncRNAs across childhood cancers, annotation of lncRNAs showing histotype-specific elevated expression, and prediction of lncRNA gene regulatory networks.
Asunto(s)
Leucemia , Neuroblastoma , ARN Largo no Codificante , Adulto , Humanos , Niño , ARN Largo no Codificante/genética , ARN Largo no Codificante/metabolismo , Perfilación de la Expresión Génica , Neuroblastoma/genética , Leucemia/genética , Genómica , Redes Reguladoras de Genes , Regulación Neoplásica de la Expresión GénicaRESUMEN
Although genome wide association studies (GWAS) have been crucial for the identification of loci associated with sleep traits and disorders, the method itself does not directly uncover the underlying causal variants and corresponding effector genes. The overwhelming majority of such variants reside in non-coding regions and are therefore presumed to impact the activity of cis-regulatory elements, such as enhancers. Our previously reported 'variant-to-gene mapping' effort in human induced pluripotent stem cell (iPSC)-derived neural progenitor cells (NPCs), combined with validation in both Drosophila and zebrafish, implicated PIG-Q as a functionally relevant gene at the insomnia 'WDR90' locus. However, importantly that effort did not characterize the corresponding underlying causal variant at this GWAS signal. Specifically, our genome-wide ATAC-seq and high-resolution promoter-focused Capture C datasets generated in this cell setting brought our attention to a shortlist of three tightly neighboring single nucleotide polymorphisms (SNPs) in strong linkage disequilibrium in a candidate intronic enhancer region of WDR90 that contacted the open PIG-Q promoter. The objective of this study was to investigate the influence of the proxy SNPs collectively and then individually on PIG-Q modulation and to pinpoint the causal "regulatory" variant among the three SNPs. Starting at a gross level perturbation, deletion of the entire region harboring all three SNPs in human iPSC-derived neural progenitor cells via CRISPR-Cas9 editing and subsequent RNA sequencing revealed expression changes in specific PIG-Q transcripts. Results from more refined individual luciferase reporter assays for each of the three SNPs in iPSCs revealed that the intronic region with the rs3752495 risk allele induced a ~2.5-fold increase in luciferase expression (n=10). Importantly, rs3752495 also exhibited an allele specific effect, with the risk allele increasing the luciferase expression by ~2-fold compared to the non-risk allele. In conclusion, our variant-to-function approach and subsequent in vitro validation implicates rs3752495 as a causal insomnia risk variant embedded at the WDR90-PIG-Q locus.