RESUMEN
MOTIVATION: Understanding the rules that govern enhancer-driven transcription remains a central unsolved problem in genomics. Now with multiple massively parallel enhancer perturbation assays published, there are enough data that we can utilize to learn to predict enhancer-promoter (EP) relationships in a data-driven manner. RESULTS: We applied machine learning to one of the largest enhancer perturbation studies integrated with transcription factor (TF) and histone modification ChIP-seq. The results uncovered a discrepancy in the prediction of genome-wide data compared to data from targeted experiments. Relative strength of contact was important for prediction, confirming the basic principle of EP regulation. Novel features such as the density of the enhancers/promoters in the genomic region was found to be important, highlighting our lack of understanding on how other elements in the region contribute to the regulation. Several TF peaks were identified that improved the prediction by identifying the negatives and reducing False Positives. In summary, integrating genomic assays with enhancer perturbation studies increased the accuracy of the model, and provided novel insights into the understanding of enhancer-driven transcription. AVAILABILITY AND IMPLEMENTATION: The trained models, data, and the source code are available at http://doi.org/10.5281/zenodo.11290386 and https://github.com/HanLabUNLV/sleps.
Asunto(s)
Elementos de Facilitación Genéticos , Regiones Promotoras Genéticas , Aprendizaje Automático Supervisado , Humanos , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , Genómica/métodos , Secuenciación de Inmunoprecipitación de Cromatina/métodosRESUMEN
Rationale: Idiopathic pulmonary fibrosis (IPF) is a rare, irreversible, and progressive disease of the lungs. Common genetic variants, in addition to nongenetic factors, have been consistently associated with IPF. Rare variants identified by candidate gene, family-based, and exome studies have also been reported to associate with IPF. However, the extent to which rare variants, genome-wide, may contribute to the risk of IPF remains unknown. Objectives: We used whole-genome sequencing to investigate the role of rare variants, genome-wide, on IPF risk. Methods: As part of the Trans-Omics for Precision Medicine Program, we sequenced 2,180 cases of IPF. Association testing focused on the aggregated effect of rare variants (minor allele frequency ⩽0.01) within genes or regions. We also identified individual rare variants that are influential within genes and estimated the heritability of IPF on the basis of rare and common variants. Measurements and Main Results: Rare variants in both TERT and RTEL1 were significantly associated with IPF. A single rare variant in each of the TERT and RTEL1 genes was found to consistently influence the aggregated test statistics. There was no significant evidence of association with other previously reported rare variants. The SNP heritability of IPF was estimated to be 32% (SE = 3%). Conclusions: Rare variants within the TERT and RTEL1 genes and well-established common variants have the largest contribution to IPF risk overall. Efforts in risk profiling or the development of therapies for IPF that focus on TERT, RTEL1, common variants, and environmental risk factors are likely to have the largest impact on this complex disease.
Asunto(s)
Fibrosis Pulmonar Idiopática , Humanos , Fibrosis Pulmonar Idiopática/genética , Secuenciación Completa del Genoma , ExomaRESUMEN
The gain-of-function minor allele of the MUC5B (mucin 5B, oligomeric mucus/gel-forming) promoter (rs35705950) is the strongest risk factor for idiopathic pulmonary fibrosis (IPF), a devastating fibrotic lung disease that leads to progressive respiratory failure in adults. We have previously demonstrated that Muc5b overexpression in mice worsens lung fibrosis after bleomycin exposure and have hypothesized that excess Muc5b promotes endoplasmic reticulum (ER) stress and apoptosis, stimulating fibrotic lung injury. Here, we report that ER stress pathway members ATF4 (activating transcription factor 4) and ATF6 coexpress with MUC5B in epithelia of the distal IPF airway and honeycomb cyst and that this is more pronounced in carriers of the gain-of-function MUC5B promoter variant. Similarly, in mice exposed to bleomycin, Muc5b expression is temporally associated with markers of ER stress. Using bulk and single-cell RNA sequencing in bleomycin-exposed mice, we found that pathologic ER stress-associated transcripts Atf4 and Ddit3 (DNA damage inducible transcript 3) were elevated in alveolar epithelia of SFTPC-Muc5b transgenic (SFTPC-Muc5bTg) mice relative to wild-type (WT) mice. Activation of the ER stress response inhibits protein translation for most genes by phosphorylation of Eif2α (eukaryotic translation initiation factor 2 alpha), which prevents guanine exchange by Eif2B and facilitates translation of Atf4. The integrated stress response inhibitor (ISRIB) facilitates interaction of phosphorylated Eif2α with Eif2B, overcoming translation inhibition associated with ER stress and reducing Atf4. We found that a single dose of ISRIB diminished Atf4 translation in SFTPC-Muc5bTg mice after bleomycin injury. Moreover, ISRIB resolved the exaggerated fibrotic response of SFTPC-Muc5bTg mice to bleomycin. In summary, we demonstrate that MUC5B and Muc5b expression is associated with pathologic ER stress and that restoration of normal translation with a single dose of ISRIB promotes lung repair in bleomycin-injured Muc5b-overexpressing mice.
Asunto(s)
Fibrosis Pulmonar Idiopática , Mucina 5B , Ratones , Animales , Mucina 5B/genética , Mucina 5B/metabolismo , Factor 2B Eucariótico de Iniciación , Fibrosis Pulmonar Idiopática/inducido químicamente , Fibrosis Pulmonar Idiopática/genética , Fibrosis Pulmonar Idiopática/metabolismo , Estrés del Retículo Endoplásmico , BleomicinaRESUMEN
Rationale: Common genetic variants have been associated with idiopathic pulmonary fibrosis (IPF). Objectives: To determine functional relevance of the 10 IPF-associated common genetic variants we previously identified. Methods: We performed expression quantitative trait loci (eQTL) and methylation quantitative trait loci (mQTL) mapping, followed by co-localization of eQTL and mQTL with genetic association signals and functional validation by luciferase reporter assays. Illumina multi-ethnic genotyping arrays, mRNA sequencing, and Illumina 850k methylation arrays were performed on lung tissue of participants with IPF (234 RNA and 345 DNA samples) and non-diseased controls (188 RNA and 202 DNA samples). Measurements and Main Results: Focusing on genetic variants within 10 IPF-associated genetic loci, we identified 27 eQTLs in controls and 24 eQTLs in cases (false-discovery-rate-adjusted P < 0.05). Among these signals, we identified associations of lead variants rs35705950 with expression of MUC5B and rs2076295 with expression of DSP in both cases and controls. mQTL analysis identified CpGs in gene bodies of MUC5B (cg17589883) and DSP (cg08964675) associated with the lead variants in these two loci. We also demonstrated strong co-localization of eQTL/mQTL and genetic signal in MUC5B (rs35705950) and DSP (rs2076295). Functional validation of the mQTL in MUC5B using luciferase reporter assays demonstrates that the CpG resides within a putative internal repressor element. Conclusions: We have established a relationship of the common IPF genetic risk variants rs35705950 and rs2076295 with respective changes in MUC5B and DSP expression and methylation. These results provide additional evidence that both MUC5B and DSP are involved in the etiology of IPF.
Asunto(s)
Fibrosis Pulmonar Idiopática , Humanos , ADN , Metilación de ADN/genética , Expresión Génica , Predisposición Genética a la Enfermedad/genética , Fibrosis Pulmonar Idiopática/genética , Mucina 5B/genética , Sitios de Carácter Cuantitativo/genética , ARNRESUMEN
We previously identified a novel molecular subtype of idiopathic pulmonary fibrosis (IPF) defined by increased expression of cilium-associated genes, airway mucin gene MUC5B, and KRT5 marker of basal cell airway progenitors. Here we show the association of MUC5B and cilia gene expression in human IPF airway epithelial cells, providing further rationale for examining the role of cilium genes in the pathogenesis of IPF. We demonstrate increased multiciliogenesis and changes in motile cilia structure of multiciliated cells both in IPF and bleomycin lung fibrosis models. Importantly, conditional deletion of a cilium gene, Ift88 (intraflagellar transport 88), in Krt5 basal cells reduces Krt5 pod formation and lung fibrosis, whereas no changes are observed in Ift88 conditional deletion in club cell progenitors. Our findings indicate that aberrant injury-activated primary ciliogenesis and Hedgehog signaling may play a causative role in Krt5 pod formation, which leads to aberrant multiciliogenesis and lung fibrosis. This implies that modulating cilium gene expression in Krt5 cell progenitors is a potential therapeutic target for IPF.
Asunto(s)
Fibrosis Pulmonar Idiopática , Bleomicina/toxicidad , Cilios/metabolismo , Proteínas Hedgehog/genética , Proteínas Hedgehog/metabolismo , Humanos , Fibrosis Pulmonar Idiopática/patología , Transducción de SeñalRESUMEN
A subset of patients with hypersensitivity pneumonitis (HP) develop lung fibrosis that is clinically similar to idiopathic pulmonary fibrosis (IPF). To address the aetiological determinants of fibrotic HP, we investigated whether the common IPF genetic risk variants were also relevant in study subjects with fibrotic HP. Our findings indicate that common genetic variants in TERC, DSP, MUC5B and IVD were significantly associated with fibrotic HP. These findings provide support for a shared etiology and pathogenesis between fibrotic HP and IPF.
Asunto(s)
Alveolitis Alérgica Extrínseca , Fibrosis Pulmonar Idiopática , Alveolitis Alérgica Extrínseca/genética , Fibrosis , Humanos , Fibrosis Pulmonar Idiopática/genética , Fibrosis Pulmonar Idiopática/patología , Pulmón/patología , Factores de RiesgoRESUMEN
Molecular patterns and pathways in idiopathic pulmonary fibrosis (IPF) have been extensively investigated, but few studies have assimilated multiomic platforms to provide an integrative understanding of molecular patterns that are relevant in IPF. Herein, we combine the coding and noncoding transcriptomes, DNA methylomes, and proteomes from IPF and healthy lung tissue to identify molecules and pathways associated with this disease. RNA sequencing, Illumina MethylationEPIC array, and liquid chromatography-mass spectrometry proteomic data were collected on lung tissue from 24 subjects with IPF and 14 control subjects. Significant differential features were identified by using linear models adjusting for age and sex, inflation, and bias when appropriate. Data Integration Analysis for Biomarker Discovery Using a Latent Component Method for Omics Studies was used for integrative multiomic analysis. We identified 4,643 differentially expressed transcripts aligning to 3,439 genes, 998 differentially abundant proteins, 2,500 differentially methylated regions, and 1,269 differentially expressed long noncoding RNAs (lncRNAs) that were significant after correcting for multiple tests (false discovery rate < 0.05). Unsupervised hierarchical clustering using 20 coding mRNA, protein, methylation, and lncRNA features with the highest loadings on the top latent variable from the four data sets demonstrates perfect separation of IPF and control lungs. Our analysis confirmed previously validated molecules and pathways known to be dysregulated in disease and implicated novel molecular features as potential drivers and modifiers of disease. For example, 4 proteins, 18 differentially methylated regions, and 10 lncRNAs were found to have strong correlations (|r| > 0.8) with MMP7 (matrix metalloproteinase 7). Therefore, by using a system biology approach, we have identified novel molecular relationships in IPF.
Asunto(s)
Fibrosis Pulmonar Idiopática/metabolismo , Pulmón/metabolismo , ARN Largo no Codificante/genética , Transcriptoma/fisiología , Anciano , Estudios de Casos y Controles , Femenino , Perfilación de la Expresión Génica/métodos , Humanos , Masculino , Metaloproteinasa 7 de la Matriz/metabolismo , Persona de Mediana Edad , ARN Mensajero/metabolismoRESUMEN
Non-secretor status due to homozygosity for the common FUT2 variant c.461G>A (p.Trp154∗) is associated with either risk for autoimmune diseases or protection against viral diarrhea and HIV. We determined the role of FUT2 in otitis media susceptibility by obtaining DNA samples from 609 multi-ethnic families and simplex case subjects with otitis media. Exome and Sanger sequencing, linkage analysis, and Fisher exact and transmission disequilibrium tests (TDT) were performed. The common FUT2 c.604C>T (p.Arg202∗) variant co-segregates with otitis media in a Filipino pedigree (LOD = 4.0). Additionally, a rare variant, c.412C>T (p.Arg138Cys), is associated with recurrent/chronic otitis media in European-American children (p = 1.2 × 10-5) and US trios (TDT p = 0.01). The c.461G>A (p.Trp154∗) variant was also over-transmitted in US trios (TDT p = 0.01) and was associated with shifts in middle ear microbiota composition (PERMANOVA p < 10-7) and increased biodiversity. When all missense and nonsense variants identified in multi-ethnic US trios with CADD > 20 were combined, FUT2 variants were over-transmitted in trios (TDT p = 0.001). Fut2 is transiently upregulated in mouse middle ear after inoculation with non-typeable Haemophilus influenzae. Four FUT2 variants-namely p.Ala104Val, p.Arg138Cys, p.Trp154∗, and p.Arg202∗-reduced A antigen in mutant-transfected COS-7 cells, while the nonsense variants also reduced FUT2 protein levels. Common and rare FUT2 variants confer susceptibility to otitis media, likely by modifying the middle ear microbiome through regulation of A antigen levels in epithelial cells. Our families demonstrate marked intra-familial genetic heterogeneity, suggesting that multiple combinations of common and rare variants plus environmental factors influence the individual otitis media phenotype as a complex trait.
Asunto(s)
Fucosiltransferasas/genética , Variación Genética/genética , Otitis Media/genética , Animales , Células COS , Línea Celular , Chlorocebus aethiops , Oído Medio/microbiología , Exoma/genética , Femenino , Células HEK293 , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Microbiota/fisiología , Otitis Media/microbiología , Linaje , Galactósido 2-alfa-L-FucosiltransferasaRESUMEN
Rationale: Chronic hypersensitivity pneumonitis (CHP) is caused by an immune response to antigen inhalation and is characterized by variable histopathological and clinical features. A subset of subjects with CHP have usual interstitial pneumonia and appear to be clinically similar to subjects with idiopathic pulmonary fibrosis (IPF).Objectives: To determine the common and unique molecular features of CHP and IPF.Methods: Transcriptome analysis of lung samples from CHP (n = 82), IPF (n = 103), and unaffected controls (n = 103) was conducted. Differential gene expression was determined adjusting for sex, race, age, and smoking history and using false discovery rate to control for multiple comparisons.Measurements and Main Results: When compared with controls, we identified 413 upregulated and 317 downregulated genes in CHP and 861 upregulated and 322 downregulated genes in IPF. Concordantly upregulated or downregulated genes in CHP and IPF were related to collagen catabolic processes and epithelial development, whereas genes specific to CHP (differentially expressed in CHP when compared with control and not differentially expressed in IPF) were related to chemokine-mediated signaling and immune responsiveness. Using weighted gene coexpression network analysis, we found that among subjects with CHP, genes involved in adaptive immunity or epithelial cell development were associated with improved or reduced lung function, respectively, and that MUC5B expression was associated with epithelial cell development. MUC5B expression was also associated with lung fibrosis and honeycombing.Conclusions: Gene expression analysis of CHP and IPF identified signatures common to CHP and IPF, as well as genes uniquely expressed in CHP. Select modules of gene expression are characterized by distinct clinical and pathological features of CHP.
Asunto(s)
Alveolitis Alérgica Extrínseca/genética , Alveolitis Alérgica Extrínseca/inmunología , Perfilación de la Expresión Génica , Fibrosis Pulmonar Idiopática/genética , Fibrosis Pulmonar Idiopática/inmunología , Enfermedades Pulmonares Intersticiales/genética , Enfermedades Pulmonares Intersticiales/inmunología , Adulto , Anciano , Anciano de 80 o más Años , Alveolitis Alérgica Extrínseca/fisiopatología , Femenino , Expresión Génica , Humanos , Fibrosis Pulmonar Idiopática/fisiopatología , Enfermedades Pulmonares Intersticiales/fisiopatología , Masculino , Persona de Mediana EdadRESUMEN
BACKGROUND: Epigenetic signatures in the nasal epithelium, which is a primary interface with the environment and an accessible proxy for the bronchial epithelium, might provide insights into mechanisms of allergic disease. OBJECTIVE: We aimed to identify and interpret methylation signatures in nasal epithelial brushes associated with rhinitis and asthma. METHODS: Nasal epithelial brushes were obtained from 455 children at the 16-year follow-up of the Dutch Prevention and Incidence of Asthma and Mite Allergy birth cohort study. Epigenome-wide association studies were performed on children with asthma, rhinitis, and asthma and/or rhinitis (AsRh) by using logistic regression, and the top results were replicated in 2 independent cohorts of African American and Puerto Rican children. Significant CpG sites were related to environmental exposures (pets, active and passive smoking, and molds) during secondary school and were correlated with gene expression by RNA-sequencing (n = 244). RESULTS: The epigenome-wide association studies identified CpG sites significantly associated with rhinitis (n = 81) and AsRh (n = 75), but not with asthma. We significantly replicated 62 of 81 CpG sites with rhinitis and 60 of 75 with AsRh, as well as 1 CpG site with asthma. Methylation of cg03565274 was negatively associated with AsRh and positively associated with exposure to pets during secondary school. DNA methylation signals associated with AsRh were mainly driven by specific IgE-positive subjects. DNA methylation related to gene transcripts that were enriched for immune pathways and expressed in immune and epithelial cells. Nasal CpG sites performed well in predicting AsRh. CONCLUSIONS: We identified replicable DNA methylation profiles of asthma and rhinitis in nasal brushes. Exposure to pets may affect nasal epithelial methylation in relation to asthma and rhinitis.
Asunto(s)
Asma/genética , Metilación de ADN/genética , Mucosa Nasal/inmunología , Rinitis/genética , Adolescente , Negro o Afroamericano/genética , Asma/inmunología , Niño , Estudios de Cohortes , Islas de CpG/genética , Islas de CpG/inmunología , Metilación de ADN/inmunología , Epigénesis Genética/genética , Epigénesis Genética/inmunología , Epigenoma/genética , Epigenoma/inmunología , Epigenómica/métodos , Células Epiteliales/inmunología , Femenino , Estudio de Asociación del Genoma Completo/métodos , Humanos , Inmunoglobulina E/genética , Masculino , Mucosa Respiratoria/inmunología , Rinitis/inmunologíaRESUMEN
Epigenome-wide studies of methylation in children support a role for epigenetic mechanisms in asthma; however, studies in adults are rare and few have examined non-atopic asthma. We conducted the largest epigenome-wide association study (EWAS) of blood DNA methylation in adults in relation to non-atopic and atopic asthma.We measured DNA methylation in blood using the Illumina MethylationEPIC array among 2286 participants in a case-control study of current adult asthma nested within a United States agricultural cohort. Atopy was defined by serum specific immunoglobulin E (IgE). Participants were categorised as atopy without asthma (n=185), non-atopic asthma (n=673), atopic asthma (n=271), or a reference group of neither atopy nor asthma (n=1157). Analyses were conducted using logistic regression.No associations were observed with atopy without asthma. Numerous cytosine-phosphate-guanine (CpG) sites were differentially methylated in non-atopic asthma (eight at family-wise error rate (FWER) p<9×10-8, 524 at false discovery rate (FDR) less than 0.05) and implicated 382 novel genes. More CpG sites were identified in atopic asthma (181 at FWER, 1086 at FDR) and implicated 569 novel genes. 104 FDR CpG sites overlapped. 35% of CpG sites in non-atopic asthma and 91% in atopic asthma replicated in studies of whole blood, eosinophils, airway epithelium, or nasal epithelium. Implicated genes were enriched in pathways related to the nervous system or inflammation.We identified numerous, distinct differentially methylated CpG sites in non-atopic and atopic asthma. Many CpG sites from blood replicated in asthma-relevant tissues. These circulating biomarkers reflect risk and sequelae of disease, as well as implicate novel genes associated with non-atopic and atopic asthma.
Asunto(s)
Asma , Epigenoma , Adulto , Asma/genética , Estudios de Casos y Controles , Niño , Islas de CpG , Metilación de ADN , Epigénesis Genética , Estudio de Asociación del Genoma Completo , Humanos , Pulmón , Estados UnidosRESUMEN
Rationale: Several common and rare genetic variants have been associated with idiopathic pulmonary fibrosis, a progressive fibrotic condition that is localized to the lung. Objectives: To develop an integrated understanding of the rare and common variants located in multiple loci that have been reported to contribute to the risk of disease. Methods: We performed deep targeted resequencing (3.69 Mb of DNA) in cases (n = 3,624) and control subjects (n = 4,442) across genes and regions previously associated with disease. We tested for associations between disease and 1) individual common variants via logistic regression and 2) groups of rare variants via sequence kernel association tests. Measurements and Main Results: Statistically significant common variant association signals occurred in all 10 of the regions chosen based on genome-wide association studies. The strongest risk variant is the MUC5B promoter variant rs35705950, with an odds ratio of 5.45 (95% confidence interval, 4.91-6.06) for one copy of the risk allele and 18.68 (95% confidence interval, 13.34-26.17) for two copies of the risk allele (P = 9.60 × 10-295). In addition to identifying for the first time that rare variation in FAM13A is associated with disease, we confirmed the role of rare variation in the TERT and RTEL1 gene regions in the risk of IPF, and found that the FAM13A and TERT regions have independent common and rare variant signals. Conclusions: A limited number of common and rare variants contribute to the risk of idiopathic pulmonary fibrosis in each of the resequencing regions, and these genetic variants focus on biological mechanisms of host defense and cell senescence.
Asunto(s)
Senescencia Celular/genética , Interacciones Huésped-Patógeno/genética , Fibrosis Pulmonar Idiopática/genética , Transportadoras de Casetes de Unión a ATP/genética , Estudios de Casos y Controles , ADN Helicasas/genética , Exorribonucleasas/genética , Femenino , Proteínas Activadoras de GTPasa/genética , Predisposición Genética a la Enfermedad , Variación Genética , Estudio de Asociación del Genoma Completo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Modelos Logísticos , Masculino , Mucina 5B/genética , Regiones Promotoras Genéticas/genética , Proteína A Asociada a Surfactante Pulmonar/genética , Proteína C Asociada a Surfactante Pulmonar/genética , ARN/genética , Análisis de Secuencia de ADN , Telomerasa/genética , Proteínas de Unión a Telómeros/genéticaRESUMEN
A genetic basis for otitis media is established, however, the role of rare variants in disease etiology is largely unknown. Previously a duplication variant within A2ML1 was identified as a significant risk factor for otitis media in an indigenous Filipino population and in US children. In this report exome and Sanger sequencing was performed using DNA samples from the indigenous Filipino population, Filipino cochlear implantees, US probands, Finnish, and Pakistani families with otitis media. Sixteen novel, damaging A2ML1 variants identified in otitis media patients were rare or low-frequency in population-matched controls. In the indigenous population, both gingivitis and A2ML1 variants including the known duplication variant and the novel splice variant c.4061 + 1 G>C were independently associated with otitis media. Sequencing of salivary RNA samples from indigenous Filipinos demonstrated lower A2ML1 expression according to the carriage of A2ML1 variants. Sequencing of additional salivary RNA samples from US patients with otitis media revealed differentially expressed genes that are highly correlated with A2ML1 expression levels. In particular, RND3 is upregulated in both A2ML1 variant carriers and high-A2ML1 expressors. These findings support a role for A2ML1 in keratinocyte differentiation within the middle ear as part of otitis media pathology and the potential application of ROCK inhibition in otitis media.
Asunto(s)
Regulación hacia Abajo , Perfilación de la Expresión Génica/métodos , Mutación , Otitis Media/genética , Análisis de Secuencia de ADN/métodos , alfa-Macroglobulinas/genética , Adolescente , Adulto , Niño , Preescolar , Femenino , Finlandia , Regulación de la Expresión Génica , Predisposición Genética a la Enfermedad , Humanos , Lactante , Masculino , Persona de Mediana Edad , Pakistán , Linaje , Filipinas , Análisis de Secuencia de ARN , Transducción de Señal , Estados Unidos , Adulto JovenAsunto(s)
Biomarcadores/sangre , Proteínas Sanguíneas/análisis , Diagnóstico Precoz , Fibrosis Pulmonar/sangre , Fibrosis Pulmonar/diagnóstico , Fibrosis Pulmonar/fisiopatología , Medición de Riesgo/métodos , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Colorado , Femenino , Humanos , Masculino , Persona de Mediana Edad , TennesseeRESUMEN
Most genetic variants identified through genome-wide association studies (GWAS) are suspected to be regulatory in nature, but only a small fraction colocalize with expression quantitative trait loci (eQTLs, variants associated with expression of a gene). Therefore, it is hypothesized but largely untested that integration of disease GWAS with context-specific eQTLs will reveal the underlying genes driving disease associations. We used colocalization and transcriptomic analyses to identify shared genetic variants and likely causal genes associated with critically ill COVID-19 and idiopathic pulmonary fibrosis. We first identified five genome-wide significant variants associated with both diseases. Four of the variants did not demonstrate clear colocalization between GWAS and healthy lung eQTL signals. Instead, two of the four variants colocalized only in cell-type and disease-specific eQTL datasets. These analyses pointed to higher ATP11A expression from the C allele of rs12585036, in monocytes and in lung tissue from primarily smokers, which increased risk of IPF and decreased risk of critically ill COVID-19. We also found lower DPP9 expression (and higher methylation at a specific CpG) from the G allele of rs12610495, acting in fibroblasts and in IPF lungs, and increased risk of IPF and critically ill COVID-19. We further found differential expression of the identified causal genes in diseased lungs when compared to non-diseased lungs, specifically in epithelial and immune cell types. These findings highlight the power of integrating GWAS, context-specific eQTLs, and transcriptomics of diseased tissue to harness human genetic variation to identify causal genes and where they function during multiple diseases.
RESUMEN
The G/T transversion rs35705950, located approximately 3 kb upstream of the MUC5B start site, is the cardinal risk factor for idiopathic pulmonary fibrosis (IPF). Here, we investigate the function and chromatin structure of this -3 kb region and provide evidence that it functions as a classically defined enhancer subject to epigenetic programming. We use nascent transcript analysis to show that RNA polymerase II loads within 10 bp of the G/T transversion site, definitively establishing enhancer function for the region. By integrating Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) analysis of fresh and cultured human airway epithelial cells with nuclease sensitivity data, we demonstrate that this region is in accessible chromatin that affects the expression of MUC5B. Through applying paired single-nucleus RNA- and ATAC-seq to frozen tissue from IPF lungs, we extend these findings directly to disease, with results indicating that epigenetic programming of the -3 kb enhancer in IPF occurs in both MUC5B-expressing and nonexpressing lineages. In aggregate, our results indicate that the MUC5B-associated variant rs35705950 resides within an enhancer that is subject to epigenetic remodeling and contributes to pathologic misexpression in IPF.