RESUMEN
To better understand host-virus genetic dependencies and find potential therapeutic targets for COVID-19, we performed a genome-scale CRISPR loss-of-function screen to identify host factors required for SARS-CoV-2 viral infection of human alveolar epithelial cells. Top-ranked genes cluster into distinct pathways, including the vacuolar ATPase proton pump, Retromer, and Commander complexes. We validate these gene targets using several orthogonal methods such as CRISPR knockout, RNA interference knockdown, and small-molecule inhibitors. Using single-cell RNA-sequencing, we identify shared transcriptional changes in cholesterol biosynthesis upon loss of top-ranked genes. In addition, given the key role of the ACE2 receptor in the early stages of viral entry, we show that loss of RAB7A reduces viral entry by sequestering the ACE2 receptor inside cells. Overall, this work provides a genome-scale, quantitative resource of the impact of the loss of each host gene on fitness/response to viral infection.
Asunto(s)
COVID-19/genética , COVID-19/virología , Interacciones Huésped-Patógeno , SARS-CoV-2/fisiología , Células A549 , Células Epiteliales Alveolares/metabolismo , Células Epiteliales Alveolares/virología , Enzima Convertidora de Angiotensina 2/metabolismo , Vías Biosintéticas , COVID-19/metabolismo , Colesterol/biosíntesis , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Endosomas/metabolismo , Perfilación de la Expresión Génica , Técnicas de Silenciamiento del Gen , Técnicas de Inactivación de Genes/métodos , Estudio de Asociación del Genoma Completo , Interacciones Huésped-Patógeno/efectos de los fármacos , Humanos , Interferencia de ARN , SARS-CoV-2/crecimiento & desarrollo , Análisis de la Célula Individual , Carga Viral/efectos de los fármacos , Proteínas de Unión al GTP rab/genética , Proteínas de Unión a GTP rab7RESUMEN
Bulk-tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, and context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from the blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell-type proportions, we demonstrate that cell-type iQTLs could be considered as proxies for cell-type-specific QTL effects, particularly for the most abundant cell type in the tissue. The interpretation of age iQTLs, however, warrants caution because the moderation effect of age on the genotype and molecular phenotype association could be mediated by changes in cell-type composition. Finally, we show that cell-type iQTLs contribute to cell-type-specific enrichment of diseases that, in combination with additional functional data, could guide future functional studies. Overall, this study highlights the use of iQTLs to gain insights into the context specificity of regulatory effects.
Asunto(s)
Regulación de la Expresión Génica , Sitios de Carácter Cuantitativo , Humanos , Sitios de Carácter Cuantitativo/genética , Genotipo , FenotipoRESUMEN
Hyper-secretion and/or hyper-concentration of mucus is a defining feature of multiple obstructive lung diseases, including chronic obstructive pulmonary disease (COPD). Mucus itself is composed of a mixture of water, ions, salt and proteins, of which the gel-forming mucins, MUC5AC and MUC5B, are the most abundant. Recent studies have linked the concentrations of these proteins in sputum to COPD phenotypes, including chronic bronchitis (CB) and acute exacerbations (AE). We sought to determine whether common genetic variants influence sputum mucin concentrations and whether these variants are also associated with COPD phenotypes, specifically CB and AE. We performed a GWAS to identify quantitative trait loci for sputum mucin protein concentration (pQTL) in the Sub-Populations and InteRmediate Outcome Measures in COPD Study (SPIROMICS, n = 708 for total mucin, n = 215 for MUC5AC, MUC5B). Subsequently, we tested for associations of mucin pQTL with CB and AE using regression modeling (n = 822-1300). Replication analysis was conducted using data from COPDGene (n = 5740) and by examining results from the UK Biobank. We identified one genome-wide significant pQTL for MUC5AC (rs75401036) and two for MUC5B (rs140324259, rs10001928). The strongest association for MUC5B, with rs140324259 on chromosome 11, explained 14% of variation in sputum MUC5B. Despite being associated with lower MUC5B, the C allele of rs140324259 conferred increased risk of CB (odds ratio (OR) = 1.42; 95% confidence interval (CI): 1.10-1.80) as well as AE ascertained over three years of follow up (OR = 1.41; 95% CI: 1.02-1.94). Associations between rs140324259 and CB or AE did not replicate in COPDGene. However, in the UK Biobank, rs140324259 was associated with phenotypes that define CB, namely chronic mucus production and cough, again with the C allele conferring increased risk. We conclude that sputum MUC5AC and MUC5B concentrations are associated with common genetic variants, and the top locus for MUC5B may influence COPD phenotypes, in particular CB.
Asunto(s)
Mucinas , Enfermedad Pulmonar Obstructiva Crónica , Humanos , Mucinas/genética , Mucinas/metabolismo , Esputo/metabolismo , Enfermedad Pulmonar Obstructiva Crónica/genética , Moco/metabolismo , FenotipoRESUMEN
Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of features-referred to as canonical variables (CVs)-within each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which has only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely-used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0% ~ 50.0% variation in JHS and 38.9% ~ 49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits.
Asunto(s)
Análisis de Correlación Canónica , Proteómica , Humanos , Proteómica/métodos , Multiómica , Estudios de CohortesRESUMEN
Tens of thousands of genetic variants associated with gene expression (cis-eQTLs) have been discovered in the human population. These eQTLs are active in various tissues and contexts, but the molecular mechanisms of eQTL variability are poorly understood, hindering our understanding of genetic regulation across biological contexts. Since many eQTLs are believed to act by altering transcription factor (TF) binding affinity, we hypothesized that analyzing eQTL effect size as a function of TF level may allow discovery of mechanisms of eQTL variability. Using GTEx Consortium eQTL data from 49 tissues, we analyzed the interaction between eQTL effect size and TF level across tissues and across individuals within specific tissues and generated a list of 10,098 TF-eQTL interactions across 2,136 genes that are supported by at least two lines of evidence. These TF-eQTLs were enriched for various TF binding measures, supporting with orthogonal evidence that these eQTLs are regulated by the implicated TFs. We also found that our TF-eQTLs tend to overlap genes with gene-by-environment regulatory effects and to colocalize with GWAS loci, implying that our approach can help to elucidate mechanisms of context-specificity and trait associations. Finally, we highlight an interesting example of IKZF1 TF regulation of an APBB1IP gene eQTL that colocalizes with a GWAS signal for blood cell traits. Together, our findings provide candidate TF mechanisms for a large number of eQTLs and offer a generalizable approach for researchers to discover TF regulators of genetic variant effects in additional QTL datasets.
Asunto(s)
Sitios de Carácter Cuantitativo , Factores de Transcripción/fisiología , Alelos , Sitios de Unión , Técnicas de Silenciamiento del Gen , Interacción Gen-Ambiente , Estudio de Asociación del Genoma Completo , Humanos , Factor 1 Regulador del Interferón/genética , Modelos Genéticos , Fenotipo , Factores de Transcripción/metabolismoRESUMEN
BACKGROUND: Epigenetic dysregulation has been proposed as a key mechanism for arsenic-related cardiovascular disease (CVD). We evaluated differentially methylated positions (DMPs) as potential mediators on the association between arsenic and CVD. METHODS: Blood DNA methylation was measured in 2321 participants (mean age 56.2, 58.6% women) of the Strong Heart Study, a prospective cohort of American Indians. Urinary arsenic species were measured using high-performance liquid chromatography coupled to inductively coupled plasma mass spectrometry. We identified DMPs that are potential mediators between arsenic and CVD. In a cross-species analysis, we compared those DMPs with differential liver DNA methylation following early-life arsenic exposure in the apoE knockout (apoE-/-) mouse model of atherosclerosis. RESULTS: A total of 20 and 13 DMPs were potential mediators for CVD incidence and mortality, respectively, several of them annotated to genes related to diabetes. Eleven of these DMPs were similarly associated with incident CVD in 3 diverse prospective cohorts (Framingham Heart Study, Women's Health Initiative, and Multi-Ethnic Study of Atherosclerosis). In the mouse model, differentially methylated regions in 20 of those genes and DMPs in 10 genes were associated with arsenic. CONCLUSIONS: Differential DNA methylation might be part of the biological link between arsenic and CVD. The gene functions suggest that diabetes might represent a relevant mechanism for arsenic-related cardiovascular risk in populations with a high burden of diabetes.
Asunto(s)
Arsénico , Aterosclerosis , Enfermedades Cardiovasculares , Animales , Apolipoproteínas E , Arsénico/toxicidad , Aterosclerosis/inducido químicamente , Aterosclerosis/genética , Enfermedades Cardiovasculares/inducido químicamente , Enfermedades Cardiovasculares/genética , Metilación de ADN , Femenino , Humanos , Masculino , Ratones , Persona de Mediana Edad , Estudios ProspectivosRESUMEN
Transcriptome studies disentangle functional mechanisms of gene expression regulation and may elucidate the underlying biology of disease processes. However, the types of tissues currently collected typically assay a single post-mortem timepoint or are limited to investigating cell types found in blood. Noninvasive tissues may improve disease-relevant discovery by enabling more complex longitudinal study designs, by capturing different and potentially more applicable cell types, and by increasing sample sizes due to reduced collection costs and possible higher enrollment from vulnerable populations. Here, we develop methods for sampling noninvasive biospecimens, investigate their performance across commercial and in-house library preparations, characterize their biology, and assess the feasibility of using noninvasive tissues in a multitude of transcriptomic applications. We collected buccal swabs, hair follicles, saliva, and urine cell pellets from 19 individuals over three to four timepoints, for a total of 300 unique biological samples, which we then prepared with replicates across three library preparations, for a final tally of 472 transcriptomes. Of the four tissues we studied, we found hair follicles and urine cell pellets to be most promising due to the consistency of sample quality, the cell types and expression profiles we observed, and their performance in disease-relevant applications. This is the first study to thoroughly delineate biological and technical features of noninvasive samples and demonstrate their use in a wide array of transcriptomic and clinical analyses. We anticipate future use of these biospecimens will facilitate discovery and development of clinical applications.
Asunto(s)
Perfilación de la Expresión Génica , Transcriptoma , Humanos , Estudios Longitudinales , Regulación de la Expresión Génica , SalivaRESUMEN
We conducted cohort- and race-specific epigenome-wide association analyses of mitochondrial deoxyribonucleic acid (mtDNA) copy number (mtDNA CN) measured in whole blood from participants of African and European origins in five cohorts (n = 6182, mean age = 57-67 years, 65% women). In the meta-analysis of all the participants, we discovered 21 mtDNA CN-associated DNA methylation sites (CpG) (P < 1 × 10-7), with a 0.7-3.0 standard deviation increase (3 CpGs) or decrease (18 CpGs) in mtDNA CN corresponding to a 1% increase in DNA methylation. Several significant CpGs have been reported to be associated with at least two risk factors (e.g. chronological age or smoking) for cardiovascular disease (CVD). Five genes [PR/SET domain 16, nuclear receptor subfamily 1 group H member 3 (NR1H3), DNA repair protein, DNA polymerase kappa and decaprenyl-diphosphate synthase subunit 2], which harbor nine significant CpGs, are known to be involved in mitochondrial biosynthesis and functions. For example, NR1H3 encodes a transcription factor that is differentially expressed during an adipose tissue transition. The methylation level of cg09548275 in NR1H3 was negatively associated with mtDNA CN (effect size = -1.71, P = 4 × 10-8) and was positively associated with the NR1H3 expression level (effect size = 0.43, P = 0.0003), which indicates that the methylation level in NR1H3 may underlie the relationship between mtDNA CN, the NR1H3 transcription factor and energy expenditure. In summary, the study results suggest that mtDNA CN variation in whole blood is associated with DNA methylation levels in genes that are involved in a wide range of mitochondrial activities. These findings will help reveal molecular mechanisms between mtDNA CN and CVD.
Asunto(s)
Epigenoma , Genoma Mitocondrial , Anciano , Variaciones en el Número de Copia de ADN/genética , Metilación de ADN/genética , ADN Mitocondrial/genética , ADN Mitocondrial/metabolismo , Epigenoma/genética , Femenino , Genoma Mitocondrial/genética , Humanos , Masculino , Persona de Mediana EdadRESUMEN
BACKGROUND: Treatment and preventative advances for chronic obstructive pulmonary disease (COPD) have been slow due, in part, to limited subphenotypes. We tested if unsupervised machine learning on CT images would discover CT emphysema subtypes with distinct characteristics, prognoses and genetic associations. METHODS: New CT emphysema subtypes were identified by unsupervised machine learning on only the texture and location of emphysematous regions on CT scans from 2853 participants in the Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS), a COPD case-control study, followed by data reduction. Subtypes were compared with symptoms and physiology among 2949 participants in the population-based Multi-Ethnic Study of Atherosclerosis (MESA) Lung Study and with prognosis among 6658 MESA participants. Associations with genome-wide single-nucleotide-polymorphisms were examined. RESULTS: The algorithm discovered six reproducible (interlearner intraclass correlation coefficient, 0.91-1.00) CT emphysema subtypes. The most common subtype in SPIROMICS, the combined bronchitis-apical subtype, was associated with chronic bronchitis, accelerated lung function decline, hospitalisations, deaths, incident airflow limitation and a gene variant near DRD1, which is implicated in mucin hypersecretion (p=1.1 ×10-8). The second, the diffuse subtype was associated with lower weight, respiratory hospitalisations and deaths, and incident airflow limitation. The third was associated with age only. The fourth and fifth visually resembled combined pulmonary fibrosis emphysema and had distinct symptoms, physiology, prognosis and genetic associations. The sixth visually resembled vanishing lung syndrome. CONCLUSION: Large-scale unsupervised machine learning on CT scans defined six reproducible, familiar CT emphysema subtypes that suggest paths to specific diagnosis and personalised therapies in COPD and pre-COPD.
Asunto(s)
Enfisema , Enfermedad Pulmonar Obstructiva Crónica , Enfisema Pulmonar , Humanos , Enfisema Pulmonar/diagnóstico por imagen , Enfisema Pulmonar/genética , Estudios de Casos y Controles , Aprendizaje Automático no Supervisado , Pulmón , Tomografía Computarizada por Rayos XRESUMEN
BACKGROUND: Chronic obstructive pulmonary disease (COPD) varies significantly in symptomatic and physiologic presentation. Identifying disease subtypes from molecular data, collected from easily accessible blood samples, can help stratify patients and guide disease management and treatment. METHODS: Blood gene expression measured by RNA-sequencing in the COPDGene Study was analyzed using a network perturbation analysis method. Each COPD sample was compared against a learned reference gene network to determine the part that is deregulated. Gene deregulation values were used to cluster the disease samples. RESULTS: The discovery set included 617 former smokers from COPDGene. Four distinct gene network subtypes are identified with significant differences in symptoms, exercise capacity and mortality. These clusters do not necessarily correspond with the levels of lung function impairment and are independently validated in two external cohorts: 769 former smokers from COPDGene and 431 former smokers in the Multi-Ethnic Study of Atherosclerosis (MESA). Additionally, we identify several genes that are significantly deregulated across these subtypes, including DSP and GSTM1, which have been previously associated with COPD through genome-wide association study (GWAS). CONCLUSIONS: The identified subtypes differ in mortality and in their clinical and functional characteristics, underlining the need for multi-dimensional assessment potentially supplemented by selected markers of gene expression. The subtypes were consistent across cohorts and could be used for new patient stratification and disease prognosis.
Asunto(s)
Redes Reguladoras de Genes , Enfermedad Pulmonar Obstructiva Crónica , Humanos , Redes Reguladoras de Genes/genética , Fumadores , Estudio de Asociación del Genoma Completo/métodos , Enfermedad Pulmonar Obstructiva Crónica/diagnóstico , Enfermedad Pulmonar Obstructiva Crónica/genética , PronósticoRESUMEN
Approximately 1.5 billion people worldwide are overweight or affected by obesity, and are at risk of developing type 2 diabetes, cardiovascular disease and related metabolic and inflammatory disturbances. Although the mechanisms linking adiposity to associated clinical conditions are poorly understood, recent studies suggest that adiposity may influence DNA methylation, a key regulator of gene expression and molecular phenotype. Here we use epigenome-wide association to show that body mass index (BMI; a key measure of adiposity) is associated with widespread changes in DNA methylation (187 genetic loci with P < 1 × 10-7, range P = 9.2 × 10-8 to 6.0 × 10-46; n = 10,261 samples). Genetic association analyses demonstrate that the alterations in DNA methylation are predominantly the consequence of adiposity, rather than the cause. We find that methylation loci are enriched for functional genomic features in multiple tissues (P < 0.05), and show that sentinel methylation markers identify gene expression signatures at 38 loci (P < 9.0 × 10-6, range P = 5.5 × 10-6 to 6.1 × 10-35, n = 1,785 samples). The methylation loci identify genes involved in lipid and lipoprotein metabolism, substrate transport and inflammatory pathways. Finally, we show that the disturbances in DNA methylation predict future development of type 2 diabetes (relative risk per 1 standard deviation increase in methylation risk score: 2.3 (2.07-2.56); P = 1.1 × 10-54). Our results provide new insights into the biologic pathways influenced by adiposity, and may enable development of new strategies for prediction and prevention of type 2 diabetes and other adverse clinical consequences of obesity.
Asunto(s)
Adiposidad/genética , Índice de Masa Corporal , Metilación de ADN/genética , Diabetes Mellitus Tipo 2/genética , Epigénesis Genética , Epigenómica , Estudio de Asociación del Genoma Completo , Obesidad/genética , Tejido Adiposo/metabolismo , Pueblo Asiatico/genética , Sangre/metabolismo , Estudios de Cohortes , Diabetes Mellitus Tipo 2/complicaciones , Europa (Continente)/etnología , Femenino , Marcadores Genéticos , Predisposición Genética a la Enfermedad , Humanos , India/etnología , Masculino , Obesidad/sangre , Obesidad/complicaciones , Sobrepeso/sangre , Sobrepeso/complicaciones , Sobrepeso/genética , Población Blanca/genéticaRESUMEN
OBJECTIVE: The purpose of this study was to identify disease relevant genes and explore underlying immunological mechanisms that contribute to early and late onset forms of myasthenia gravis. METHODS: We used a novel genomic methodology to integrate genomewide association study (GWAS) findings in myasthenia gravis with cell-type specific information, such as gene expression patterns and promotor-enhancer interactions, in order to identify disease-relevant genes. Subsequently, we conducted additional genomic investigations, including an expression quantitative analysis of 313 healthy people to provide mechanistic insights. RESULTS: We identified several genes that were specifically linked to early onset myasthenia gravis including TNIP1, ORMDL3, GSDMB, and TRAF3. We showed that regulators of toll-like receptor 4 signaling were enriched among these early onset disease genes (fold enrichment = 3.85, p = 6.4 × 10-3 ). In contrast, T-cell regulators CD28 and CTLA4 were exclusively linked to late onset disease. We identified 2 causal genetic variants (rs231770 and rs231735; posterior probability = 0.98 and 0.91) near the CTLA4 gene. Subsequently, we demonstrated that these causal variants result in low expression of CTLA4 (rho = -0.66, p = 1.28 × 10-38 and rho = -0.52, p = 7.01 × 10-22 , for rs231735 and rs231770, respectively). INTERPRETATION: The disease-relevant genes identified in this study are a unique resource for many disciplines, including clinicians, scientists, and the pharmaceutical industry. The distinct immunological pathways linked to early and late onset myasthenia gravis carry important implications for drug repurposing opportunities and for future studies of drug development. ANN NEUROL 2021;90:455-463.
Asunto(s)
Variación Genética/fisiología , Estudio de Asociación del Genoma Completo/métodos , Inmunidad Innata/fisiología , Miastenia Gravis/genética , Miastenia Gravis/inmunología , Polimorfismo de Nucleótido Simple/fisiología , Adulto , Edad de Inicio , Femenino , Humanos , Masculino , Persona de Mediana Edad , Miastenia Gravis/diagnósticoRESUMEN
Inappropriate activation or inadequate regulation of CD4+ and CD8+ T cells may contribute to the initiation and progression of multiple autoimmune and inflammatory diseases. Studies on disease-associated genetic polymorphisms have highlighted the importance of biological context for many regulatory variants, which is particularly relevant in understanding the genetic regulation of the immune system and its cellular phenotypes. Here we show cell type-specific regulation of transcript levels of genes associated with several autoimmune diseases in CD4+ and CD8+ T cells including a trans-acting regulatory locus at chr12q13.2 containing the rs1131017 SNP in the RPS26 gene. Most remarkably, we identify a common missense variant in IL27, associated with type 1 diabetes that results in decreased functional activity of the protein and reduced expression levels of downstream IRF1 and STAT1 in CD4+ T cells only. Altogether, our results indicate that eQTL mapping in purified T cells provides novel functional insights into polymorphisms and pathways associated with autoimmune diseases.
Asunto(s)
Enfermedades Autoinmunes/genética , Linfocitos T CD4-Positivos/metabolismo , Linfocitos T CD8-positivos/metabolismo , Sitios de Carácter Cuantitativo/genética , Linfocitos T CD4-Positivos/inmunología , Linfocitos T CD8-positivos/inmunología , Mapeo Cromosómico/métodos , Diabetes Mellitus Tipo 1/genética , Regulación de la Expresión Génica , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Células HEK293 , Humanos , Factor 1 Regulador del Interferón/genética , Interleucina-27/genética , Mutación , Polimorfismo de Nucleótido Simple , Proteínas Ribosómicas/genética , Factor de Transcripción STAT1/genéticaRESUMEN
BACKGROUND: Glycosylation is one of the most common post-translation modifications with large influences on protein structure and function. The effector function of immunoglobulin G (IgG) alters between pro- and anti-inflammatory, based on its glycosylation. IgG glycan synthesis is highly complex and dynamic. METHODS: With the use of two different analytical methods for assessing IgG glycosylation, we aim to elucidate the link between DNA methylation and glycosylation of IgG by means of epigenome-wide association studies. In total, 3000 individuals from 4 cohorts were analyzed. RESULTS: The overlap of the results from the two glycan measurement panels yielded DNA methylation of 7 CpG-sites on 5 genomic locations to be associated with IgG glycosylation: cg25189904 (chr.1, GNG12); cg05951221, cg21566642 and cg01940273 (chr.2, ALPPL2); cg05575921 (chr.5, AHRR); cg06126421 (6p21.33); and cg03636183 (chr.19, F2RL3). Mediation analyses with respect to smoking revealed that the effect of smoking on IgG glycosylation may be at least partially mediated via DNA methylation levels at these 7 CpG-sites. CONCLUSION: Our results suggest the presence of an indirect link between DNA methylation and IgG glycosylation that may in part capture environmental exposures. GENERAL SIGNIFICANCE: An epigenome-wide analysis conducted in four population-based cohorts revealed an association between DNA methylation and IgG glycosylation patterns. Presumably, DNA methylation mediates the effect of smoking on IgG glycosylation.
Asunto(s)
Metilación de ADN , Inmunoglobulina G/química , Procesamiento Proteico-Postraduccional , Fumar/efectos adversos , Mapeo Cromosómico , Estudios de Cohortes , Islas de CpG , Epigenómica/métodos , Europa (Continente) , Glicosilación , Humanos , Inmunoglobulina G/metabolismo , Estudios Multicéntricos como Asunto , Polisacáridos/análisis , Estudios en Gemelos como AsuntoRESUMEN
The functional consequences of trait associated SNPs are often investigated using expression quantitative trait locus (eQTL) mapping. While trait-associated variants may operate in a cell-type specific manner, eQTL datasets for such cell-types may not always be available. We performed a genome-environment interaction (GxE) meta-analysis on data from 5,683 samples to infer the cell type specificity of whole blood cis-eQTLs. We demonstrate that this method is able to predict neutrophil and lymphocyte specific cis-eQTLs and replicate these predictions in independent cell-type specific datasets. Finally, we show that SNPs associated with Crohn's disease preferentially affect gene expression within neutrophils, including the archetypal NOD2 locus.
Asunto(s)
Linfocitos/citología , Neutrófilos/citología , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Línea Celular , Enfermedad de Crohn/genética , Regulación de la Expresión Génica , Estudio de Asociación del Genoma Completo/métodos , Humanos , Linfocitos/metabolismo , Neutrófilos/metabolismo , Proteína Adaptadora de Señalización NOD2/genética , Proteína Adaptadora de Señalización NOD2/metabolismo , Fenotipo , Análisis de Componente Principal , Reproducibilidad de los ResultadosRESUMEN
CDH13 encodes T-cadherin, a receptor for high molecular weight (HMW) adiponectin and low-density lipoprotein, promoting proliferation and migration of endothelial cells. Genome-wide association studies have mapped multiple variants in CDH13 associated with cardiometabolic traits (CMT) with variable effects across studies. We hypothesized that this heterogeneity might reflect interplay with DNA methylation within the region. Resequencing and EpiTYPER™ assay were applied for the HYPertension in ESTonia/Coronary Artery Disease in Czech (HYPEST/CADCZ; n = 358) samples to identify CDH13 promoter SNPs acting as methylation Quantitative Trait Loci (meQTLs) and to investigate their associations with CMT. In silico data were extracted from genome-wide DNA methylation and genotype datasets of the population-based sample Estonian Genome Center of the University of Tartu (EGCUT; n = 165). HYPEST-CADCZ meta-analysis identified a rare variant rs113460564 as highly significant meQTL for a 134-bp distant CpG site (P = 5.90 × 10(-6); ß = 3.19%). Four common SNPs (rs12443878, rs12444338, rs62040565, rs8060301) exhibited effect on methylation level of up to 3 neighboring CpG sites in both datasets. The strongest association was detected in EGCUT between rs8060301 and cg09415485 (false discovery rate corrected P value = 1.89 × 10(-30)). Simultaneously, rs8060301 showed association with diastolic blood pressure, serum high-density lipoprotein and HMW adiponectin (P < 0.005). Novel strong associations were identified between rare CDH13 promoter meQTLs (minor allele frequency <5%) and HMW adiponectin: rs2239857 (P = 5.50 × 10(-5), ß = -1,841.9 ng/mL) and rs77068073 (P = 2.67 × 10(-4), ß = -2,484.4 ng/mL). Our study shows conclusively that CDH13 promoter harbors meQTLs associated with CMTs. It paves the way to deeper understanding of the interplay between DNA variation and methylation in susceptibility to common diseases.
Asunto(s)
Presión Sanguínea/genética , Cadherinas/genética , Metilación de ADN , Regiones Promotoras Genéticas , Adiponectina/sangre , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Frecuencia de los Genes , Estudios de Asociación Genética , Pleiotropía Genética , Humanos , Hipertensión/sangre , Hipertensión/genética , Lipoproteínas HDL/sangre , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ADN , Adulto JovenRESUMEN
BACKGROUND: The liver plays a central role in the maintenance of homeostasis and health in general. However, there is substantial inter-individual variation in hepatic gene expression, and although numerous genetic factors have been identified, less is known about the epigenetic factors. RESULTS: By analyzing the methylomes and transcriptomes of 14 fetal and 181 adult livers, we identified 657 differentially methylated genes with adult-specific expression, these genes were enriched for transcription factor binding sites of HNF1A and HNF4A. We also identified 1,000 genes specific to fetal liver, which were enriched for GATA1, STAT5A, STAT5B and YY1 binding sites. We saw strong liver-specific effects of single nucleotide polymorphisms on both methylation levels (28,447 unique CpG sites (meQTL)) and gene expression levels (526 unique genes (eQTL)), at a false discovery rate (FDR) < 0.05. Of the 526 unique eQTL associated genes, 293 correlated significantly not only with genetic variation but also with methylation levels. The tissue-specificities of these associations were analyzed in muscle, subcutaneous adipose tissue and visceral adipose tissue. We observed that meQTL were more stable between tissues than eQTL and a very strong tissue-specificity for the identified associations between CpG methylation and gene expression. CONCLUSIONS: Our analyses generated a comprehensive resource of factors involved in the regulation of hepatic gene expression, and allowed us to estimate the proportion of variation in gene expression that could be attributed to genetic and epigenetic variation, both crucial to understanding differences in drug response and the etiology of liver diseases.
Asunto(s)
Epigénesis Genética , Epigenómica , Feto/metabolismo , Perfilación de la Expresión Génica , Hígado/crecimiento & desarrollo , Hígado/metabolismo , Adulto , Metilación de ADN , Feto/embriología , Regulación del Desarrollo de la Expresión Génica , Humanos , Especificidad de Órganos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo/genéticaRESUMEN
Background: Schizophrenia (SCZ) patients exhibit 30% higher prevalence of metabolic syndrome (MetS) compared to the general population with its suboptimal management contributing to increased mortality. Large-scale studies providing real-world evidence of the underlying causes remain limited. Methods: To address this gap, we used real-world health data from the Estonian Biobank, spanning a median follow-up of ten years, to investigate the impact of genetic predisposition and antipsychotic treatment on the development of MetS in SCZ patients. Specifically, we set out to characterize antipsychotic treatment patterns, genetic predisposition of MetS traits, MetS prognosis, and body mass index (BMI) trajectories, comparing SCZ cases (n = 677) to age- and sex-matched controls (n = 2708). Findings: SCZ cases exhibited higher genetic predisposition to SCZ (OR = 1.75, 95% CI 1.58-1.94), but lower polygenic burden for increased BMI (OR = 0.88, 95% CI 0.88-0.96) and C-reactive protein (OR = 0.88, 95% CI 0.81-0.97) compared to controls. While SCZ cases showed worse prognosis of MetS (HR 1.95, 95% CI 1.54-2.46), higher antipsychotic adherence within the first treatment year was associated with reduced long-term MetS incidence. Linear mixed modelling, incorporating multiple BMI timepoints, underscored the significant contribution of both, antipsychotic medication, and genetic predisposition to higher BMI, driving the substantially upward trajectory of BMI in SCZ cases. Interpretation: These findings contribute to refining clinical risk prediction and prevention strategies for MetS among SCZ patients and emphasize the significance of incorporating genetic information, long-term patient tracking, and employing diverse perspectives when analyzing real-world health data. Funding: EU Horizon 2020, Swedish Research Council, Estonian Research Council, Estonian Ministry of Education and Research, University of Tartu.
RESUMEN
Systemic low-grade inflammation is a feature of chronic disease. C-reactive protein (CRP) is a common biomarker of inflammation and used as an indicator of disease risk; however, the role of inflammation in disease is not completely understood. Methylation is an epigenetic modification in the DNA which plays a pivotal role in gene expression. In this study we evaluated differential DNA methylation patterns associated with blood CRP level to elucidate biological pathways and genetic regulatory mechanisms to improve the understanding of chronic inflammation. The racially and ethnically diverse participants in this study were included as 50% White, 41% Black or African American, 7% Hispanic or Latino/a, and 2% Native Hawaiian, Asian American, American Indian, or Alaska Native (total n = 13,433) individuals. We replicated 113 CpG sites from 87 unique loci, of which five were novel (CADM3, NALCN, NLRC5, ZNF792, and cg03282312), across a discovery set of 1,150 CpG sites associated with CRP level (p < 1.2E-7). The downstream pathways affected by DNA methylation included the identification of IFI16 and IRF7 CpG-gene transcript pairs which contributed to the innate immune response gene enrichment pathway along with NLRC5, NOD2, and AIM2. Gene enrichment analysis also identified the nuclear factor-kappaB transcription pathway. Using two-sample Mendelian randomization (MR) we inferred methylation at three CpG sites as causal for CRP levels using both White and Black or African American MR instrument variables. Overall, we identified novel CpG sites and gene transcripts that could be valuable in understanding the specific cellular processes and pathogenic mechanisms involved in inflammation.
Asunto(s)
Proteína C-Reactiva , Metilación de ADN , Humanos , Proteína C-Reactiva/genética , Epigénesis Genética , ADN , Inflamación/genética , Estudio de Asociación del Genoma Completo , Islas de CpG , Péptidos y Proteínas de Señalización Intracelular/genéticaRESUMEN
Chronic kidney disease (CKD) impacts about 1 in 7 adults in the United States, but African Americans (AAs) carry a disproportionately higher burden of disease. Epigenetic modifications, such as DNA methylation at cytosine-phosphate-guanine (CpG) sites, have been linked to kidney function and may have clinical utility in predicting the risk of CKD. Given the dynamic relationship between the epigenome, environment, and disease, AAs may be especially sensitive to environment-driven methylation alterations. Moreover, risk models incorporating CpG methylation have been shown to predict disease across multiple racial groups. In this study, we developed a methylation risk score (MRS) for CKD in cohorts of AAs. We selected nine CpG sites that were previously reported to be associated with estimated glomerular filtration rate (eGFR) in epigenome-wide association studies to construct a MRS in the Hypertension Genetic Epidemiology Network (HyperGEN). In logistic mixed models, the MRS was significantly associated with prevalent CKD and was robust to multiple sensitivity analyses, including CKD risk factors. There was modest replication in validation cohorts. In summary, we demonstrated that an eGFR-based CpG score is an independent predictor of prevalent CKD, suggesting that MRS should be further investigated for clinical utility in evaluating CKD risk and progression.