RESUMEN
BACKGROUND: Gene-environment interaction (G × E) is likely an important influence shaping individual differences in alcohol misuse (AM), yet it has not been extensively studied in molecular genetic research. In this study, we use a series of genome-wide gene-environment interaction (GWEIS) and in silico annotation methods with the aim of improving gene identification and biological understanding of AM. METHODS: We carried out GWEIS for four AM phenotypes in the large UK Biobank sample (N = 360,314), with trauma exposure and socioeconomic status (SES) as moderators of the genetic effects. Exploratory analyses compared stratified genome-wide association (GWAS) and GWEIS modeling approaches. We applied functional annotation, gene- and gene-set enrichment, and polygenic score analyses to interpret the GWEIS results. RESULTS: GWEIS models showed few genetic variants with significant interaction effects across gene-environment pairs. Enrichment analyses identified moderation by SES of the genes NOXA1, DLGAP1, and UBE2L3 on drinking quantity and the gene IFIT1B on drinking frequency. Except for DLGAP1, these genes have not previously been linked to AM. The most robust results (GWEIS interaction p = 4.59e-09) were seen for SES moderating the effects of variants linked to immune-related genes on a pattern of drinking with versus without meals. CONCLUSIONS: Our results highlight several genes and a potential mechanism of immune system functioning behind the moderating effect of SES on the genetic influences on AM. Although GWEIS seems to be a preferred approach over stratified GWAS, modeling G × E effects at the molecular level remains a challenge even in large samples. Understanding these effects will require substantial effort and more in-depth phenotypic measurement.
RESUMEN
The corpus callosum (CC) is the largest set of white matter fibers connecting the two hemispheres of the brain. In humans, it is essential for coordinating sensorimotor responses, performing associative/executive functions, and representing information in multiple dimensions. Understanding which genetic variants underpin corpus callosum morphometry, and their shared influence on cortical structure and susceptibility to neuropsychiatric disorders, can provide molecular insights into the CC's role in mediating cortical development and its contribution to neuropsychiatric disease. To characterize the morphometry of the midsagittal corpus callosum, we developed a publicly available artificial intelligence based tool to extract, parcellate, and calculate its total and regional area and thickness. Using the UK Biobank (UKB) and the Adolescent Brain Cognitive Development study (ABCD), we extracted measures of midsagittal corpus callosum morphometry and performed a genome-wide association study (GWAS) meta-analysis of European participants (combined N = 46,685). We then examined evidence for generalization to the non-European participants of the UKB and ABCD cohorts (combined N = 7,040). Post-GWAS analyses implicate prenatal intracellular organization and cell growth patterns, and high heritability in regions of open chromatin, suggesting transcriptional activity regulation in early development. Results suggest programmed cell death mediated by the immune system drives the thinning of the posterior body and isthmus. Global and local genetic overlap, along with causal genetic liability, between the corpus callosum, cerebral cortex, and neuropsychiatric disorders such as attention-deficit/hyperactivity and bipolar disorders were identified. These results provide insight into variability of corpus callosum development, its genetic influence on the cerebral cortex, and biological mechanisms related to neuropsychiatric dysfunction.
RESUMEN
While genome-wide association studies are increasingly successful in discovering genomic loci associated with complex human traits and disorders, the biological interpretation of these findings remains challenging. Here we developed the GSA-MiXeR analytical tool for gene set analysis (GSA), which fits a model for the heritability of individual genes, accounting for linkage disequilibrium across variants and allowing the quantification of partitioned heritability and fold enrichment for small gene sets. We validated the method using extensive simulations and sensitivity analyses. When applied to a diverse selection of complex traits and disorders, including schizophrenia, GSA-MiXeR prioritizes gene sets with greater biological specificity compared to standard GSA approaches, implicating voltage-gated calcium channel function and dopaminergic signaling for schizophrenia. Such biologically relevant gene sets, often with fewer than ten genes, are more likely to provide insights into the pathobiology of complex diseases and highlight potential drug targets.
Asunto(s)
Estudio de Asociación del Genoma Completo , Desequilibrio de Ligamiento , Esquizofrenia , Humanos , Estudio de Asociación del Genoma Completo/métodos , Esquizofrenia/genética , Herencia Multifactorial/genética , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Predisposición Genética a la Enfermedad , Mapeo Cromosómico/métodos , Simulación por Computador , Carácter Cuantitativo HeredableRESUMEN
Tropical coastal benthic communities will change in species composition and relative dominance due to global (e.g., increasing water temperature) and local (e.g., increasing terrestrial influence due to land-based activity) stressors. This study aimed to gain insight into possible trajectories of coastal benthic assemblages in Raja Ampat, Indonesia, by studying coral reefs at varying distances from human activities and marine lakes with high turbidity in three temperature categories (<31 °C, 31-32 °C, and >32 °C). The benthic community diversity and relative coverage of major benthic groups were quantified via replicate photo transects. The composition of benthic assemblages varied significantly among the reef and marine lake habitats. The marine lakes <31 °C contained hard coral, crustose coralline algae (CCA), and turf algae with coverages similar to those found in the coral reefs (17.4-18.8% hard coral, 3.5-26.3% CCA, and 15-15.5% turf algae, respectively), while the higher temperature marine lakes (31-32 °C and >32 °C) did not harbor hard coral or CCA. Benthic composition in the reefs was significantly influenced by geographic distance among sites but not by human activity or depth. Benthic composition in the marine lakes appeared to be structured by temperature, salinity, and degree of connection to the adjacent sea. Our results suggest that beyond a certain temperature (>31 °C), benthic communities shift away from coral dominance, but new outcomes of assemblages can be highly distinct, with a possible varied dominance of macroalgae, benthic cyanobacterial mats, or filter feeders such as bivalves and tubeworms. This study illustrates the possible use of marine lake model systems to gain insight into shifts in the benthic community structure of tropical coastal ecosystems if hard corals are no longer dominant.
Asunto(s)
Arrecifes de Coral , Ecosistema , Temperatura , Clima Tropical , Animales , Indonesia , Lagos/química , Biodiversidad , Antozoos/fisiologíaRESUMEN
Transcriptome-wide association studies (TWAS) aim to detect relationships between gene expression and a phenotype, and are commonly used for secondary analysis of genome-wide association study (GWAS) results. Results from TWAS analyses are often interpreted as indicating a genetic relationship between gene expression and a phenotype, but this interpretation is not consistent with the null hypothesis that is evaluated in the traditional TWAS framework. In this study we provide a mathematical outline of this TWAS framework, and elucidate what interpretations are warranted given the null hypothesis it actually tests. We then use both simulations and real data analysis to assess the implications of misinterpreting TWAS results as indicative of a genetic relationship between gene expression and the phenotype. Our simulation results show considerably inflated type 1 error rates for TWAS when interpreted this way, with 41% of significant TWAS associations detected in the real data analysis found to have insufficient statistical evidence to infer such a relationship. This demonstrates that in current implementations, TWAS cannot reliably be used to investigate genetic relationships between gene expression and a phenotype, but that local genetic correlation analysis can serve as a potential alternative.
Asunto(s)
Estudio de Asociación del Genoma Completo , Transcriptoma , Transcriptoma/genética , Mapeo Cromosómico , Simulación por Computador , Análisis de DatosRESUMEN
The relative influence of geography, currents, and environment on gene flow within sessile marine species remains an open question. Detecting subtle genetic differentiation at small scales is challenging in benthic populations due to large effective population sizes, general lack of resolution in genetic markers, and because barriers to dispersal often remain elusive. Marine lakes can circumvent confounding factors by providing discrete and replicated ecosystems. Using high-resolution double digest restriction-site-associated DNA sequencing (4826 Single Nucleotide Polymorphisms, SNPs), we genotyped populations of the sponge Suberites diversicolor (n = 125) to test the relative importance of spatial scales (1-1400 km), local environmental conditions, and permeability of seascape barriers in shaping population genomic structure. With the SNP dataset, we show strong intralineage population structure, even at scales <10 km (average F ST = 0.63), which was not detected previously using single markers. Most variation was explained by differentiation between populations (AMOVA: 48.8%) with signatures of population size declines and bottlenecks per lake. Although the populations were strongly structured, we did not detect significant effects of geographic distance, local environments, or degree of connection to the sea on population structure, suggesting mechanisms such as founder events with subsequent priority effects may be at play. We show that the inclusion of morphologically cryptic lineages that can be detected with the COI marker can reduce the obtained SNP set by around 90%. Future work on sponge genomics should confirm that only one lineage is included. Our results call for a reassessment of poorly dispersing benthic organisms that were previously assumed to be highly connected based on low-resolution markers.
RESUMEN
Functional connectivity within resting-state networks (RSN-FC) is vital for cognitive functioning. RSN-FC is heritable and partially translates to the anatomic architecture of white matter, but the genetic component of structural connections of RSNs (RSN-SC) and their potential genetic overlap with RSN-FC remain unknown. Here, we perform genome-wide association studies (N discovery = 24,336; N replication = 3412) and annotation on RSN-SC and RSN-FC. We identify genes for visual network-SC that are involved in axon guidance and synaptic functioning. Genetic variation in RSN-FC impacts biological processes relevant to brain disorders that previously were only phenotypically associated with RSN-FC alterations. Correlations of the genetic components of RSNs are mostly observed within the functional domain, whereas less overlap is observed within the structural domain and between the functional and structural domains. This study advances the understanding of the complex functional organization of the brain and its structural underpinnings from a genetics viewpoint.
Asunto(s)
Mapeo Encefálico , Estudio de Asociación del Genoma Completo , Imagen por Resonancia Magnética , Encéfalo/diagnóstico por imagen , Cognición , Red Nerviosa/diagnóstico por imagenRESUMEN
Proxy phenotypes allow for the utilization of genetic data from large population cohorts to analyze late-onset diseases by using parental diagnoses as a proxy for genetic disease risk. Proxy phenotypes based on parental diagnosis status have been used in previous studies to identify common variants associated with Alzheimer's disease. As of yet, proxy phenotypes have not been used to identify genes associated with Alzheimer's disease through rare variants. Here we show that a proxy Alzheimer's disease/dementia phenotype can capture known Alzheimer's disease risk genes through rare variant aggregation. We generated a proxy Alzheimer's disease/dementia phenotype for 148,508 unrelated individuals of European ancestry in the UK biobank in order to perform exome-wide rare variant aggregation analyses to identify genes associated with proxy Alzheimer's disease/dementia. We identified four genes significantly associated with the proxy phenotype, three of which were significantly associated with proxy Alzheimer's disease/dementia in an independent replication cohort consisting of 197,506 unrelated individuals of European ancestry in the UK biobank. All three of the replicated genes have been previously associated with clinically diagnosed Alzheimer's disease (SORL1, TREM2, and TOMM40/APOE). We show that proxy Alzheimer's disease/dementia can be used to identify genes associated with Alzheimer's disease through rare variant aggregation.
Asunto(s)
Enfermedad de Alzheimer , Humanos , Enfermedad de Alzheimer/genética , Variación Genética , Exoma , Estudio de Asociación del Genoma Completo , Factores de Riesgo , Predisposición Genética a la Enfermedad , Proteínas Relacionadas con Receptor de LDL/genética , Proteínas de Transporte de Membrana/genéticaRESUMEN
The widespread comorbidity among psychiatric disorders demonstrated in epidemiological studies1-5 is mirrored by non-zero, positive genetic correlations from large-scale genetic studies6-10. To identify shared biological processes underpinning this observed phenotypic and genetic covariance and enhance molecular characterization of general psychiatric disorder liability11-13, we used several strategies aimed at uncovering pleiotropic, that is, cross-trait-associated, single-nucleotide polymorphisms (SNPs), genes and biological pathways. We conducted cross-trait meta-analysis on 12 psychiatric disorders to identify pleiotropic SNPs. The meta-analytic signal was driven by schizophrenia, hampering interpretation and joint biological characterization of the cross-trait meta-analytic signal. Subsequent pairwise comparisons of psychiatric disorders identified substantial pleiotropic overlap, but mainly among pairs of psychiatric disorders, and mainly at less stringent P-value thresholds. Only annotations related to evolutionarily conserved genomic regions were significant for multiple (9 out of 12) psychiatric disorders. Overall, identification of shared biological mechanisms remains challenging due to variation in power and genetic architecture between psychiatric disorders.
Asunto(s)
Genómica , Trastornos Mentales , Humanos , Trastornos Mentales/genéticaRESUMEN
A quarter of the world's population is estimated to meet the criteria for metabolic syndrome (MetS), a cluster of cardiometabolic risk factors that promote development of coronary artery disease and type 2 diabetes, leading to increased risk of premature death and significant health costs. In this study we investigate whether the genetics associated with MetS components mirror their phenotypic clustering. A multivariate approach that leverages genetic correlations of fasting glucose, HDL cholesterol, systolic blood pressure, triglycerides, and waist circumference was used, which revealed that these genetic correlations are best captured by a genetic one factor model. The common genetic factor genome-wide association study (GWAS) detects 235 associated loci, 174 more than the largest GWAS on MetS to date. Of these loci, 53 (22.5%) overlap with loci identified for two or more MetS components, indicating that MetS is a complex, heterogeneous disorder. Associated loci harbor genes that show increased expression in the brain, especially in GABAergic and dopaminergic neurons. A polygenic risk score drafted from the MetS factor GWAS predicts 5.9% of the variance in MetS. These results provide mechanistic insights into the genetics of MetS and suggestions for drug targets, especially fenofibrate, which has the promise of tackling multiple MetS components.
Asunto(s)
Diabetes Mellitus Tipo 2 , Fenofibrato , Síndrome Metabólico , Humanos , Síndrome Metabólico/epidemiología , HDL-Colesterol , Estudio de Asociación del Genoma Completo , Diabetes Mellitus Tipo 2/genética , Factores de Riesgo , Triglicéridos , Circunferencia de la Cintura , Presión Sanguínea , Glucosa , GlucemiaRESUMEN
Cerebellar volume is highly heritable and associated with neurodevelopmental and neurodegenerative disorders. Understanding the genetic architecture of cerebellar volume may improve our insight into these disorders. This study aims to investigate the convergence of cerebellar volume genetic associations in close detail. A genome-wide associations study for cerebellar volume was performed in a discovery sample of 27,486 individuals from UK Biobank, resulting in 30 genome-wide significant loci and a SNP heritability of 39.82%. We pinpoint the likely causal variants and those that have effects on amino acid sequence or cerebellar gene-expression. Additionally, 85 genome-wide significant genes were detected and tested for convergence onto biological pathways, cerebellar cell types, human evolutionary genes or developmental stages. Local genetic correlations between cerebellar volume and neurodevelopmental and neurodegenerative disorders reveal shared loci with Parkinson's disease, Alzheimer's disease and schizophrenia. These results provide insights into the heritable mechanisms that contribute to developing a brain structure important for cognitive functioning and mental health.
Asunto(s)
Estudio de Asociación del Genoma Completo , Esquizofrenia , Encéfalo , Estudio de Asociación del Genoma Completo/métodos , Humanos , Salud Mental , Polimorfismo de Nucleótido Simple , Esquizofrenia/genéticaRESUMEN
Genetic correlation (rg) analysis is used to identify phenotypes that may have a shared genetic basis. Traditionally, rg is studied globally, considering only the average of the shared signal across the genome, although this approach may fail when the rg is confined to particular genomic regions or in opposing directions at different loci. Current tools for local rg analysis are restricted to analysis of two phenotypes. Here we introduce LAVA, an integrated framework for local rg analysis that, in addition to testing the standard bivariate local rgs between two phenotypes, can evaluate local heritabilities and analyze conditional genetic relations between several phenotypes using partial correlation and multiple regression. Applied to 25 behavioral and health phenotypes, we show considerable heterogeneity in the bivariate local rgs across the genome, which is often masked by the global rg patterns, and demonstrate how our conditional approaches can elucidate more complex, multivariate genetic relations.
Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Mapeo Cromosómico , Genoma , FenotipoRESUMEN
With the rapidly increasing availability of large genetic data sets in recent years, Mendelian Randomization (MR) has quickly gained popularity as a novel secondary analysis method. Leveraging genetic variants as instrumental variables, MR can be used to estimate the causal effects of one phenotype on another even when experimental research is not feasible, and therefore has the potential to be highly informative. It is dependent on strong assumptions however, often producing biased results if these are not met. It is therefore imperative that these assumptions are well-understood by researchers aiming to use MR, in order to evaluate their validity in the context of their analyses and data. The aim of this perspective is therefore to further elucidate these assumptions and the role they play in MR, as well as how different kinds of data can be used to further support them.
Asunto(s)
Análisis de la Aleatorización Mendeliana , Causalidad , Humanos , Análisis de la Aleatorización Mendeliana/métodos , FenotipoRESUMEN
Animal and human studies have documented the existence of developmental windows (or sensitive periods) when experience can have lasting effects on brain structure or function, behavior, and disease. Although sensitive periods for depression likely arise through a complex interplay of genes and experience, this possibility has not yet been explored in humans. We examined the effect of genetic pathways regulating sensitive periods, alone and in interaction with common childhood adversities, on depression risk. Guided by a translational approach, we: (1) performed association analyses of three gene sets (60 genes) shown in animal studies to regulate sensitive periods using summary data from a genome-wide association study of depression (n = 807,553); (2) evaluated the developmental expression patterns of these genes using data from BrainSpan (n = 31), a transcriptional atlas of postmortem brain samples; and (3) tested gene-by-development interplay (dGxE) by analyzing the combined effect of common variants in sensitive period genes and time-varying exposure to two types of childhood adversity within a population-based birth cohort (n = 6254). The gene set regulating sensitive period opening associated with increased depression risk. Notably, 6 of the 15 genes in this set showed developmentally regulated gene-level expression. We also identified a statistical interaction between caregiver physical or emotional abuse during ages 1-5 years and genetic risk for depression conferred by the opening genes. Genes involved in regulating sensitive periods are differentially expressed across the life course and may be implicated in depression vulnerability. Our findings about gene-by-development interplay motivate further research in large, more diverse samples to further unravel the complexity of depression etiology through a sensitive period lens.
Asunto(s)
Depresión , Estudio de Asociación del Genoma Completo , Encéfalo , Preescolar , Depresión/genética , Humanos , Lactante , Acontecimientos que Cambian la Vida , Factores de RiesgoRESUMEN
The folding of the human cerebral cortex is a highly genetically regulated process that allows for a much larger surface area to fit into the cranial vault and optimizes functional organization. Sulcal depth is a robust yet understudied measure of localized folding, previously associated with multiple neurodevelopmental disorders. Here, we report the first genome-wide association study of sulcal depth. Through the multivariate omnibus statistical test (MOSTest) applied to vertex-wise measures from 33,748 U.K. Biobank participants (mean age, 64.3 years; 52.0% female), we identified 856 genome-wide significant loci (P < 5 × 10−8). Comparisons with cortical thickness and surface area indicated that sulcal depth has higher locus yield, heritability, and effective sample size. There was a large amount of genetic overlap between these traits, with gene-based analyses indicating strong associations with neurodevelopmental processes. Our findings demonstrate sulcal depth is a promising neuroimaging phenotype that may enhance our understanding of cortical morphology.
RESUMEN
The use of induced pluripotent stem cells (iPSC) to model human complex diseases is gaining popularity as it allows investigation of human cells that are otherwise sparsely available. However, due to its laborious and cost intensive nature, iPSC research is often plagued by limited sample size and putative large variability between clones, decreasing statistical power for detecting experimental effects. Here, we investigate the source and magnitude of variability in the proteome of parallel differentiated astrocytes using mass spectrometry. We compare three possible sources of variability: inter-donor variability, inter- and intra-clonal variability, at different stages of maturation. We show that the interclonal variability is significantly smaller than the inter-donor variability, and that including more donors has a much larger influence on statistical power than adding more clones per donor. Our results provide insight into the sources of variability at protein level between iPSC samples derived in parallel and will aid in optimizing iPSC studies.
Asunto(s)
Células Madre Pluripotentes Inducidas , Diferenciación Celular , Células Cultivadas , Humanos , Espectrometría de Masas , ProteomaRESUMEN
Gene-environment interactions (GxE) are often suggested to play an important role in the aetiology of psychiatric phenotypes, yet so far, only a handful of genome-wide environment interaction studies (GWEIS) of psychiatric phenotypes have been conducted. Representing the most comprehensive effort of its kind to date, we used data from the UK Biobank to perform a series of GWEIS for neuroticism across 25 broadly conceptualised environmental risk factors (trauma, social support, drug use, physical health). We investigated interactions on the level of SNPs, genes, and gene-sets, and computed interaction-based polygenic risk scores (PRS) to predict neuroticism in an independent sample subset (N = 10,000). We found that the predictive ability of the interaction-based PRSs did not significantly improve beyond that of a traditional PRS based on SNP main effects from GWAS, but detected one variant and two gene-sets showing significant interaction signal after correction for the number of analysed environments. This study illustrates the possibilities and limitations of a comprehensive GWEIS in currently available sample sizes.