RESUMO
Previous genome-wide association studies (GWAS) of hematological traits have identified over 10 000 distinct trait-specific risk loci. However, at these loci, the underlying causal mechanisms remain incompletely characterized. To elucidate novel biology and better understand causal mechanisms at known loci, we performed a transcriptome-wide association study (TWAS) of 29 hematological traits in 399 835 UK Biobank (UKB) participants of European ancestry using gene expression prediction models trained from whole blood RNA-seq data in 922 individuals. We discovered 557 gene-trait associations for hematological traits distinct from previously reported GWAS variants in European populations. Among the 557 associations, 301 were available for replication in a cohort of 141 286 participants of European ancestry from the Million Veteran Program. Of these 301 associations, 108 replicated at a strict Bonferroni adjusted threshold ($\alpha$= 0.05/301). Using our TWAS results, we systematically assigned 4261 out of 16 900 previously identified hematological trait GWAS variants to putative target genes. Compared to coloc, our TWAS results show reduced specificity and increased sensitivity in external datasets to assign variants to target genes.
Assuntos
Estudo de Associação Genômica Ampla , Transcriptoma , Bancos de Espécimes Biológicos , Células Sanguíneas , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla/métodos , Humanos , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Transcriptoma/genética , Reino UnidoRESUMO
The circadian clock coordinates a variety of immune responses with signals from the external environment to promote survival. We investigated the potential reciprocal relationship between the circadian clock and skin inflammation. We treated mice topically with the Toll-like receptor 7 (TLR7) agonist imiquimod (IMQ) to activate IFN-sensitive gene (ISG) pathways and induce psoriasiform inflammation. IMQ transiently altered core clock gene expression, an effect mirrored in human patient psoriatic lesions. In mouse skin 1 d after IMQ treatment, ISGs, including the key ISG transcription factor IFN regulatory factor 7 (Irf7), were more highly induced after treatment during the day than the night. Nuclear localization of phosphorylated-IRF7 was most prominently time-of-day dependent in epidermal leukocytes, suggesting that these cell types play an important role in the diurnal ISG response to IMQ. Mice lacking Bmal1 systemically had exacerbated and arrhythmic ISG/Irf7 expression after IMQ. Furthermore, daytime-restricted feeding, which affects the phase of the skin circadian clock, reverses the diurnal rhythm of IMQ-induced ISG expression in the skin. These results suggest a role for the circadian clock, driven by BMAL1, as a negative regulator of the ISG response, and highlight the finding that feeding time can modulate the skin immune response. Since the IFN response is essential for the antiviral and antitumor effects of TLR activation, these findings are consistent with the time-of-day-dependent variability in the ability to fight microbial pathogens and tumor initiation and offer support for the use of chronotherapy for their treatment.
Assuntos
Ritmo Circadiano , Imunidade Inata/genética , Interferons/genética , Glicoproteínas de Membrana/genética , Pele/metabolismo , Receptor 7 Toll-Like/genética , Fatores de Transcrição ARNTL/genética , Fatores de Transcrição ARNTL/metabolismo , Animais , Proteínas CLOCK/genética , Proteínas CLOCK/metabolismo , Imiquimode/farmacologia , Indutores de Interferon/farmacologia , Fator Regulador 7 de Interferon/genética , Fator Regulador 7 de Interferon/metabolismo , Interferons/metabolismo , Masculino , Glicoproteínas de Membrana/agonistas , Glicoproteínas de Membrana/metabolismo , Camundongos , Camundongos Endogâmicos C57BL , Pele/efeitos dos fármacos , Receptor 7 Toll-Like/agonistas , Receptor 7 Toll-Like/metabolismoRESUMO
Germline mutations modulate the risk of developing schizophrenia (SCZ). Much less is known about the role of mosaic somatic mutations in the context of SCZ. Deep (239×) whole-genome sequencing (WGS) of brain neurons from 61 SCZ cases and 25 controls postmortem identified mutations occurring during prenatal neurogenesis. SCZ cases showed increased somatic variants in open chromatin, with increased mosaic CpG transversions (CpG>GpG) and T>G mutations at transcription factor binding sites (TFBSs) overlapping open chromatin, a result not seen in controls. Some of these variants alter gene expression, including SCZ risk genes and genes involved in neurodevelopment. Although these mutational processes can reflect a difference in factors indirectly involved in disease, increased somatic mutations at developmental TFBSs could also potentially contribute to SCZ.
Assuntos
Encéfalo , Mutação em Linhagem Germinativa , Mosaicismo , Esquizofrenia , Fatores de Transcrição , Feminino , Humanos , Masculino , Sítios de Ligação , Encéfalo/embriologia , Encéfalo/metabolismo , Estudos de Casos e Controles , Cromatina/metabolismo , Ilhas de CpG , Neurogênese/genética , Neurônios/metabolismo , Esquizofrenia/genética , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Sequenciamento Completo do GenomaRESUMO
OBJECTIVE: Treatment-resistant depression (TRD) occurs in roughly one-third of all individuals with major depressive disorder (MDD). Although research has suggested a significant common variant genetic component of liability to TRD, with heritability estimated at 8% when compared with non-treatment-resistant MDD, no replicated genetic loci have been identified, and the genetic architecture of TRD remains unclear. A key barrier to this work has been the paucity of adequately powered cohorts for investigation, largely because of the challenge in prospectively investigating this phenotype. The objective of this study was to perform a well-powered genetic study of TRD. METHODS: Using receipt of electroconvulsive therapy (ECT) as a surrogate for TRD, the authors applied standard machine learning methods to electronic health record data to derive predicted probabilities of receiving ECT. These probabilities were then applied as a quantitative trait in a genome-wide association study of 154,433 genotyped patients across four large biobanks. RESULTS: Heritability estimates ranged from 2% to 4.2%, and significant genetic overlap was observed with cognition, attention deficit hyperactivity disorder, schizophrenia, alcohol and smoking traits, and body mass index. Two genome-wide significant loci were identified, both previously implicated in metabolic traits, suggesting shared biology and potential pharmacological implications. CONCLUSIONS: This work provides support for the utility of estimation of disease probability for genomic investigation and provides insights into the genetic architecture and biology of TRD.
Assuntos
Transtorno Depressivo Maior , Transtorno Depressivo Resistente a Tratamento , Eletroconvulsoterapia , Estudo de Associação Genômica Ampla , Humanos , Transtorno Depressivo Resistente a Tratamento/genética , Transtorno Depressivo Resistente a Tratamento/terapia , Feminino , Masculino , Transtorno Depressivo Maior/genética , Transtorno Depressivo Maior/terapia , Pessoa de Meia-Idade , Aprendizado de Máquina , Adulto , Fenótipo , Idoso , Índice de Massa Corporal , Esquizofrenia/genética , Esquizofrenia/terapiaRESUMO
Large-scale genome-wide association studies of schizophrenia have uncovered hundreds of associated loci but with extremely limited representation of African diaspora populations. We surveyed electronic health records of 200,000 individuals of African ancestry in the Million Veteran and All of Us Research Programs, and, coupled with genotype-level data from four case-control studies, realized a combined sample size of 13,012 affected and 54,266 unaffected persons. Three genome-wide significant signals - near PLXNA4, PMAIP1, and TRPA1 - are the first to be independently identified in populations of predominantly African ancestry. Joint analyses of African, European, and East Asian ancestries across 86,981 cases and 303,771 controls, yielded 376 distinct autosomal loci, which were refined to 708 putatively causal variants via multi-ancestry fine-mapping. Utilizing single-cell functional genomic data from human brain tissue and two complementary approaches, transcriptome-wide association studies and enhancer-promoter contact mapping, we identified a consensus set of 94 genes across ancestries and pinpointed the specific cell types in which they act. We identified reproducible associations of schizophrenia polygenic risk scores with schizophrenia diagnoses and a range of other mental and physical health problems. Our study addresses a longstanding gap in the generalizability of research findings for schizophrenia across ancestral populations, underlining shared biological underpinnings of schizophrenia across global populations in the presence of broadly divergent risk allele frequencies.
RESUMO
With the advent of healthcare-based genotyped biobanks, genome-wide association studies (GWAS) leverage larger sample sizes, incorporate patients with diverse ancestries and introduce noisier phenotypic definitions. Yet the extent and impact of phenotypic misclassification on large-scale datasets is not currently well understood due to a lack of statistical methods to estimate relevant parameters from empirical data. Here, we develop a statistical method and scalable software, PheMED, Phenotypic Measurement of Effective Dilution, to quantify phenotypic misclassification across GWAS using only summary statistics. We illustrate how the parameters estimated by PheMED relate to the negative and positive predictive value of the labeled phenotype, compared to ground truth, and how misclassification of the phenotype yields diluted effect-sizes of variant-phenotype associations. Furthermore, we apply our methodology to detect multiple instances of statistically significant dilution in real-world data. We demonstrate how effective dilution biases downstream GWAS replication and heritability analyses despite utilizing current best practices, and provide a dilution-aware meta-analysis approach that outperforms existing methods. Consequently, we anticipate that PheMED will be a valuable tool for researchers to address phenotypic data quality issues both within and across cohorts.
RESUMO
Amyotrophic lateral sclerosis (ALS) is a progressively fatal neurodegenerative disease affecting motor neurons in the brain and spinal cord. In this study, we investigated gene expression changes in ALS via RNA sequencing in 380 postmortem samples from cervical, thoracic and lumbar spinal cord segments from 154 individuals with ALS and 49 control individuals. We observed an increase in microglia and astrocyte gene expression, accompanied by a decrease in oligodendrocyte gene expression. By creating a gene co-expression network in the ALS samples, we identified several activated microglia modules that negatively correlate with retrospective disease duration. We mapped molecular quantitative trait loci and found several potential ALS risk loci that may act through gene expression or splicing in the spinal cord and assign putative cell types for FNBP1, ACSL5, SH3RF1 and NFASC. Finally, we outline how common genetic variants associated with splicing of C9orf72 act as proxies for the well-known repeat expansion, and we use the same mechanism to suggest ATXN3 as a putative risk gene.
Assuntos
Esclerose Lateral Amiotrófica , Doenças Neurodegenerativas , Humanos , Esclerose Lateral Amiotrófica/genética , Esclerose Lateral Amiotrófica/metabolismo , Doenças Neurodegenerativas/metabolismo , Estudos Retrospectivos , Transcriptoma , Medula Espinal/metabolismoRESUMO
Advances in single-cell and -nucleus transcriptomics have enabled generation of increasingly large-scale datasets from hundreds of subjects and millions of cells. These studies promise to give unprecedented insight into the cell type specific biology of human disease. Yet performing differential expression analyses across subjects remains difficult due to challenges in statistical modeling of these complex studies and scaling analyses to large datasets. Our open-source R package dreamlet (DiseaseNeurogenomics.github.io/dreamlet) uses a pseudobulk approach based on precision-weighted linear mixed models to identify genes differentially expressed with traits across subjects for each cell cluster. Designed for data from large cohorts, dreamlet is substantially faster and uses less memory than existing workflows, while supporting complex statistical models and controlling the false positive rate. We demonstrate computational and statistical performance on published datasets, and a novel dataset of 1.4M single nuclei from postmortem brains of 150 Alzheimer's disease cases and 149 controls.
RESUMO
Binge eating disorder (BED) is the most common eating disorder, yet its genetic architecture remains largely unknown. Studying BED is challenging because it is often comorbid with obesity, a common and highly polygenic trait, and it is underdiagnosed in biobank data sets. To address this limitation, we apply a supervised machine-learning approach (using 822 cases of individuals diagnosed with BED) to estimate the probability of each individual having BED based on electronic medical records from the Million Veteran Program. We perform a genome-wide association study of individuals of African (n = 77,574) and European (n = 285,138) ancestry while controlling for body mass index to identify three independent loci near the HFE, MCHR2 and LRP11 genes and suggest APOE as a risk gene for BED. We identify shared heritability between BED and several neuropsychiatric traits, and implicate iron metabolism in the pathophysiology of BED. Overall, our findings provide insights into the genetics underlying BED and suggest directions for future translational research.
Assuntos
Transtorno da Compulsão Alimentar , Humanos , Transtorno da Compulsão Alimentar/genética , Transtorno da Compulsão Alimentar/psicologia , Estudo de Associação Genômica Ampla , Obesidade/genética , Fenótipo , FerroRESUMO
Advances in single-cell and -nucleus transcriptomics have enabled generation of increasingly large-scale datasets from hundreds of subjects and millions of cells. These studies promise to give unprecedented insight into the cell type specific biology of human disease. Yet performing differential expression analyses across subjects remains difficult due to challenges in statistical modeling of these complex studies and scaling analyses to large datasets. Our open-source R package dreamlet (DiseaseNeurogenomics.github.io/dreamlet) uses a pseudobulk approach based on precision-weighted linear mixed models to identify genes differentially expressed with traits across subjects for each cell cluster. Designed for data from large cohorts, dreamlet is substantially faster and uses less memory than existing workflows, while supporting complex statistical models and controlling the false positive rate. We demonstrate computational and statistical performance on published datasets, and a novel dataset of 1.4M single nuclei from postmortem brains of 150 Alzheimer's disease cases and 149 controls.
RESUMO
Non-coding variants increase risk of neuropsychiatric disease. However, our understanding of the cell-type specific role of the non-coding genome in disease is incomplete. We performed population scale (N=1,393) chromatin accessibility profiling of neurons and non-neurons from two neocortical brain regions: the anterior cingulate cortex and dorsolateral prefrontal cortex. Across both regions, we observed notable differences in neuronal chromatin accessibility between schizophrenia cases and controls. A per-sample disease pseudotime was positively associated with genetic liability for schizophrenia. Organizing chromatin into cis- and trans-regulatory domains, identified a prominent neuronal trans-regulatory domain (TRD1) active in immature glutamatergic neurons during fetal development. Polygenic risk score analysis using genetic variants within chromatin accessibility of TRD1 successfully predicted susceptibility to schizophrenia in the Million Veteran Program cohort. Overall, we present the most extensive resource to date of chromatin accessibility in the human cortex, yielding insights into the cell-type specific etiology of schizophrenia.
RESUMO
Non-coding variants increase risk of neuropsychiatric disease. However, our understanding of the cell-type specific role of the non-coding genome in disease is incomplete. We performed population scale (N=1,393) chromatin accessibility profiling of neurons and non-neurons from two neocortical brain regions: the anterior cingulate cortex and dorsolateral prefrontal cortex. Across both regions, we observed notable differences in neuronal chromatin accessibility between schizophrenia cases and controls. A per-sample disease pseudotime was positively associated with genetic liability for schizophrenia. Organizing chromatin into cis- and trans-regulatory domains, identified a prominent neuronal trans-regulatory domain (TRD1) active in immature glutamatergic neurons during fetal development. Polygenic risk score analysis using genetic variants within chromatin accessibility of TRD1 successfully predicted susceptibility to schizophrenia in the Million Veteran Program cohort. Overall, we present the most extensive resource to date of chromatin accessibility in the human cortex, yielding insights into the cell-type specific etiology of schizophrenia.
RESUMO
Genome-wide association studies (GWAS) have underrepresented individuals from non-European populations, impeding progress in characterizing the genetic architecture and consequences of health and disease traits. To address this, we present a population-stratified phenome-wide GWAS followed by a multi-population meta-analysis for 2,068 traits derived from electronic health records of 635,969 participants in the Million Veteran Program (MVP), a longitudinal cohort study of diverse U.S. Veterans genetically similar to the respective African (121,177), Admixed American (59,048), East Asian (6,702), and European (449,042) superpopulations defined by the 1000 Genomes Project. We identified 38,270 independent variants associating with one or more traits at experiment-wide P<4.6×10-11 significance; fine-mapping 6,318 signals identified from 613 traits to single-variant resolution. Among these, a third (2,069) of the associations were found only among participants genetically similar to non-European reference populations, demonstrating the importance of expanding diversity in genetic studies. Our work provides a comprehensive atlas of phenome-wide genetic associations for future studies dissecting the architecture of complex traits in diverse populations.
RESUMO
Recent efforts have identified genetic loci that are associated with coronavirus disease 2019 (COVID-19) infection rates and disease outcome severity. Translating these genetic findings into druggable genes that reduce COVID-19 host susceptibility is a critical next step. Using a translational genomics approach that integrates COVID-19 genetic susceptibility variants, multi-tissue genetically regulated gene expression (GReX), and perturbagen signatures, we identified IL10RB as the top candidate gene target for COVID-19 host susceptibility. In a series of validation steps, we show that predicted GReX upregulation of IL10RB and higher IL10RB expression in COVID-19 patient blood is associated with worse COVID-19 outcomes and that in vitro IL10RB overexpression is associated with increased viral load and activation of disease-relevant molecular pathways.
RESUMO
Importance: Although depression is a common psychiatric disorder, its underlying biological basis remains poorly understood. Pairing depression polygenic scores with the results of clinical laboratory tests can reveal biological processes involved in depression etiology and in the physiological changes resulting from depression. Objective: To characterize the association between depression polygenic scores and an inflammatory biomarker, ie, white blood cell count. Design, Setting, and Participants: This genetic association study was conducted from May 19, 2019, to June 5, 2021, using electronic health record data from 382â¯452 patients across 4 health care systems. Analyses were conducted separately in each health care system and meta-analyzed across all systems. Primary analyses were conducted in Vanderbilt University Medical Center's biobank. Replication analyses were conducted across 3 other PsycheMERGE sites: Icahn School of Medicine at Mount Sinai, Mass General Brigham, and the Million Veteran Program. All patients with available genetic data and recorded white blood cell count measurements were included in the analyses. Primary analyses were conducted in individuals of European descent and then repeated in a population of individuals of African descent. Exposures: Depression polygenic scores. Main Outcomes and Measures: White blood cell count. Results: Across the 4 PsycheMERGE sites, there were 382â¯452 total participants of European ancestry (18.7% female; median age, 57.9 years) and 12â¯383 participants of African ancestry (61.1% female; median age, 39.0 [range, birth-90.0 years]). A laboratory-wide association scan revealed a robust association between depression polygenic scores and white blood cell count (ß, 0.03; SE, 0.004; P = 1.07 × 10-17), which was replicated in a meta-analysis across the 4 health care systems (ß, 0.03; SE, 0.002; P = 1.03 × 10-136). Mediation analyses suggested a bidirectional association, with white blood cell count accounting for 2.5% of the association of depression polygenic score with depression diagnosis (95% CI, 2.2%-20.8%; P = 2.84 × 10-70) and depression diagnosis accounting for 9.8% of the association of depression polygenic score with white blood cell count (95% CI, 8.4%-11.1%; P = 1.78 × 10-44). Mendelian randomization provided additional support for an association between increased white blood count and depression risk, but depression modeled as the exposure showed no evidence of an influence on white blood cell counts. Conclusions and Relevance: This genetic association study found that increased depression polygenic scores were associated with increased white blood cell count, and suggests that this association may be bidirectional. These findings highlight the potential importance of the immune system in the etiology of depression and may motivate future development of clinical biomarkers and targeted treatment options for depression.
Assuntos
Transtorno Depressivo/sangue , Transtorno Depressivo/genética , Transtorno Depressivo/imunologia , Estudos de Associação Genética , Herança Multifatorial/genética , Neutrófilos , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Bancos de Espécimes Biológicos , Biomarcadores , Criança , Pré-Escolar , Registros Eletrônicos de Saúde , Feminino , Predisposição Genética para Doença , Humanos , Lactente , Recém-Nascido , Contagem de Leucócitos , Masculino , Análise da Randomização Mendeliana , Pessoa de Meia-Idade , Adulto JovemRESUMO
The migrating keratinocyte wound front is required for skin wound closure. Despite significant advances in wound healing research, we do not fully understand the molecular mechanisms that orchestrate collective keratinocyte migration. Here, we show that, in the wound front, the epidermal transcription factor Grainyhead like-3 (GRHL3) mediates decreased expression of the adherens junction protein E-cadherin; this results in relaxed adhesions between suprabasal keratinocytes, thus promoting collective cell migration and wound closure. Wound fronts from mice lacking GRHL3 in epithelial cells (Grhl3-cKO) have lower expression of Fascin-1 (FSCN1), a known negative regulator of E-cadherin. Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) on wounded keratinocytes shows decreased wound-induced chromatin accessibility near the Fscn1 gene in Grhl3-cKO mice, a region enriched for GRHL3 motifs. These data reveal a wound-induced GRHL3/FSCN1/E-cadherin pathway that regulates keratinocyte-keratinocyte adhesion during wound-front migration; this pathway is activated in acute human wounds and is altered in diabetic wounds in mice, suggesting translational relevance.
Assuntos
Proteínas de Transporte/genética , Adesão Celular/genética , Proteínas de Ligação a DNA/genética , Epiderme/lesões , Regulação da Expressão Gênica , Proteínas dos Microfilamentos/genética , RNA/genética , Fatores de Transcrição/genética , Cicatrização , Animais , Proteínas de Transporte/biossíntese , Linhagem Celular , Movimento Celular/genética , Proteínas de Ligação a DNA/biossíntese , Modelos Animais de Doenças , Epiderme/metabolismo , Epiderme/patologia , Queratinócitos/metabolismo , Camundongos , Proteínas dos Microfilamentos/biossíntese , Fatores de Transcrição/biossínteseRESUMO
BACKGROUND: Recent efforts have identified genetic loci that are associated with coronavirus disease 2019 (COVID-19) infection rates and disease outcome severity. Translating these genetic findings into druggable genes and readily available compounds that reduce COVID-19 host susceptibility is a critical next step. METHODS: We integrate COVID-19 genetic susceptibility variants, multi-tissue genetically regulated gene expression (GReX) and perturbargen signatures to identify candidate genes and compounds that reverse the predicted gene expression dysregulation associated with COVID-19 susceptibility. The top candidate gene is validated by testing both its GReX and observed blood transcriptome association with COVID-19 severity, as well as by in vitro perturbation to quantify effects on viral load and molecular pathway dysregulation. We validate the in silico drug repositioning analysis by examining whether the top candidate compounds decrease COVID-19 incidence based on epidemiological evidence. RESULTS: We identify IL10RB as the top key regulator of COVID-19 host susceptibility. Predicted GReX up-regulation of IL10RB and higher IL10RB expression in COVID-19 patient blood is associated with worse COVID-19 outcomes. In vitro IL10RB overexpression is associated with increased viral load and activation of immune-related molecular pathways. Azathioprine and retinol are prioritized as candidate compounds to reduce the likelihood of testing positive for COVID-19. CONCLUSIONS: We establish an integrative data-driven approach for gene target prioritization. We identify and validate IL10RB as a suitable molecular target for modulation of COVID-19 host susceptibility. Finally, we provide evidence for a few readily available medications that would warrant further investigation as drug repositioning candidates.
RESUMO
Neocortical circuits consist of stereotypical motifs that must self-assemble during development. Recent evidence suggests that the subtype identity of both excitatory projection neurons (PNs) and inhibitory interneurons (INs) is important for this process. We knocked out the transcription factor Satb2 in PNs to induce those of the intratelencephalic (IT) type to adopt a pyramidal tract (PT)-type identity. Loss of IT-type PNs selectively disrupted the lamination and circuit integration of INs derived from the caudal ganglionic eminence (CGE). Strikingly, reprogrammed PNs demonstrated reduced synaptic targeting of CGE-derived INs relative to controls. In control mice, IT-type PNs targeted neighboring CGE INs, while PT-type PNs did not in deep layers, confirming this lineage-dependent motif. Finally, single-cell RNA sequencing revealed that major CGE IN subtypes were conserved after loss of IT PNs, but with differential transcription of synaptic proteins and signaling molecules. Thus, IT-type PNs influence CGE-derived INs in a non-cell-autonomous manner during cortical development.