RESUMO
There is growing evidence for the role of DNA methylation (DNAm) quantitative trait loci (mQTLs) in the genetics of complex traits, including psychiatric disorders. However, due to extensive linkage disequilibrium (LD) of the genome, it is challenging to identify causal genetic variations that drive DNAm levels by population-based genetic association studies. This limits the utility of mQTLs for fine-mapping risk loci underlying psychiatric disorders identified by genome-wide association studies (GWAS). Here we present INTERACT, a deep learning model that integrates convolutional neural networks with transformer, to predict effects of genetic variations on DNAm levels at CpG sites in the human brain. We show that INTERACT-derived DNAm regulatory variants are not confounded by LD, are concentrated in regulatory genomic regions in the human brain, and are convergent with mQTL evidence from genetic association analysis. We further demonstrate that predicted DNAm regulatory variants are enriched for heritability of brain-related traits and improve polygenic risk prediction for schizophrenia across diverse ancestry samples. Finally, we applied predicted DNAm regulatory variants for fine-mapping schizophrenia GWAS risk loci to identify potential novel risk genes. Our study shows the power of a deep learning approach to identify functional regulatory variants that may elucidate the genetic basis of complex traits.
Assuntos
Química Encefálica , Metilação de DNA , Aprendizado Profundo , Esquizofrenia , Encéfalo , Ilhas de CpG , Estudo de Associação Genômica Ampla , Humanos , Redes Neurais de Computação , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Esquizofrenia/genéticaRESUMO
Neurons derived from human induced pluripotent stem cells (hiPSCs) have been used to model basic cellular aspects of neuropsychiatric disorders, but the relationship between the emergent phenotypes and the clinical characteristics of donor individuals has been unclear. We analyzed RNA expression and indices of cellular function in hiPSC-derived neural progenitors and cortical neurons generated from 13 individuals with high polygenic risk scores (PRSs) for schizophrenia (SCZ) and a clinical diagnosis of SCZ, along with 15 neurotypical individuals with low PRS. We identified electrophysiological measures in the patient-derived neurons that implicated altered Na+ channel function, action potential interspike interval, and gamma-aminobutyric acid-ergic neurotransmission. Importantly, electrophysiological measures predicted cardinal clinical and cognitive features found in these SCZ patients. The identification of basic neuronal physiological properties related to core clinical characteristics of illness is a potentially critical step in generating leads for novel therapeutics.
Assuntos
Cognição/fisiologia , Fenômenos Eletrofisiológicos , Células-Tronco Pluripotentes Induzidas/fisiologia , Neurônios/fisiologia , Esquizofrenia/fisiopatologia , Animais , Linhagem Celular , Reprogramação Celular , Córtex Cerebral/patologia , Humanos , Ativação do Canal Iônico , Cinética , Masculino , Fenótipo , Ratos , Esquizofrenia/diagnóstico , Canais de Sódio/metabolismoRESUMO
MicroRNAs (miRNAs) are small non-coding RNAs (sncRNAs) that function in post-transcriptional gene regulation through imperfect base pairing with mRNA targets which results in inhibition of translation and typically destabilization of bound transcripts. Sequence-based algorithms historically used to predict miRNA targets face inherent challenges in reliably reflecting in vivo interactions. Recent strategies have directly profiled miRNA-target interactions by crosslinking and ligation of sncRNAs to their targets within the RNA-induced silencing complex (RISC), followed by high throughput sequencing of the chimeric sncRNA:target RNAs. Despite the strength of these direct profiling approaches, standardized pipelines for effectively analyzing the resulting chimeric sncRNA:target RNA sequencing data are not readily available. Here we present SCRAP, a robust Small Chimeric RNA Analysis Pipeline for the bioinformatic processing of chimeric sncRNA:target RNA sequencing data. SCRAP consists of two parts, each of which are specifically optimized for the distinctive characteristics of chimeric small RNA sequencing reads: first, read processing and alignment and second, peak calling and annotation. We apply SCRAP to benchmark chimeric sncRNA:target RNA sequencing datasets generated by distinct molecular approaches, and compare SCRAP to existing chimeric RNA analysis pipelines. SCRAP has minimal hardware requirements, is cross-platform, and contains extensive annotation to broaden accessibility for processing small chimeric RNA sequencing data and enable insights about the targets of small non-coding RNAs in regulating diverse biological systems.
RESUMO
BACKGROUND: Bisulfite sequencing is a powerful tool for profiling genomic methylation, an epigenetic modification critical in the understanding of cancer, psychiatric disorders, and many other conditions. Raw data generated by whole genome bisulfite sequencing (WGBS) requires several computational steps before it is ready for statistical analysis, and particular care is required to process data in a timely and memory-efficient manner. Alignment to a reference genome is one of the most computationally demanding steps in a WGBS workflow, taking several hours or even days with commonly used WGBS-specific alignment software. This naturally motivates the creation of computational workflows that can utilize GPU-based alignment software to greatly speed up the bottleneck step. In addition, WGBS produces raw data that is large and often unwieldy; a lack of memory-efficient representation of data by existing pipelines renders WGBS impractical or impossible to many researchers. RESULTS: We present BiocMAP, a Bioconductor-friendly methylation analysis pipeline consisting of two modules, to address the above concerns. The first module performs computationally-intensive read alignment using Arioc, a GPU-accelerated short-read aligner. Since GPUs are not always available on the same computing environments where traditional CPU-based analyses are convenient, the second module may be run in a GPU-free environment. This module extracts and merges DNA methylation proportions-the fractions of methylated cytosines across all cells in a sample at a given genomic site. Bioconductor-based output objects in R utilize an on-disk data representation to drastically reduce required main memory and make WGBS projects computationally feasible to more researchers. CONCLUSIONS: BiocMAP is implemented using Nextflow and available at http://research.libd.org/BiocMAP/ . To enable reproducible analysis across a variety of typical computing environments, BiocMAP can be containerized with Docker or Singularity, and executed locally or with the SLURM or SGE scheduling engines. By providing Bioconductor objects, BiocMAP's output can be integrated with powerful analytical open source software for analyzing methylation data.
Assuntos
Genômica , Sulfitos , Humanos , Análise de Sequência de DNA , Sequenciamento Completo do GenomaRESUMO
Transcriptome compartmentalization by the nuclear membrane provides both stochastic and functional buffering of transcript activity in the cytoplasm, and has recently been implicated in neurodegenerative disease processes. Although many mechanisms regulating transcript compartmentalization are also prevalent in brain development, the extent to which subcellular localization differs as the brain matures has yet to be addressed. To characterize the nuclear and cytoplasmic transcriptomes during brain development, we sequenced both RNA fractions from homogenate prenatal and adult human postmortem cortex using poly(A)+ and Ribo-Zero library preparation methods. We find that while many genes are differentially expressed by fraction and developmental expression changes are similarly detectable in nuclear and cytoplasmic RNA, the compartmented transcriptomes become more distinct as the brain matures, perhaps reflecting increased utilization of nuclear retention as a regulatory strategy in adult brain. We examined potential mechanisms of this developmental divergence including alternative splicing, RNA editing, nuclear pore composition, RNA-binding protein motif enrichment, and RNA secondary structure. Intron retention is associated with greater nuclear abundance in a subset of transcripts, as is enrichment for several splicing factor binding motifs. Finally, we examined disease association with fraction-regulated gene sets and found nuclear-enriched genes were also preferentially enriched in gene sets associated with neurodevelopmental psychiatric disorders. These results suggest that although gene-level expression is globally comparable between fractions, nuclear retention of transcripts may play an underappreciated role in developmental regulation of gene expression in brain, particularly in genes whose dysregulation is related to neuropsychiatric disorders.
Assuntos
Núcleo Celular/metabolismo , Córtex Cerebral/metabolismo , Citoplasma/metabolismo , Predisposição Genética para Doença , Transtornos Mentais/genética , Transtornos Mentais/psicologia , Transcriptoma , Fatores Etários , Processamento Alternativo , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Estudos de Associação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Anotação de Sequência Molecular , Edição de RNARESUMO
Long noncoding RNAs (lncRNAs) have emerged as key coordinators of biological and cellular processes. Characterizing lncRNA expression across cells and tissues is key to understanding their role in determining phenotypes, including human diseases. We present here FC-R2, a comprehensive expression atlas across a broadly defined human transcriptome, inclusive of over 109,000 coding and noncoding genes, as described in the FANTOM CAGE-Associated Transcriptome (FANTOM-CAT) study. This atlas greatly extends the gene annotation used in the original recount2 resource. We demonstrate the utility of the FC-R2 atlas by reproducing key findings from published large studies and by generating new results across normal and diseased human samples. In particular, we (a) identify tissue-specific transcription profiles for distinct classes of coding and noncoding genes, (b) perform differential expression analysis across thirteen cancer types, identifying novel noncoding genes potentially involved in tumor pathogenesis and progression, and (c) confirm the prognostic value for several enhancer lncRNAs expression in cancer. Our resource is instrumental for the systematic molecular characterization of lncRNA by the FANTOM6 Consortium. In conclusion, comprised of over 70,000 samples, the FC-R2 atlas will empower other researchers to investigate functions and biological roles of both known coding genes and novel lncRNAs.
Assuntos
Transcriptoma , Bases de Dados Genéticas , Elementos Facilitadores Genéticos , Perfilação da Expressão Gênica , Genoma Humano , Humanos , Neoplasias/genética , Especificidade de Órgãos , Prognóstico , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , RNA Mensageiro/metabolismoRESUMO
BACKGROUND: Multispectral fluorescence imaging coupled with linear unmixing is a form of image data collection and analysis that allows for measuring multiple molecular signals in a single biological sample. Multiple fluorescent dyes, each measuring a unique molecule, are simultaneously measured and subsequently "unmixed" to provide a read-out for each molecular signal. This strategy allows for measuring highly multiplexed signals in a single data capture session, such as multiple proteins or RNAs in tissue slices or cultured cells, but can often result in mixed signals and bleed-through problems across dyes. Existing spectral unmixing algorithms are not optimized for challenging biological specimens such as post-mortem human brain tissue, and often require manual intervention to extract spectral signatures. We therefore developed an intuitive, automated, and flexible package called SUFI: spectral unmixing of fluorescent images. RESULTS: This package unmixes multispectral fluorescence images by automating the extraction of spectral signatures using vertex component analysis, and then performs one of three unmixing algorithms derived from remote sensing. We evaluate these remote sensing algorithms' performances on four unique biological datasets and compare the results to unmixing results obtained using ZEN Black software (Zeiss). We lastly integrate our unmixing pipeline into the computational tool dotdotdot, which is used to quantify individual RNA transcripts at single cell resolution in intact tissues and perform differential expression analysis, and thereby provide an end-to-end solution for multispectral fluorescence image analysis and quantification. CONCLUSIONS: In summary, we provide a robust, automated pipeline to assist biologists with improved spectral unmixing of multispectral fluorescence images.
Assuntos
Algoritmos , Software , Humanos , Animais , Camundongos , Microscopia de Fluorescência/métodos , Corantes Fluorescentes , Encéfalo/diagnóstico por imagemRESUMO
Identification and characterisation of novel targets for treatment is a priority in the field of psychiatry. FKBP5 is a gene with decades of evidence suggesting its pathogenic role in a subset of psychiatric patients, with potential to be leveraged as a therapeutic target for these individuals. While it is widely reported that FKBP5/FKBP51 mRNA/protein (FKBP5/1) expression is impacted by psychiatric disease state, risk genotype and age, it is not known in which cell types and sub-anatomical areas of the human brain this occurs. This knowledge is critical to propel FKBP5/1-targeted treatment development. Here, we performed an extensive, large-scale postmortem study (n = 1024) of FKBP5/1, examining neocortical areas (BA9, BA11 and ventral BA24/BA24a) derived from subjects that lived with schizophrenia, major depression or bipolar disorder. With an extensive battery of RNA (bulk RNA sequencing, single-nucleus RNA sequencing, microarray, qPCR, RNAscope) and protein (immunoblot, immunohistochemistry) analysis approaches, we thoroughly investigated the effects of disease state, ageing and genotype on cortical FKBP5/1 expression including in a cell type-specific manner. We identified consistently heightened FKBP5/1 levels in psychopathology and with age, but not genotype, with these effects strongest in schizophrenia. Using single-nucleus RNA sequencing (snRNAseq; BA9 and BA11) and targeted histology (BA9, BA24a), we established that these disease and ageing effects on FKBP5/1 expression were most pronounced in excitatory superficial layer neurons of the neocortex, and this effect appeared to be consistent in both the granular and agranular areas examined. We then found that this increase in FKBP5 levels may impact on synaptic plasticity, as FKBP5 gex levels strongly and inversely correlated with dendritic mushroom spine density and brain-derived neurotrophic factor (BDNF) levels in superficial layer neurons in BA11. These findings pinpoint a novel cellular and molecular mechanism that has potential to open a new avenue of FKBP51 drug development to treat cognitive symptoms in psychiatric disorders.
Assuntos
Transtornos Mentais , Neocórtex , Humanos , Transtornos Mentais/genética , Envelhecimento/genética , Neurônios , Genótipo , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Antipsychotic drugs are the current first-line of treatment for schizophrenia and other psychotic conditions. However, their molecular effects on the human brain are poorly studied, due to difficulty of tissue access and confounders associated with disease status. Here we examine differences in gene expression and DNA methylation associated with positive antipsychotic drug toxicology status in the human caudate nucleus. We find no genome-wide significant differences in DNA methylation, but abundant differences in gene expression. These gene expression differences are overall quite similar to gene expression differences between schizophrenia cases and controls. Interestingly, gene expression differences based on antipsychotic toxicology are different between brain regions, potentially due to affected cell type differences. We finally assess similarities with effects in a mouse model, which finds some overlapping effects but many differences as well. As a first look at the molecular effects of antipsychotics in the human brain, the lack of epigenetic effects is unexpected, possibly because long term treatment effects may be relatively stable for extended periods.
Assuntos
Antipsicóticos , Transtornos Psicóticos , Esquizofrenia , Animais , Antipsicóticos/farmacologia , Antipsicóticos/uso terapêutico , Núcleo Caudado , Humanos , Camundongos , Fenótipo , Transtornos Psicóticos/tratamento farmacológico , Esquizofrenia/tratamento farmacológico , Esquizofrenia/genéticaRESUMO
BACKGROUND: Spatially-resolved transcriptomics has now enabled the quantification of high-throughput and transcriptome-wide gene expression in intact tissue while also retaining the spatial coordinates. Incorporating the precise spatial mapping of gene activity advances our understanding of intact tissue-specific biological processes. In order to interpret these novel spatial data types, interactive visualization tools are necessary. RESULTS: We describe spatialLIBD, an R/Bioconductor package to interactively explore spatially-resolved transcriptomics data generated with the 10x Genomics Visium platform. The package contains functions to interactively access, visualize, and inspect the observed spatial gene expression data and data-driven clusters identified with supervised or unsupervised analyses, either on the user's computer or through a web application. CONCLUSIONS: spatialLIBD is available at https://bioconductor.org/packages/spatialLIBD . It is fully compatible with SpatialExperiment and the Bioconductor ecosystem. Its functionality facilitates analyzing and interactively exploring spatially-resolved data from the Visium platform.
Assuntos
Ecossistema , Transcriptoma , Genômica , SoftwareRESUMO
BACKGROUND: Calcium imaging is a powerful technique for recording cellular activity across large populations of neurons. However, analysis methods capable of single-cell resolution in cultured neurons, especially for cultures derived from human induced pluripotent stem cells (hiPSCs), are lacking. Existing methods lack scalability to accommodate high-throughput comparisons between multiple lines, across developmental timepoints, or across pharmacological manipulations. RESULTS: To address this need we developed CaPTure, a scalable, automated Ca2+ imaging analysis pipeline ( https://github.com/LieberInstitute/CaPTure ). CaPTuredetects neurons, classifies and quantifies spontaneous activity, quantifies synchrony metrics, and generates cell- and network-specific metrics that facilitate phenotypic discovery. The method is compatible with parallel processing on computing clusters without requiring significant user input or parameter modification. CONCLUSION: CaPTure allows for rapid assessment of neuronal activity in cultured cells at cellular resolution, rendering it amenable to high-throughput screening and phenotypic discovery. The platform can be applied to both human- and rodent-derived neurons and is compatible with many imaging systems.
Assuntos
Cálcio , Células-Tronco Pluripotentes Induzidas , Humanos , Neurônios , Processamento de Imagem Assistida por Computador , Linhagem CelularRESUMO
While a definitive understanding of schizophrenia etiology is far from current reality, an increasing body of evidence implicates perturbations in early development that alter the trajectory of brain maturation in this disorder, leading to abnormal function in early childhood and adulthood. This atypical development likely arises from an interaction of many brain cell types that follow distinct developmental paths. Because both cellular identity and development are governed by the transcriptome and epigenome, two levels of gene regulation that have the potential to reflect both genetic and environmental influences, mapping "omic" changes over development in diverse cells is a fruitful avenue for schizophrenia research. In this review, we provide a survey of human brain cellular composition and development, levels of genomic regulation that determine cellular identity and developmental trajectories, and what is known about how genomic regulation is dysregulated in specific cell types in schizophrenia. We also outline technical challenges and solutions to conducting cell type-specific functional genomic studies in human postmortem brain.
Assuntos
Encéfalo/metabolismo , Encéfalo/patologia , Esquizofrenia/genética , Esquizofrenia/patologia , Regulação da Expressão Gênica , Humanos , TranscriptomaRESUMO
Genome-wide association studies have identified single nucleotide polymorphisms (SNPs) associated with schizophrenia risk. Integration of RNA-sequencing data from postmortem human brains with these risk SNPs identified transcripts associated with increased schizophrenia susceptibility, including a class of exon 9-spliced isoforms of Sorting nexin-19 (SNX19d9) and an isoform of Arsenic methyltransferase (AS3MT) splicing out exons 2 and 3 (AS3MTd2d3). However, the biological function of these transcript variants is unclear. Defining the cell types where these risk transcripts are dominantly expressed is an important step to understand function, in prioritizing specific cell types and/or neural pathways in subsequent studies. To identify the cell type-specific localization of SNX19 and AS3MT in the human dorsolateral prefrontal cortex (DLPFC), we used single-molecule in situ hybridization techniques combined with automated quantification and machine learning approaches to analyze 10 postmortem brains of neurotypical individuals. These analyses revealed that both pan-SNX19 and pan-AS3MT were more highly expressed in neurons than non-neurons in layers II/III and VI of DLPFC. Furthermore, pan-SNX19 was preferentially expressed in glutamatergic neurons, while pan-AS3MT was preferentially expressed in GABAergic neurons. Finally, we utilized duplex BaseScope technology, to delineate the localization of SNX19d9 and AS3MTd2d3 splice variants, revealing consistent trends in spatial gene expression among pan-transcripts and schizophrenia risk-related transcript variants. These findings demonstrate that schizophrenia risk transcripts have distinct localization patterns in the healthy human brains, and suggest that SNX19 transcripts might disrupt the normal function of glutamatergic neurons, while AS3MT may lead to disturbances in the GABAergic system in the pathophysiology of schizophrenia.
Assuntos
Metiltransferases , Esquizofrenia , Nexinas de Classificação/genética , Encéfalo/metabolismo , Córtex Pré-Frontal Dorsolateral , Estudo de Associação Genômica Ampla , Humanos , Hibridização In Situ , Metiltransferases/genética , Polimorfismo de Nucleotídeo Único , Esquizofrenia/genéticaRESUMO
Multiplex single-molecule fluorescent in situ hybridization (smFISH) is a powerful method for validating RNA sequencing and emerging spatial transcriptomic data, but quantification remains a computational challenge. We present a framework for generating and analyzing smFISH data in complex tissues while overcoming autofluorescence and increasing multiplexing capacity. We developed dotdotdot (https://github.com/LieberInstitute/dotdotdot) as a corresponding software package to quantify RNA transcripts in single nuclei and perform differential expression analysis. We first demonstrate robustness of our platform in single mouse neurons by quantifying differential expression of activity-regulated genes. We then quantify spatial gene expression in human dorsolateral prefrontal cortex (DLPFC) using spectral imaging and dotdotdot to mask lipofuscin autofluorescence. We lastly apply machine learning to predict cell types and perform downstream cell type-specific expression analysis. In summary, we provide experimental workflows, imaging acquisition and analytic strategies for quantification and biological interpretation of smFISH data in complex tissues.
Assuntos
Automação , Hibridização in Situ Fluorescente/métodos , Imagem Individual de Molécula , Software , Adolescente , Adulto , Animais , Humanos , Processamento de Imagem Assistida por Computador , Lipofuscina/análise , Aprendizado de Máquina , Masculino , Camundongos , Neurônios/citologia , Neurônios/metabolismo , Especificidade de Órgãos , Córtex Pré-Frontal/citologia , Córtex Pré-Frontal/metabolismo , RNA Mensageiro/análiseRESUMO
BACKGROUND: RNA sequencing (RNA-seq) is a common and widespread biological assay, and an increasing amount of data is generated with it. In practice, there are a large number of individual steps a researcher must perform before raw RNA-seq reads yield directly valuable information, such as differential gene expression data. Existing software tools are typically specialized, only performing one step-such as alignment of reads to a reference genome-of a larger workflow. The demand for a more comprehensive and reproducible workflow has led to the production of a number of publicly available RNA-seq pipelines. However, we have found that most require computational expertise to set up or share among several users, are not actively maintained, or lack features we have found to be important in our own analyses. RESULTS: In response to these concerns, we have developed a Scalable Pipeline for Expression Analysis and Quantification (SPEAQeasy), which is easy to install and share, and provides a bridge towards R/Bioconductor downstream analysis solutions. SPEAQeasy is portable across computational frameworks (SGE, SLURM, local, docker integration) and different configuration files are provided ( http://research.libd.org/SPEAQeasy/ ). CONCLUSIONS: SPEAQeasy is user-friendly and lowers the computational-domain entry barrier for biologists and clinicians to RNA-seq data processing as the main input file is a table with sample names and their corresponding FASTQ files. The goal is to provide a flexible pipeline that is immediately usable by researchers, regardless of their technical background or computing environment.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Software , RNA-Seq , Análise de Sequência de RNA , Fluxo de TrabalhoRESUMO
Obesity and type 2 diabetes are rapidly increasing in the adolescent population. We sought to determine whether adipokines, specifically leptin, C1q/TNF-related proteins 1 (CTRP1) and CTRP9, and the hepatokine fibroblast growth factor 21 (FGF21), are associated with obesity and hyperglycemia in a cohort of lean and obese adolescents, across the spectrum of glycemia. In an observational, longitudinal study of lean and obese adolescents, we measured fasting laboratory tests, oral glucose tolerance tests, and adipokines including leptin, CTRP1, CTRP9, and FGF21. Participants completed baseline and 2-year follow-up study visits and were categorized as lean (LC, lean control; n = 30), obese normoglycemic (ONG; n = 61), and obese hyperglycemic (OHG; n = 31) adolescents at baseline and lean (n = 8), ONG (n = 18), and OHG (n = 4) at follow-up. Groups were compared using ANOVA and regression analysis, and linear mixed effects modeling was used to test for differences in adipokine levels across baseline and follow-up visits. Results showed that at baseline, leptin was higher in all obese groups (P < 0.001) compared with LC. FGF21 was higher in OHG participants compared with LC (P < 0.001) and ONG (P < 0.001) and positively associated with fasting glucose (P < 0.001), fasting insulin (P < 0.001), Homeostasis Model Assessment-Insulin Resistance Index (HOMA-IR; P < 0.001), and hemoglobin A1c (HbA1c; P = 0.01). CTRP1 was higher in OHG compared with ONG (P = 0.03). CTRP9 was not associated with obesity or hyperglycemia in this pediatric cohort. At 2 years, leptin decreased in ONG (P = 0.003) and FGF21 increased in OHG (P = 0.02), relative to lean controls. Altered adipokine levels are associated with the inflammatory milieu in obese youth with and without hyperglycemia. In adolescence, the novel adipokine CTRP1 was elevated with hyperglycemia, whereas CTRP9 was unchanged in this cohort.NEW & NOTEWORTHY Leptin is higher in obese adolescents and FGF21 is higher in obese hyperglycemic adolescents. The novel adipokine CTRP1 is higher in obese hyperglycemic adolescents, whereas CTRP9 was unchanged in this adolescent cohort.
Assuntos
Adipocinas/sangue , Glicemia/metabolismo , Obesidade Infantil/sangue , Adipocinas/análise , Adolescente , Glicemia/fisiologia , Criança , Estudos Transversais , Diabetes Mellitus Tipo 2/sangue , Diabetes Mellitus Tipo 2/complicações , Feminino , Seguimentos , Intolerância à Glucose/sangue , Intolerância à Glucose/complicações , Teste de Tolerância a Glucose , Hemoglobinas Glicadas/análise , Hemoglobinas Glicadas/metabolismo , Humanos , Resistência à Insulina/fisiologia , Estudos Longitudinais , Masculino , Obesidade Infantil/complicações , Estado Pré-Diabético/sangue , Estado Pré-Diabético/complicaçõesRESUMO
Schizophrenia polygenic risk is plausibly manifested by complex transcriptional dysregulation in the brain, involving networks of co-expressed and functionally related genes. The main purpose of this study was to identify and prioritize co-expressed gene sets in a hierarchical manner, based on the strength of the relationships with clinical diagnosis and with polygenic risk for schizophrenia. Weighted Gene Co-expression Network Analysis (WGCNA) was applied to RNA-quality-adjusted DLPFC RNA-Seq data from the LIBD Postmortem Human Brain Repository (90 controls, 74 schizophrenia cases; all Caucasians) to construct co-expression networks and detect "modules" of co-expressed genes. After multiple internal and external validation procedures, modules of selected interest were tested for enrichment in biological ontologies, for association with schizophrenia polygenic risk scores (PRSs) and with diagnosis, and also for enrichment in genes within the significant GWAS loci reported by the Psychiatric Genomic Consortium (PGC2). The association between schizophrenia genetic signals and modules of co-expression converged on one module showing not only a significant association with both diagnosis and PRS but also significant overlap with 36 PGC2 loci genes, deemed the strongest candidates for drug targets. This module contained many genes involved in synaptic signaling and neuroplasticity. Fifty-three PGC2 genes were in modules associated only with diagnosis and 59 in modules unrelated to diagnosis or PRS. Our study highlights complex relationships between gene co-expression networks in the brain and clinical state and polygenic risk for SCZ and provides a strategy for using this information in selecting and prioritizing potentially targetable gene sets for therapeutic drug development.
Assuntos
Redes Reguladoras de Genes/genética , Esquizofrenia/genética , Transcriptoma/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Autopsia , Encéfalo/metabolismo , Feminino , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Humanos , Masculino , Pessoa de Meia-Idade , Herança Multifatorial/genética , Córtex Pré-Frontal/metabolismo , População Branca/genéticaRESUMO
Cigarette smoking during pregnancy is a major public health concern. While there are well-described consequences in early child development, there is very little known about the effects of maternal smoking on human cortical biology during prenatal life. We therefore performed a genome-wide differential gene expression analysis using RNA sequencing (RNA-seq) on prenatal (N = 33; 16 smoking-exposed) as well as adult (N = 207; 57 active smokers) human postmortem prefrontal cortices. Smoking exposure during the prenatal period was directly associated with differential expression of 14 genes; in contrast, during adulthood, despite a much larger sample size, only two genes showed significant differential expression (FDR < 10%). Moreover, 1,315 genes showed significantly different exposure effects between maternal smoking during pregnancy and direct exposure in adulthood (FDR < 10%)-these differences were largely driven by prenatal differences that were enriched for pathways previously implicated in addiction and synaptic function. Furthermore, prenatal and age-dependent differentially expressed genes were enriched for genes implicated in non-syndromic autism spectrum disorder (ASD) and were differentially expressed as a set between patients with ASD and controls in postmortem cortical regions. These results underscore the enhanced sensitivity to the biological effect of smoking exposure in the developing brain and offer insight into how maternal smoking during pregnancy affects gene expression in the prenatal human cortex. They also begin to address the relationship between in utero exposure to smoking and the heightened risks for the subsequent development of neuropsychiatric disorders.
Assuntos
Transtorno do Espectro Autista , Efeitos Tardios da Exposição Pré-Natal , Adulto , Encéfalo , Feminino , Humanos , Exposição Materna , Gravidez , Efeitos Tardios da Exposição Pré-Natal/genética , Análise de Sequência de RNA , Fumar/efeitos adversos , Fumar/genética , Transcriptoma/genéticaRESUMO
Genome-wide association studies (GWAS) have identified many genomic loci associated with risk for schizophrenia, but unambiguous identification of the relationship between disease-associated variants and specific genes, and in particular their effect on risk conferring transcripts, has proven difficult. To better understand the specific molecular mechanism(s) at the schizophrenia locus in 11q25, we undertook cis expression quantitative trait loci (cis-eQTL) mapping for this 2 megabase genomic region using postmortem human brain samples. To comprehensively assess the effects of genetic risk upon local expression, we evaluated multiple transcript features: genes, exons, and exon-exon junctions in multiple brain regions-dorsolateral prefrontal cortex (DLPFC), hippocampus, and caudate. Genetic risk variants strongly associated with expression of SNX19 transcript features that tag multiple rare classes of SNX19 transcripts, whereas they only weakly affected expression of an exon-exon junction that tags the majority of abundant transcripts. The most prominent class of SNX19 risk-associated transcripts is predicted to be overexpressed, defined by an exon-exon splice junction between exons 8 and 10 (junc8.10) and that is predicted to encode proteins that lack the characteristic nexin C terminal domain. Risk alleles were also associated with either increased or decreased expression of multiple additional classes of transcripts. With RACE, molecular cloning, and long read sequencing, we found a number of novel SNX19 transcripts that further define the set of potential etiological transcripts. We explored epigenetic regulation of SNX19 expression and found that DNA methylation at CpG sites near the primary transcription start site and within exon 2 partially mediate the effects of risk variants on risk-associated expression. ATAC sequencing revealed that some of the most strongly risk-associated SNPs are located within a region of open chromatin, suggesting a nearby regulatory element is involved. These findings indicate a potentially complex molecular etiology, in which risk alleles for schizophrenia generate epigenetic alterations and dysregulation of multiple classes of SNX19 transcripts.
Assuntos
Esquizofrenia/genética , Nexinas de Classificação/genética , Adulto , Alelos , Autopsia , Encéfalo/metabolismo , Cromatina/metabolismo , Mapeamento Cromossômico/métodos , Metilação de DNA , Éxons/genética , Feminino , Expressão Gênica/genética , Frequência do Gene/genética , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , Fatores de Risco , Nexinas de Classificação/metabolismoRESUMO
RNA sequencing (RNA-seq) is a powerful approach for measuring gene expression levels in cells and tissues, but it relies on high-quality RNA. We demonstrate here that statistical adjustment using existing quality measures largely fails to remove the effects of RNA degradation when RNA quality associates with the outcome of interest. Using RNA-seq data from molecular degradation experiments of human primary tissues, we introduce a method-quality surrogate variable analysis (qSVA)-as a framework for estimating and removing the confounding effect of RNA quality in differential expression analysis. We show that this approach results in greatly improved replication rates (>3×) across two large independent postmortem human brain studies of schizophrenia and also removes potential RNA quality biases in earlier published work that compared expression levels of different brain regions and other diagnostic groups. Our approach can therefore improve the interpretation of differential expression analysis of transcriptomic data from human tissue.