Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 65
Filtrar
Más filtros

Bases de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Mol Psychiatry ; 29(3): 782-792, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38145985

RESUMEN

Enhancers play an essential role in the etiology of schizophrenia; however, the dysregulation of enhancer activity and its impact on the regulome in schizophrenia remains understudied. To address this gap in our knowledge, we assessed enhancer and gene expression in 1,382 brain samples comprising cases with schizophrenia and unaffected controls. Dysregulation of enhancer expression was concordant with changes in gene expression, and was more closely associated with schizophrenia polygenic risk, suggesting that enhancer dysregulation is proximal to the genetic etiology of the disease. Modeling the shared variance of cis-coordinated genes and enhancers revealed a gene regulatory program that was highly associated with genetic vulnerability to schizophrenia. By integrating coordinated factors with evolutionary constraints, we found that enhancers acquired during human evolution are more likely to regulate genes that are implicated in neuropsychiatric disorders and, thus, hold potential as therapeutic targets. Our analysis provides a systematic view of regulome dysregulation in schizophrenia and highlights its convergence with schizophrenia polygenic risk and human-gained enhancers.


Asunto(s)
Elementos de Facilitación Genéticos , Predisposición Genética a la Enfermedad , Herencia Multifactorial , Esquizofrenia , Humanos , Esquizofrenia/genética , Herencia Multifactorial/genética , Predisposición Genética a la Enfermedad/genética , Elementos de Facilitación Genéticos/genética , Masculino , Femenino , Estudio de Asociación del Genoma Completo/métodos , Encéfalo/metabolismo , Regulación de la Expresión Génica/genética , Factores de Riesgo , Polimorfismo de Nucleótido Simple/genética , Adulto
2.
Bioinformatics ; 37(2): 192-201, 2021 04 19.
Artículo en Inglés | MEDLINE | ID: mdl-32730587

RESUMEN

SUMMARY: Large-scale transcriptome studies with multiple samples per individual are widely used to study disease biology. Yet, current methods for differential expression are inadequate for cross-individual testing for these repeated measures designs. Most problematic, we observe across multiple datasets that current methods can give reproducible false-positive findings that are driven by genetic regulation of gene expression, yet are unrelated to the trait of interest. Here, we introduce a statistical software package, dream, that increases power, controls the false positive rate, enables multiple types of hypothesis tests, and integrates with standard workflows. In 12 analyses in 6 independent datasets, dream yields biological insight not found with existing software while addressing the issue of reproducible false-positive findings. AVAILABILITY AND IMPLEMENTATION: Dream is available within the variancePartition Bioconductor package at http://bioconductor.org/packages/variancePartition. CONTACT: gabriel.hoffman@mssm.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Perfilación de la Expresión Génica , Programas Informáticos , Regulación de la Expresión Génica , Transcriptoma
3.
Am J Hum Genet ; 102(6): 1169-1184, 2018 06 07.
Artículo en Inglés | MEDLINE | ID: mdl-29805045

RESUMEN

Causal genes and variants within genome-wide association study (GWAS) loci can be identified by integrating GWAS statistics with expression quantitative trait loci (eQTL) and determining which variants underlie both GWAS and eQTL signals. Most analyses, however, consider only the marginal eQTL signal, rather than dissect this signal into multiple conditionally independent signals for each gene. Here we show that analyzing conditional eQTL signatures, which could be important under specific cellular or temporal contexts, leads to improved fine mapping of GWAS associations. Using genotypes and gene expression levels from post-mortem human brain samples (n = 467) reported by the CommonMind Consortium (CMC), we find that conditional eQTL are widespread; 63% of genes with primary eQTL also have conditional eQTL. In addition, genomic features associated with conditional eQTL are consistent with context-specific (e.g., tissue-, cell type-, or developmental time point-specific) regulation of gene expression. Integrating the 2014 Psychiatric Genomics Consortium schizophrenia (SCZ) GWAS and CMC primary and conditional eQTL data reveals 40 loci with strong evidence for co-localization (posterior probability > 0.8), including six loci with co-localization of conditional eQTL. Our co-localization analyses support previously reported genes, identify novel genes associated with schizophrenia risk, and provide specific hypotheses for their functional follow-up.


Asunto(s)
Estudio de Asociación del Genoma Completo , Corteza Prefrontal/patología , Sitios de Carácter Cuantitativo/genética , Esquizofrenia/genética , Células Cultivadas , Epigénesis Genética , Genoma Humano , Humanos
4.
Bioinformatics ; 36(9): 2856-2861, 2020 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-32003784

RESUMEN

MOTIVATION: Identifying correlated epigenetic features and finding differences in correlation between individuals with disease compared to controls can give novel insight into disease biology. This framework has been successful in analysis of gene expression data, but application to epigenetic data has been limited by the computational cost, lack of scalable software and lack of robust statistical tests. RESULTS: Decorate, differential epigenetic correlation test, identifies correlated epigenetic features and finds clusters of features that are differentially correlated between two or more subsets of the data. The software scales to genome-wide datasets of epigenetic assays on hundreds of individuals. We apply decorate to four large-scale datasets of DNA methylation, ATAC-seq and histone modification ChIP-seq. AVAILABILITY AND IMPLEMENTATION: decorate R package is available from https://github.com/GabrielHoffman/decorate. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Biología Computacional , Programas Informáticos , Epigénesis Genética , Epigenómica , Genoma , Humanos
5.
Nucleic Acids Res ; 47(20): 10597-10611, 2019 11 18.
Artículo en Inglés | MEDLINE | ID: mdl-31544924

RESUMEN

Identifying functional variants underlying disease risk and adoption of personalized medicine are currently limited by the challenge of interpreting the functional consequences of genetic variants. Predicting the functional effects of disease-associated protein-coding variants is increasingly routine. Yet, the vast majority of risk variants are non-coding, and predicting the functional consequence and prioritizing variants for functional validation remains a major challenge. Here, we develop a deep learning model to accurately predict locus-specific signals from four epigenetic assays using only DNA sequence as input. Given the predicted epigenetic signal from DNA sequence for the reference and alternative alleles at a given locus, we generate a score of the predicted epigenetic consequences for 438 million variants observed in previous sequencing projects. These impact scores are assay-specific, are predictive of allele-specific transcription factor binding and are enriched for variants associated with gene expression and disease risk. Nucleotide-level functional consequence scores for non-coding variants can refine the mechanism of known functional variants, identify novel risk variants and prioritize downstream experiments.


Asunto(s)
Ensamble y Desensamble de Cromatina , Aprendizaje Profundo , Estudio de Asociación del Genoma Completo/métodos , Código de Histonas , Polimorfismo Genético , Análisis de Secuencia de ADN/métodos , Epigénesis Genética , Humanos
6.
Mol Psychiatry ; 24(1): 49-66, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-29483625

RESUMEN

The development of human-induced pluripotent stem cells (hiPSCs) has made possible patient-specific modeling across the spectrum of human disease. Here, we discuss recent advances in psychiatric genomics and post-mortem studies that provide critical insights concerning cell-type composition and sample size that should be considered when designing hiPSC-based studies of complex genetic disease. We review recent hiPSC-based models of SZ, in light of our new understanding of critical power limitations in the design of hiPSC-based studies of complex genetic disorders. Three possible solutions are a movement towards genetically stratified cohorts of rare variant patients, application of CRISPR technologies to engineer isogenic neural cells to study the impact of common variants, and integration of advanced genetics and hiPSC-based datasets in future studies. Overall, we emphasize that to advance the reproducibility and relevance of hiPSC-based studies, stem cell biologists must contemplate statistical and biological considerations that are already well accepted in the field of genetics. We conclude with a discussion of the hypothesis of biological convergence of disease-through molecular, cellular, circuit, and patient level phenotypes-and how this might emerge through hiPSC-based studies.


Asunto(s)
Células Madre Pluripotentes Inducidas/fisiología , Trastornos Mentales/metabolismo , Trastornos Mentales/fisiopatología , Diferenciación Celular , Humanos , Modelos Biológicos , Neuronas , Fenotipo , Reproducibilidad de los Resultados
8.
Hum Mol Genet ; 24(14): 4147-57, 2015 Jul 15.
Artículo en Inglés | MEDLINE | ID: mdl-25935003

RESUMEN

Genome-wide association studies in Crohn's disease (CD) have identified 140 genome-wide significant loci. However, identification of genes driving association signals remains challenging. Furthermore, genome-wide significant thresholds limit false positives at the expense of decreased sensitivity. In this study, we explored gene features contributing to CD pathogenicity, including gene-based association data from CD and autoimmune (AI) diseases, as well as gene expression features (eQTLs, epigenetic markers of expression and intestinal gene expression data). We developed an integrative model based on a CD reference gene set. This integrative approach outperformed gene-based association signals alone in identifying CD-related genes based on statistical validation, gene ontology enrichment, differential expression between M1 and M2 macrophages and a validation using genes causing monogenic forms of inflammatory bowel disease as a reference. Besides gene-level CD association P-values, association with AI diseases was the strongest predictor, highlighting generalized mechanisms of inflammation, and the interferon-γ pathway particularly. Within the 140 high-confidence CD regions, 598 of 1328 genes had low prioritization scores, highlighting genes unlikely to contribute to CD pathogenesis. For select regions, comparably high integrative model scores were observed for multiple genes. This is particularly evident for regions having extensive linkage disequilibrium such as the IBD5 locus. Our analyses provide a standardized reference for prioritizing potential CD-related genes, in regions with both highly significant and nominally significant gene-level association P-values. Our integrative model may be particularly valuable in prioritizing rare, potentially private, missense variants for which genome-wide evidence for association may be unattainable.


Asunto(s)
Enfermedad de Crohn/genética , Expresión Génica , Estudios de Casos y Controles , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Interferón gamma/metabolismo , Intestinos , Desequilibrio de Ligamiento , Modelos Logísticos , Macrófagos , Análisis por Micromatrices , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ARN
9.
Bioinformatics ; 32(12): i101-i110, 2016 06 15.
Artículo en Inglés | MEDLINE | ID: mdl-27307606

RESUMEN

MOTIVATION: Underrepresentation of racial groups represents an important challenge and major gap in phenomics research. Most of the current human phenomics research is based primarily on European populations; hence it is an important challenge to expand it to consider other population groups. One approach is to utilize data from EMR databases that contain patient data from diverse demographics and ancestries. The implications of this racial underrepresentation of data can be profound regarding effects on the healthcare delivery and actionability. To the best of our knowledge, our work is the first attempt to perform comparative, population-scale analyses of disease networks across three different populations, namely Caucasian (EA), African American (AA) and Hispanic/Latino (HL). RESULTS: We compared susceptibility profiles and temporal connectivity patterns for 1988 diseases and 37 282 disease pairs represented in a clinical population of 1 025 573 patients. Accordingly, we revealed appreciable differences in disease susceptibility, temporal patterns, network structure and underlying disease connections between EA, AA and HL populations. We found 2158 significantly comorbid diseases for the EA cohort, 3265 for AA and 672 for HL. We further outlined key disease pair associations unique to each population as well as categorical enrichments of these pairs. Finally, we identified 51 key 'hub' diseases that are the focal points in the race-centric networks and of particular clinical importance. Incorporating race-specific disease comorbidity patterns will produce a more accurate and complete picture of the disease landscape overall and could support more precise understanding of disease relationships and patient management towards improved clinical outcomes. CONTACTS: rong.chen@mssm.edu or joel.dudley@mssm.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Registros Electrónicos de Salud , Negro o Afroamericano , Bases de Datos Factuales , Hispánicos o Latinos , Humanos , Población Blanca
10.
BMC Bioinformatics ; 17(1): 483, 2016 Nov 25.
Artículo en Inglés | MEDLINE | ID: mdl-27884101

RESUMEN

BACKGROUND: As large-scale studies of gene expression with multiple sources of biological and technical variation become widely adopted, characterizing these drivers of variation becomes essential to understanding disease biology and regulatory genetics. RESULTS: We describe a statistical and visualization framework, variancePartition, to prioritize drivers of variation based on a genome-wide summary, and identify genes that deviate from the genome-wide trend. Using a linear mixed model, variancePartition quantifies variation in each expression trait attributable to differences in disease status, sex, cell or tissue type, ancestry, genetic background, experimental stimulus, or technical variables. Analysis of four large-scale transcriptome profiling datasets illustrates that variancePartition recovers striking patterns of biological and technical variation that are reproducible across multiple datasets. CONCLUSIONS: Our open source software, variancePartition, enables rapid interpretation of complex gene expression studies as well as other high-throughput genomics assays. variancePartition is available from Bioconductor: http://bioconductor.org/packages/variancePartition .


Asunto(s)
Biología Computacional/métodos , Perfilación de la Expresión Génica , Variación Genética/genética , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Algoritmos , Regulación de la Expresión Génica , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Modelos Lineales
11.
Bioinformatics ; 30(21): 3134-5, 2014 Nov 01.
Artículo en Inglés | MEDLINE | ID: mdl-25035399

RESUMEN

UNLABELLED: The linear mixed model is the state-of-the-art method to account for the confounding effects of kinship and population structure in genome-wide association studies (GWAS). Current implementations test the effect of one or more genetic markers while including prespecified covariates such as sex. Here we develop an efficient implementation of the linear mixed model that allows composite hypothesis tests to consider genotype interactions with variables such as other genotypes, environment, sex or ancestry. Our R package, lrgpr, allows interactive model fitting and examination of regression diagnostics to facilitate exploratory data analysis in the context of the linear mixed model. By leveraging parallel and out-of-core computing for datasets too large to fit in main memory, lrgpr is applicable to large GWAS datasets and next-generation sequencing data. AVAILABILITY AND IMPLEMENTATION: lrgpr is an R package available from lrgpr.r-forge.r-project.org.


Asunto(s)
Estudio de Asociación del Genoma Completo/métodos , Programas Informáticos , Genotipo , Modelos Lineales
12.
PLoS Comput Biol ; 9(6): e1003101, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23825936

RESUMEN

Penalized Multiple Regression (PMR) can be used to discover novel disease associations in GWAS datasets. In practice, proposed PMR methods have not been able to identify well-supported associations in GWAS that are undetectable by standard association tests and thus these methods are not widely applied. Here, we present a combined algorithmic and heuristic framework for PUMA (Penalized Unified Multiple-locus Association) analysis that solves the problems of previously proposed methods including computational speed, poor performance on genome-scale simulated data, and identification of too many associations for real data to be biologically plausible. The framework includes a new minorize-maximization (MM) algorithm for generalized linear models (GLM) combined with heuristic model selection and testing methods for identification of robust associations. The PUMA framework implements the penalized maximum likelihood penalties previously proposed for GWAS analysis (i.e. Lasso, Adaptive Lasso, NEG, MCP), as well as a penalty that has not been previously applied to GWAS (i.e. LOG). Using simulations that closely mirror real GWAS data, we show that our framework has high performance and reliably increases power to detect weak associations, while existing PMR methods can perform worse than single marker testing in overall performance. To demonstrate the empirical value of PUMA, we analyzed GWAS data for type 1 diabetes, Crohns's disease, and rheumatoid arthritis, three autoimmune diseases from the original Wellcome Trust Case Control Consortium. Our analysis replicates known associations for these diseases and we discover novel etiologically relevant susceptibility loci that are invisible to standard single marker tests, including six novel associations implicating genes involved in pancreatic function, insulin pathways and immune-cell function in type 1 diabetes; three novel associations implicating genes in pro- and anti-inflammatory pathways in Crohn's disease; and one novel association implicating a gene involved in apoptosis pathways in rheumatoid arthritis. We provide software for applying our PUMA analysis framework.


Asunto(s)
Estudio de Asociación del Genoma Completo , Modelos Teóricos , Análisis de Regresión , Humanos
13.
Science ; 384(6698): eadh4265, 2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38781378

RESUMEN

Nucleotide variants in cell type-specific gene regulatory elements in the human brain are risk factors for human disease. We measured chromatin accessibility in 1932 aliquots of sorted neurons and non-neurons from 616 human postmortem brains and identified 34,539 open chromatin regions with chromatin accessibility quantitative trait loci (caQTLs). Only 10.4% of caQTLs are shared between neurons and non-neurons, which supports cell type-specific genetic regulation of the brain regulome. Incorporating allele-specific chromatin accessibility improves statistical fine-mapping and refines molecular mechanisms that underlie disease risk. Using massively parallel reporter assays in induced excitatory neurons, we screened 19,893 brain QTLs and identified the functional impact of 476 regulatory variants. Combined, this comprehensive resource captures variation in the human brain regulome and provides insights into disease etiology.


Asunto(s)
Encefalopatías , Encéfalo , Cromatina , Regulación de la Expresión Génica , Elementos Reguladores de la Transcripción , Humanos , Alelos , Encéfalo/metabolismo , Encefalopatías/genética , Cromatina/metabolismo , Neuronas/metabolismo , Sitios de Carácter Cuantitativo , Masculino , Femenino
14.
Science ; 384(6698): eadg5136, 2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38781388

RESUMEN

The complexity and heterogeneity of schizophrenia have hindered mechanistic elucidation and the development of more effective therapies. Here, we performed single-cell dissection of schizophrenia-associated transcriptomic changes in the human prefrontal cortex across 140 individuals in two independent cohorts. Excitatory neurons were the most affected cell group, with transcriptional changes converging on neurodevelopment and synapse-related molecular pathways. Transcriptional alterations included known genetic risk factors, suggesting convergence of rare and common genomic variants on neuronal population-specific alterations in schizophrenia. Based on the magnitude of schizophrenia-associated transcriptional change, we identified two populations of individuals with schizophrenia marked by expression of specific excitatory and inhibitory neuronal cell states. This single-cell atlas links transcriptomic changes to etiological genetic risk factors, contextualizing established knowledge within the human cortical cytoarchitecture and facilitating mechanistic understanding of schizophrenia pathophysiology and heterogeneity.


Asunto(s)
Predisposición Genética a la Enfermedad , Neuroglía , Neuronas , Corteza Prefrontal , Esquizofrenia , Análisis de la Célula Individual , Adulto , Femenino , Humanos , Masculino , Estudios de Cohortes , Neuronas/metabolismo , Corteza Prefrontal/metabolismo , Factores de Riesgo , Esquizofrenia/genética , Sinapsis/metabolismo , Transcriptoma , Adulto Joven , Persona de Mediana Edad , Anciano , Anciano de 80 o más Años , Neuroglía/metabolismo
15.
Biol Psychiatry ; 95(2): 187-198, 2024 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-37454787

RESUMEN

BACKGROUND: Converging evidence from large-scale genetic and postmortem studies highlights the role of aberrant neurotransmission and genetic regulation in brain-related disorders. However, identifying neuronal activity-regulated transcriptional programs in the human brain and understanding how changes contribute to disease remain challenging. METHODS: To better understand how the activity-dependent regulome contributes to risk for brain-related disorders, we profiled the transcriptomic and epigenomic changes following neuronal depolarization in human induced pluripotent stem cell-derived glutamatergic neurons (NGN2) from 6 patients with schizophrenia and 5 control participants. RESULTS: Multiomic data integration associated global patterns of chromatin accessibility with gene expression and identified enhancer-promoter interactions in glutamatergic neurons. Within 1 hour of potassium chloride-induced depolarization, independent of diagnosis, glutamatergic neurons displayed substantial activity-dependent changes in the expression of genes regulating synaptic function. Depolarization-induced changes in the regulome revealed significant heritability enrichment for schizophrenia and Parkinson's disease, adding to mounting evidence that sequence variation within activation-dependent regulatory elements contributes to the genetic risk for brain-related disorders. Gene coexpression network analysis elucidated interactions among activity-dependent and disease-associated genes and pointed to a key driver (NAV3) that interacted with multiple genes involved in axon guidance. CONCLUSIONS: Overall, we demonstrated that deciphering the activity-dependent regulome in glutamatergic neurons reveals novel targets for advanced diagnosis and therapy.


Asunto(s)
Células Madre Pluripotentes Inducidas , Esquizofrenia , Humanos , Células Madre Pluripotentes Inducidas/metabolismo , Regulación de la Expresión Génica , Neuronas/metabolismo , Encéfalo
16.
Sci Adv ; 10(21): eadh2588, 2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38781336

RESUMEN

Sample-wise deconvolution methods estimate cell-type proportions and gene expressions in bulk tissue samples, yet their performance and biological applications remain unexplored, particularly in human brain transcriptomic data. Here, nine deconvolution methods were evaluated with sample-matched data from bulk tissue RNA sequencing (RNA-seq), single-cell/nuclei (sc/sn) RNA-seq, and immunohistochemistry. A total of 1,130,767 nuclei per cells from 149 adult postmortem brains and 72 organoid samples were used. The results showed the best performance of dtangle for estimating cell proportions and bMIND for estimating sample-wise cell-type gene expressions. For eight brain cell types, 25,273 cell-type eQTLs were identified with deconvoluted expressions (decon-eQTLs). The results showed that decon-eQTLs explained more schizophrenia GWAS heritability than bulk tissue or single-cell eQTLs did alone. Differential gene expressions associated with Alzheimer's disease, schizophrenia, and brain development were also examined using the deconvoluted data. Our findings, which were replicated in bulk tissue and single-cell data, provided insights into the biological applications of deconvoluted data in multiple brain disorders.


Asunto(s)
Encéfalo , Análisis de la Célula Individual , Transcriptoma , Humanos , Encéfalo/metabolismo , Análisis de la Célula Individual/métodos , Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/metabolismo , Enfermedad de Alzheimer/patología , Perfilación de la Expresión Génica/métodos , Esquizofrenia/genética , Esquizofrenia/metabolismo , Esquizofrenia/patología , Estudio de Asociación del Genoma Completo/métodos , Análisis de Secuencia de ARN/métodos , Adulto
17.
Science ; 384(6698): eadi5199, 2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38781369

RESUMEN

Single-cell genomics is a powerful tool for studying heterogeneous tissues such as the brain. Yet little is understood about how genetic variants influence cell-level gene expression. Addressing this, we uniformly processed single-nuclei, multiomics datasets into a resource comprising >2.8 million nuclei from the prefrontal cortex across 388 individuals. For 28 cell types, we assessed population-level variation in expression and chromatin across gene families and drug targets. We identified >550,000 cell type-specific regulatory elements and >1.4 million single-cell expression quantitative trait loci, which we used to build cell-type regulatory and cell-to-cell communication networks. These networks manifest cellular changes in aging and neuropsychiatric disorders. We further constructed an integrative model accurately imputing single-cell expression and simulating perturbations; the model prioritized ~250 disease-risk genes and drug targets with associated cell types.


Asunto(s)
Encéfalo , Redes Reguladoras de Genes , Trastornos Mentales , Análisis de la Célula Individual , Humanos , Envejecimiento/genética , Encéfalo/metabolismo , Comunicación Celular/genética , Cromatina/metabolismo , Cromatina/genética , Genómica , Trastornos Mentales/genética , Corteza Prefrontal/metabolismo , Corteza Prefrontal/fisiología , Sitios de Carácter Cuantitativo
18.
bioRxiv ; 2024 Mar 30.
Artículo en Inglés | MEDLINE | ID: mdl-38562822

RESUMEN

Single-cell genomics is a powerful tool for studying heterogeneous tissues such as the brain. Yet, little is understood about how genetic variants influence cell-level gene expression. Addressing this, we uniformly processed single-nuclei, multi-omics datasets into a resource comprising >2.8M nuclei from the prefrontal cortex across 388 individuals. For 28 cell types, we assessed population-level variation in expression and chromatin across gene families and drug targets. We identified >550K cell-type-specific regulatory elements and >1.4M single-cell expression-quantitative-trait loci, which we used to build cell-type regulatory and cell-to-cell communication networks. These networks manifest cellular changes in aging and neuropsychiatric disorders. We further constructed an integrative model accurately imputing single-cell expression and simulating perturbations; the model prioritized ~250 disease-risk genes and drug targets with associated cell types.

19.
bioRxiv ; 2023 Mar 02.
Artículo en Inglés | MEDLINE | ID: mdl-37090548

RESUMEN

Nucleotide variants in cell type-specific gene regulatory elements in the human brain are major risk factors of human disease. We measured chromatin accessibility in sorted neurons and glia from 1,932 samples of human postmortem brain and identified 34,539 open chromatin regions with chromatin accessibility quantitative trait loci (caQTL). Only 10.4% of caQTL are shared between neurons and glia, supporting the cell type specificity of genetic regulation of the brain regulome. Incorporating allele specific chromatin accessibility improves statistical fine-mapping and refines molecular mechanisms underlying disease risk. Using massively parallel reporter assays in induced excitatory neurons, we screened 19,893 brain QTLs, identifying the functional impact of 476 regulatory variants. Combined, this comprehensive resource captures variation in the human brain regulome and provides novel insights into brain disease etiology.

20.
Res Sq ; 2023 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-37205331

RESUMEN

Advances in single-cell and -nucleus transcriptomics have enabled generation of increasingly large-scale datasets from hundreds of subjects and millions of cells. These studies promise to give unprecedented insight into the cell type specific biology of human disease. Yet performing differential expression analyses across subjects remains difficult due to challenges in statistical modeling of these complex studies and scaling analyses to large datasets. Our open-source R package dreamlet (DiseaseNeurogenomics.github.io/dreamlet) uses a pseudobulk approach based on precision-weighted linear mixed models to identify genes differentially expressed with traits across subjects for each cell cluster. Designed for data from large cohorts, dreamlet is substantially faster and uses less memory than existing workflows, while supporting complex statistical models and controlling the false positive rate. We demonstrate computational and statistical performance on published datasets, and a novel dataset of 1.4M single nuclei from postmortem brains of 150 Alzheimer's disease cases and 149 controls.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA