RESUMEN
Differential expression analysis of single-cell RNA sequencing (scRNA-seq) data is central for characterizing how experimental factors affect the distribution of gene expression. However, distinguishing between biological and technical sources of cell-cell variability and assessing the statistical significance of quantitative comparisons between cell groups remain challenging. We introduce Memento, a tool for robust and efficient differential analysis of mean expression, variability, and gene correlation from scRNA-seq data, scalable to millions of cells and thousands of samples. We applied Memento to 70,000 tracheal epithelial cells to identify interferon-responsive genes, 160,000 CRISPR-Cas9 perturbed T cells to reconstruct gene-regulatory networks, 1.2 million peripheral blood mononuclear cells (PBMCs) to map cell-type-specific quantitative trait loci (QTLs), and the 50-million-cell CELLxGENE Discover corpus to compare arbitrary cell groups. In all cases, Memento identified more significant and reproducible differences in mean expression compared with existing methods. It also identified differences in variability and gene correlation that suggest distinct transcriptional regulation mechanisms imparted by perturbations.
RESUMEN
CRISPR-Cas9 genome editing has enabled advanced T cell therapies, but occasional loss of the targeted chromosome remains a safety concern. To investigate whether Cas9-induced chromosome loss is a universal phenomenon and evaluate its clinical significance, we conducted a systematic analysis in primary human T cells. Arrayed and pooled CRISPR screens revealed that chromosome loss was generalizable across the genome and resulted in partial and entire loss of the targeted chromosome, including in preclinical chimeric antigen receptor T cells. T cells with chromosome loss persisted for weeks in culture, implying the potential to interfere with clinical use. A modified cell manufacturing process, employed in our first-in-human clinical trial of Cas9-engineered T cells (NCT03399448), reduced chromosome loss while largely preserving genome editing efficacy. Expression of p53 correlated with protection from chromosome loss observed in this protocol, suggesting both a mechanism and strategy for T cell engineering that mitigates this genotoxicity in the clinic.
Asunto(s)
Sistemas CRISPR-Cas , Aberraciones Cromosómicas , Edición Génica , Linfocitos T , Humanos , Cromosomas , Sistemas CRISPR-Cas/genética , Daño del ADN , Edición Génica/métodos , Ensayos Clínicos como AsuntoRESUMEN
Adoptive transfer of genetically modified immune cells holds great promise for cancer immunotherapy. CRISPR knockin targeting can improve cell therapies, but more high-throughput methods are needed to test which knockin gene constructs most potently enhance primary cell functions in vivo. We developed a widely adaptable technology to barcode and track targeted integrations of large non-viral DNA templates and applied it to perform pooled knockin screens in primary human T cells. Pooled knockin of dozens of unique barcoded templates into the T cell receptor (TCR)-locus revealed gene constructs that enhanced fitness in vitro and in vivo. We further developed pooled knockin sequencing (PoKI-seq), combining single-cell transcriptome analysis and pooled knockin screening to measure cell abundance and cell state ex vivo and in vivo. This platform nominated a novel transforming growth factor ß (TGF-ß) R2-41BB chimeric receptor that improved solid tumor clearance. Pooled knockin screening enables parallelized re-writing of endogenous genetic sequences to accelerate discovery of knockin programs for cell therapies.
Asunto(s)
Técnicas de Sustitución del Gen/métodos , Ingeniería Genética/métodos , Inmunoterapia/métodos , Animales , Células Sanguíneas , Sistemas CRISPR-Cas/genética , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas/genética , Humanos , Ratones , Ratones Endogámicos NOD , Ratones SCID , ARN Guía de Kinetoplastida/genética , Análisis de la Célula Individual/métodos , Linfocitos T , Transcriptoma/genéticaRESUMEN
Responses to anti-PD-1 immunotherapy occur but are infrequent in bladder cancer. The specific T cells that mediate tumor rejection are unknown. T cells from human bladder tumors and non-malignant tissue were assessed with single-cell RNA and paired T cell receptor (TCR) sequencing of 30,604 T cells from 7 patients. We find that the states and repertoires of CD8+ T cells are not distinct in tumors compared with non-malignant tissues. In contrast, single-cell analysis of CD4+ T cells demonstrates several tumor-specific states, including multiple distinct states of regulatory T cells. Surprisingly, we also find multiple cytotoxic CD4+ T cell states that are clonally expanded. These CD4+ T cells can kill autologous tumors in an MHC class II-dependent fashion and are suppressed by regulatory T cells. Further, a gene signature of cytotoxic CD4+ T cells in tumors predicts a clinical response in 244 metastatic bladder cancer patients treated with anti-PD-L1.
Asunto(s)
Linfocitos T CD4-Positivos/metabolismo , Neoplasias de la Vejiga Urinaria/genética , Neoplasias de la Vejiga Urinaria/metabolismo , Antígeno B7-H1/genética , Antígeno B7-H1/inmunología , Biomarcadores Farmacológicos/análisis , Linfocitos T CD4-Positivos/inmunología , Linfocitos T CD8-positivos/inmunología , Linfocitos T CD8-positivos/metabolismo , Regulación Neoplásica de la Expresión Génica/genética , Genes MHC Clase II , Humanos , Inhibidores de Puntos de Control Inmunológico/farmacología , Inmunoterapia , Linfocitos Infiltrantes de Tumor , Receptor de Muerte Celular Programada 1/genética , Receptores de Antígenos de Linfocitos T/genética , Análisis de la Célula Individual/métodos , Linfocitos T Reguladores , Neoplasias de la Vejiga Urinaria/inmunologíaRESUMEN
Human regulatory T (Treg) cells are essential for immune homeostasis. The transcription factor FOXP3 maintains Treg cell identity, yet the complete set of key transcription factors that control Treg cell gene expression remains unknown. Here, we used pooled and arrayed Cas9 ribonucleoprotein screens to identify transcription factors that regulate critical proteins in primary human Treg cells under basal and proinflammatory conditions. We then generated 54,424 single-cell transcriptomes from Treg cells subjected to genetic perturbations and cytokine stimulation, which revealed distinct gene networks individually regulated by FOXP3 and PRDM1, in addition to a network coregulated by FOXO1 and IRF4. We also discovered that HIVEP2, to our knowledge not previously implicated in Treg cell function, coregulates another gene network with SATB1 and is important for Treg cell-mediated immunosuppression. By integrating CRISPR screens and single-cell RNA-sequencing profiling, we have uncovered transcriptional regulators and downstream gene networks in human Treg cells that could be targeted for immunotherapies.
Asunto(s)
Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Linfocitos T Reguladores/inmunología , Linfocitos T Reguladores/metabolismo , Transcriptoma , Biomarcadores , Sistemas CRISPR-Cas , Susceptibilidad a Enfermedades , Técnicas de Inactivación de Genes , Marcación de Gen , Enfermedad Injerto contra Huésped/etiología , Secuenciación de Nucleótidos de Alto Rendimiento , HumanosRESUMEN
Type 1 interferon (IFN) is a key mediator of organismal responses to pathogens, eliciting prototypical "interferon signature genes" that encode antiviral and inflammatory mediators. For a global view of IFN signatures and regulatory pathways, we performed gene expression and chromatin analyses of the IFN-induced response across a range of immunocyte lineages. These distinguished ISGs by cell-type specificity, kinetics, and sensitivity to tonic IFN and revealed underlying changes in chromatin configuration. We combined 1,398 human and mouse datasets to computationally infer ISG modules and their regulators, validated by genetic analysis in both species. Some ISGs are controlled by Stat1/2 and Irf9 and the ISRE DNA motif, but others appeared dependent on non-canonical factors. This regulatory framework helped to interpret JAK1 blockade pharmacology, different clusters being affected under tonic or IFN-stimulated conditions, and the IFN signatures previously associated with human diseases, revealing unrecognized subtleties in disease footprints, as affected by human ancestry.
Asunto(s)
Redes Reguladoras de Genes , Interferón Tipo I/inmunología , Interferón Tipo I/metabolismo , Animales , Linfocitos T CD4-Positivos/metabolismo , Conjuntos de Datos como Asunto , Humanos , Quinasas Janus/metabolismo , Ratones , Ratones Endogámicos C57BL , Receptor de Interferón alfa y beta/metabolismoRESUMEN
Finding the components of cellular circuits and determining their functions systematically remains a major challenge in mammalian cells. Here, we introduced genome-wide pooled CRISPR-Cas9 libraries into dendritic cells (DCs) to identify genes that control the induction of tumor necrosis factor (Tnf) by bacterial lipopolysaccharide (LPS), a key process in the host response to pathogens, mediated by the Tlr4 pathway. We found many of the known regulators of Tlr4 signaling, as well as dozens of previously unknown candidates that we validated. By measuring protein markers and mRNA profiles in DCs that are deficient in known or candidate genes, we classified the genes into three functional modules with distinct effects on the canonical responses to LPS and highlighted functions for the PAF complex and oligosaccharyltransferase (OST) complex. Our findings uncover new facets of innate immune circuits in primary cells and provide a genetic approach for dissection of mammalian cell circuits.
Asunto(s)
Sistemas CRISPR-Cas , Técnicas Genéticas , Inmunidad Innata , Animales , Células de la Médula Ósea/inmunología , Diferenciación Celular , Supervivencia Celular , Células Dendríticas/citología , Células Dendríticas/inmunología , Técnicas de Inactivación de Genes , Redes Reguladoras de Genes , Hexosiltransferasas/metabolismo , Proteínas de la Membrana/metabolismo , Ratones , Ratones Transgénicos , Receptor Toll-Like 4/inmunología , Factor de Necrosis Tumoral alfa/inmunologíaRESUMEN
CRISPR-enabled screening is a powerful tool for the discovery of genes that control T cell function and has nominated candidate targets for immunotherapies1-6. However, new approaches are required to probe specific nucleotide sequences within key genes. Systematic mutagenesis in primary human T cells could reveal alleles that tune specific phenotypes. DNA base editors are powerful tools for introducing targeted mutations with high efficiency7,8. Here we develop a large-scale base-editing mutagenesis platform with the goal of pinpointing nucleotides that encode amino acid residues that tune primary human T cell activation responses. We generated a library of around 117,000 single guide RNA molecules targeting base editors to protein-coding sites across 385 genes implicated in T cell function and systematically identified protein domains and specific amino acid residues that regulate T cell activation and cytokine production. We found a broad spectrum of alleles with variants encoding critical residues in proteins including PIK3CD, VAV1, LCP2, PLCG1 and DGKZ, including both gain-of-function and loss-of-function mutations. We validated the functional effects of many alleles and further demonstrated that base-editing hits could positively and negatively tune T cell cytotoxic function. Finally, higher-resolution screening using a base editor with relaxed protospacer-adjacent motif requirements9 (NG versus NGG) revealed specific structural domains and protein-protein interaction sites that can be targeted to tune T cell functions. Base-editing screens in primary immune cells thus provide biochemical insights with the potential to accelerate immunotherapy design.
Asunto(s)
Alelos , Edición Génica , Mutagénesis , Linfocitos T , Humanos , Aminoácidos/genética , Sistemas CRISPR-Cas/genética , Mutagénesis/genética , ARN Guía de Sistemas CRISPR-Cas/genética , Linfocitos T/inmunología , Linfocitos T/metabolismo , Activación de Linfocitos , Citocinas/biosíntesis , Citocinas/metabolismo , Mutación con Ganancia de Función , Mutación con Pérdida de FunciónRESUMEN
Neuroanatomists have long speculated that expanded primate brains contain an increased morphological diversity of inhibitory neurons (INs)1, and recent studies have identified primate-specific neuronal populations at the molecular level2. However, we know little about the developmental mechanisms that specify evolutionarily novel cell types in the brain. Here, we reconstruct gene expression trajectories specifying INs generated throughout the neurogenic period in macaques and mice by analysing the transcriptomes of 250,181 cells. We find that the initial classes of INs generated prenatally are largely conserved among mammals. Nonetheless, we identify two contrasting developmental mechanisms for specifying evolutionarily novel cell types during prenatal development. First, we show that recently identified primate-specific TAC3 striatal INs are specified by a unique transcriptional programme in progenitors followed by induction of a distinct suite of neuropeptides and neurotransmitter receptors in new-born neurons. Second, we find that multiple classes of transcriptionally conserved olfactory bulb (OB)-bound precursors are redirected to expanded primate white matter and striatum. These classes include a novel peristriatal class of striatum laureatum neurons that resemble dopaminergic periglomerular cells of the OB. We propose an evolutionary model in which conserved initial classes of neurons supplying the smaller primate OB are reused in the enlarged striatum and cortex. Together, our results provide a unified developmental taxonomy of initial classes of mammalian INs and reveal multiple developmental mechanisms for neural cell type evolution.
Asunto(s)
Evolución Biológica , Cuerpo Estriado , Desarrollo Embrionario , Macaca , Neurogénesis , Neuronas , Bulbo Olfatorio , Animales , Cuerpo Estriado/crecimiento & desarrollo , Neuronas Dopaminérgicas , Femenino , Macaca/crecimiento & desarrollo , Mamíferos , Ratones , Neurogénesis/fisiología , Bulbo Olfatorio/fisiología , Embarazo , PrimatesRESUMEN
Infectious diseases are among the strongest selective pressures driving human evolution1,2. This includes the single greatest mortality event in recorded history, the first outbreak of the second pandemic of plague, commonly called the Black Death, which was caused by the bacterium Yersinia pestis3. This pandemic devastated Afro-Eurasia, killing up to 30-50% of the population4. To identify loci that may have been under selection during the Black Death, we characterized genetic variation around immune-related genes from 206 ancient DNA extracts, stemming from two different European populations before, during and after the Black Death. Immune loci are strongly enriched for highly differentiated sites relative to a set of non-immune loci, suggesting positive selection. We identify 245 variants that are highly differentiated within the London dataset, four of which were replicated in an independent cohort from Denmark, and represent the strongest candidates for positive selection. The selected allele for one of these variants, rs2549794, is associated with the production of a full-length (versus truncated) ERAP2 transcript, variation in cytokine response to Y. pestis and increased ability to control intracellular Y. pestis in macrophages. Finally, we show that protective variants overlap with alleles that are today associated with increased susceptibility to autoimmune diseases, providing empirical evidence for the role played by past pandemics in shaping present-day susceptibility to disease.
Asunto(s)
ADN Antiguo , Predisposición Genética a la Enfermedad , Inmunidad , Peste , Selección Genética , Yersinia pestis , Humanos , Aminopeptidasas/genética , Aminopeptidasas/inmunología , Peste/genética , Peste/inmunología , Peste/microbiología , Peste/mortalidad , Yersinia pestis/inmunología , Yersinia pestis/patogenicidad , Selección Genética/inmunología , Europa (Continente)/epidemiología , Europa (Continente)/etnología , Inmunidad/genética , Conjuntos de Datos como Asunto , Londres/epidemiología , Dinamarca/epidemiologíaRESUMEN
Decades of work have elucidated cytokine signalling and transcriptional pathways that control T cell differentiation and have led the way to targeted biologic therapies that are effective in a range of autoimmune, allergic and inflammatory diseases. Recent evidence indicates that obesity and metabolic disease can also influence the immune system1-7, although the mechanisms and effects on immunotherapy outcomes remain largely unknown. Here, using two models of atopic dermatitis, we show that lean and obese mice mount markedly different immune responses. Obesity converted the classical type 2 T helper (TH2)-predominant disease associated with atopic dermatitis to a more severe disease with prominent TH17 inflammation. We also observed divergent responses to biologic therapies targeting TH2 cytokines, which robustly protected lean mice but exacerbated disease in obese mice. Single-cell RNA sequencing coupled with genome-wide binding analyses revealed decreased activity of nuclear receptor peroxisome proliferator-activated receptor-γ (PPARγ) in TH2 cells from obese mice relative to lean mice. Conditional ablation of PPARγ in T cells revealed that PPARγ is required to focus the in vivo TH response towards a TH2-predominant state and prevent aberrant non-TH2 inflammation. Treatment of obese mice with a small-molecule PPARγ agonist limited development of TH17 pathology and unlocked therapeutic responsiveness to targeted anti-TH2 biologic therapies. These studies reveal the effects of obesity on immunological disease and suggest a precision medicine approach to target the immune dysregulation caused by obesity.
Asunto(s)
Dermatitis Atópica , PPAR gamma , Animales , Citocinas/metabolismo , Modelos Animales de Enfermedad , Inflamación/metabolismo , Ratones , Obesidad/metabolismo , PPAR gamma/agonistas , PPAR gamma/metabolismo , Medicina de Precisión , Análisis de Secuencia de ARN , Células Th2/metabolismoRESUMEN
Inborn errors of immunity (IEI) comprise a diverse spectrum of 485 disorders as recognized by the International Union of Immunological Societies Committee on Inborn Error of Immunity in 2022. While IEI are monogenic by definition, they illuminate various pathways involved in the pathogenesis of polygenic immune dysregulation as in autoimmune or autoinflammatory syndromes, or in more common infectious diseases that may not have a significant genetic basis. Rapid improvement in genomic technologies has been the main driver of the accelerated rate of discovery of IEI and has led to the development of innovative treatment strategies. In this review, we will explore various facets of IEI, delving into the distinctions between PIDD and PIRD. We will examine how Mendelian inheritance patterns contribute to these disorders and discuss advancements in functional genomics that aid in characterizing new IEI. Additionally, we will explore how emerging genomic tools help to characterize new IEI as well as how they are paving the way for innovative treatment approaches for managing and potentially curing these complex immune conditions.
Asunto(s)
Genómica , Humanos , SíndromeRESUMEN
Population-scale single-cell genomics is a transformative approach for unraveling the intricate links between genetic and cellular variation. This approach is facilitated by cutting-edge experimental methodologies, including the development of high-throughput single-cell multiomics and advances in multiplexed environmental and genetic perturbations. Examining the effects of natural or synthetic genetic variants across cellular contexts provides insights into the mutual influence of genetics and the environment in shaping cellular heterogeneity. The development of computational methodologies further enables detailed quantitative analysis of molecular variation, offering an opportunity to examine the respective roles of stochastic, intercellular, and interindividual variation. Future opportunities lie in leveraging long-read sequencing, refining disease-relevant cellular models, and embracing predictive and generative machine learning models. These advancements hold the potential for a deeper understanding of the genetic architecture of human molecular traits, which in turn has important implications for understanding the genetic causes of human disease.
Asunto(s)
Variación Genética , Análisis de la Célula Individual , Humanos , Análisis de la Célula Individual/métodos , Genómica/métodos , Aprendizaje Automático , Genética de PoblaciónRESUMEN
Stromal cells (SCs) establish the compartmentalization of lymphoid tissues critical to the immune response. However, the full diversity of lymph node (LN) SCs remains undefined. Using droplet-based single-cell RNA sequencing, we identified nine peripheral LN non-endothelial SC clusters. Included are the established subsets, Ccl19hi T-zone reticular cells (TRCs), marginal reticular cells, follicular dendritic cells (FDCs), and perivascular cells. We also identified Ccl19lo TRCs, likely including cholesterol-25-hydroxylase+ cells located at the T-zone perimeter, Cxcl9+ TRCs in the T-zone and interfollicular region, CD34+ SCs in the capsule and medullary vessel adventitia, indolethylamine N-methyltransferase+ SCs in the medullary cords, and Nr4a1+ SCs in several niches. These data help define how transcriptionally distinct LN SCs support niche-restricted immune functions and provide evidence that many SCs are in an activated state.
Asunto(s)
Ganglios Linfáticos/inmunología , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Células del Estroma/inmunología , Transcriptoma/inmunología , Animales , Quimiocina CCL19/genética , Quimiocina CCL19/inmunología , Quimiocina CCL19/metabolismo , Células Dendríticas Foliculares/inmunología , Células Dendríticas Foliculares/metabolismo , Femenino , Ganglios Linfáticos/metabolismo , Tejido Linfoide/citología , Tejido Linfoide/inmunología , Tejido Linfoide/metabolismo , Ratones Endogámicos C57BL , Células del Estroma/metabolismoRESUMEN
The development of DNA-barcoded antibodies to tag cell surface molecules has enabled the use of droplet-based single-cell sequencing (dsc-seq) to profile protein abundances from thousands of cells simultaneously. As compared to flow and mass cytometry, the high per cell cost of current dsc-seq-based workflows precludes their use in clinical applications and large-scale pooled screens. Here, we introduce SCITO-seq, a workflow that uses splint oligonucleotides (oligos) to enable combinatorially indexed dsc-seq of DNA-barcoded antibodies from over 105 cells per reaction using commercial microfluidics. By encoding sample barcodes into splint oligos, we demonstrate that multiplexed SCITO-seq produces reproducible estimates of cellular composition and surface protein expression comparable to those from mass cytometry. We further demonstrate two modified splint oligo designs that extend SCITO-seq to achieve compatibility with commercial DNA-barcoded antibodies and simultaneous expression profiling of the transcriptome and surface proteins from the same cell. These results demonstrate SCITO-seq as a flexible and ultra-high-throughput platform for sequencing-based single-cell protein and multimodal profiling.
Asunto(s)
Citometría de Flujo/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Microfluídica/métodos , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Transcriptoma , Estudios de Casos y Controles , Perfilación de la Expresión Génica , HumanosRESUMEN
We describe an unvaccinated child at risk for life-threatening COVID-19 due to an inherited deficiency of IRF9, which governs ISGF-3-dependent responses to type I and III interferons (IFN). She was admitted, with a high nasal SARS-CoV-2 load on day 1 of upper respiratory tract infection. She was viremic on day 2 and received casirivimab and imdevimab. Her clinical manifestations and viremia disappeared on days 3 and 4, respectively. Circulating SARS-CoV-2 virus induced the expression of IFN-stimulated genes in leukocytes on day 1, whereas the secretion of blood type I IFNs, which peaked on day 4, did not. Antibody-mediated SARS-CoV-2 neutralization is, therefore, sufficient to overcome a deficiency of antiviral IFNs.
Asunto(s)
Anticuerpos Monoclonales Humanizados/uso terapéutico , COVID-19/terapia , Subunidad gamma del Factor 3 de Genes Estimulados por el Interferón/deficiencia , Subunidad gamma del Factor 3 de Genes Estimulados por el Interferón/genética , SARS-CoV-2/inmunología , Anticuerpos Neutralizantes/uso terapéutico , Preescolar , Femenino , Humanos , Huésped Inmunocomprometido , Mutación , Carga ViralRESUMEN
While genetic variants are known to be associated with overall gene abundance in stimulated immune cells, less is known about their effects on alternative isoform usage. By analyzing RNA-seq profiles of monocyte-derived dendritic cells from 243 individuals, we uncovered thousands of unannotated isoforms synthesized in response to influenza infection and type 1 interferon stimulation. We identified more than a thousand quantitative trait loci (QTLs) associated with alternate isoform usage (isoQTLs), many of which are independent of expression QTLs (eQTLs) for the same gene. Compared with eQTLs, isoQTLs are enriched for splice sites and untranslated regions, but depleted of sequences upstream of annotated transcription start sites. Both eQTLs and isoQTLs explain a significant proportion of the disease heritability attributed to common genetic variants. At the ERAP2 locus, we shed light on the function of the gene and how two frequent, highly differentiated haplotypes with intermediate frequencies could be maintained by balancing selection. At baseline and following type 1 interferon stimulation, the major haplotype is associated with low ERAP2 expression caused by nonsense-mediated decay, while the minor haplotype, known to increase Crohn's disease risk, is associated with high ERAP2 expression. In response to influenza infection, we found two uncharacterized isoforms expressed from the major haplotype, likely the result of multiple perfectly linked variants affecting the transcription and splicing at the locus. Thus, genetic variants at a single locus could modulate independent gene regulatory processes in innate immune responses and, in the case of ERAP2, may confer a historical fitness advantage in response to virus.
Asunto(s)
Empalme Alternativo , Aminopeptidasas/genética , Predisposición Genética a la Enfermedad , Interacciones Huésped-Patógeno/genética , Virus de la Influenza A , Gripe Humana/genética , Gripe Humana/virología , Adolescente , Adulto , Mapeo Cromosómico , Biología Computacional/métodos , Células Dendríticas/metabolismo , Femenino , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Ontología de Genes , Pruebas Genéticas , Variación Genética , Humanos , Interferón Tipo I/metabolismo , Masculino , Persona de Mediana Edad , Modelos Biológicos , Anotación de Secuencia Molecular , Monocitos/metabolismo , Sitios de Carácter Cuantitativo , Transcriptoma , Adulto JovenRESUMEN
The increasing number of genetic association studies conducted in multiple populations provides an unprecedented opportunity to study how the genetic architecture of complex phenotypes varies between populations, a problem important for both medical and population genetics. Here, we have developed a method for estimating the transethnic genetic correlation: the correlation of causal-variant effect sizes at SNPs common in populations. This methods takes advantage of the entire spectrum of SNP associations and uses only summary-level data from genome-wide association studies. This avoids the computational costs and privacy concerns associated with genotype-level information while remaining scalable to hundreds of thousands of individuals and millions of SNPs. We applied our method to data on gene expression, rheumatoid arthritis, and type 2 diabetes and overwhelmingly found that the genetic correlation was significantly less than 1. Our method is implemented in a Python package called Popcorn.
Asunto(s)
Artritis Reumatoide/genética , Diabetes Mellitus Tipo 2/genética , Etnicidad/genética , Estudio de Asociación del Genoma Completo/métodos , Programas Informáticos , Estatura , Índice de Masa Corporal , Genotipo , Humanos , Funciones de Verosimilitud , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Tamaño de la MuestraRESUMEN
The role of rare alleles in complex phenotypes has been hotly debated, but most rare variant association tests (RVATs) do not account for the evolutionary forces that affect genetic architecture. Here, we use simulation and numerical algorithms to show that explosive population growth, as experienced by human populations, can dramatically increase the impact of very rare alleles on trait variance. We then assess the ability of RVATs to detect causal loci using simulations and human RNA-seq data. Surprisingly, we find that statistical performance is worst for phenotypes in which genetic variance is due mainly to rare alleles, and explosive population growth decreases power. Although many studies have attempted to identify causal rare variants, few have reported novel associations. This has sometimes been interpreted to mean that rare variants make negligible contributions to complex trait heritability. Our work shows that RVATs are not robust to realistic human evolutionary forces, so general conclusions about the impact of rare variants on complex traits may be premature.