Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
1.
Nat Genet ; 56(4): 615-626, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38594305

ABSTRACT

Translating genome-wide association study (GWAS) loci into causal variants and genes requires accurate cell-type-specific enhancer-gene maps from disease-relevant tissues. Building enhancer-gene maps is essential but challenging with current experimental methods in primary human tissues. Here we developed a nonparametric statistical method, SCENT (single-cell enhancer target gene mapping), that models association between enhancer chromatin accessibility and gene expression in single-cell or nucleus multimodal RNA sequencing and ATAC sequencing data. We applied SCENT to 9 multimodal datasets including >120,000 single cells or nuclei and created 23 cell-type-specific enhancer-gene maps. These maps were highly enriched for causal variants in expression quantitative loci and GWAS for 1,143 diseases and traits. We identified likely causal genes for both common and rare diseases and linked somatic mutation hotspots to target genes. We demonstrate that application of SCENT to multimodal data from disease-relevant human tissue enables the scalable construction of accurate cell-type-specific enhancer-gene maps, essential for defining noncoding variant function.


Subject(s)
Genome-Wide Association Study , Regulatory Sequences, Nucleic Acid , Humans , Alleles , Genome-Wide Association Study/methods , Chromosome Mapping , Phenotype , Chromatin/genetics , Polymorphism, Single Nucleotide , Genetic Predisposition to Disease/genetics
2.
Nat Commun ; 15(1): 563, 2024 Jan 17.
Article in English | MEDLINE | ID: mdl-38233398

ABSTRACT

Prioritizing disease-critical cell types by integrating genome-wide association studies (GWAS) with functional data is a fundamental goal. Single-cell chromatin accessibility (scATAC-seq) and gene expression (scRNA-seq) have characterized cell types at high resolution, and studies integrating GWAS with scRNA-seq have shown promise, but studies integrating GWAS with scATAC-seq have been limited. Here, we identify disease-critical fetal and adult brain cell types by integrating GWAS summary statistics from 28 brain-related diseases/traits (average N = 298 K) with 3.2 million scATAC-seq and scRNA-seq profiles from 83 cell types. We identified disease-critical fetal (respectively adult) brain cell types for 22 (respectively 23) of 28 traits using scATAC-seq, and for 8 (respectively 17) of 28 traits using scRNA-seq. Significant scATAC-seq enrichments included fetal photoreceptor cells for major depressive disorder, fetal ganglion cells for BMI, fetal astrocytes for ADHD, and adult VGLUT2 excitatory neurons for schizophrenia. Our findings improve our understanding of brain-related diseases/traits and inform future analyses.


Subject(s)
Chromatin Immunoprecipitation Sequencing , Depressive Disorder, Major , Humans , RNA-Seq , Genome-Wide Association Study , Chromatin/genetics , Brain , Single-Cell Analysis
3.
Nat Genet ; 55(12): 2255-2268, 2023 Dec.
Article in English | MEDLINE | ID: mdl-38036787

ABSTRACT

The human leukocyte antigen (HLA) locus plays a critical role in complex traits spanning autoimmune and infectious diseases, transplantation and cancer. While coding variation in HLA genes has been extensively documented, regulatory genetic variation modulating HLA expression levels has not been comprehensively investigated. Here we mapped expression quantitative trait loci (eQTLs) for classical HLA genes across 1,073 individuals and 1,131,414 single cells from three tissues. To mitigate technical confounding, we developed scHLApers, a pipeline to accurately quantify single-cell HLA expression using personalized reference genomes. We identified cell-type-specific cis-eQTLs for every classical HLA gene. Modeling eQTLs at single-cell resolution revealed that many eQTL effects are dynamic across cell states even within a cell type. HLA-DQ genes exhibit particularly cell-state-dependent effects within myeloid, B and T cells. For example, a T cell HLA-DQA1 eQTL ( rs3104371 ) is strongest in cytotoxic cells. Dynamic HLA regulation may underlie important interindividual variability in immune responses.


Subject(s)
Gene Expression Regulation , Quantitative Trait Loci , Humans , Gene Expression Regulation/genetics , Quantitative Trait Loci/genetics , Genome-Wide Association Study , Polymorphism, Single Nucleotide
4.
Nature ; 621(7980): 857-867, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37730992

ABSTRACT

Speciation leads to adaptive changes in organ cellular physiology and creates challenges for studying rare cell-type functions that diverge between humans and mice. Rare cystic fibrosis transmembrane conductance regulator (CFTR)-rich pulmonary ionocytes exist throughout the cartilaginous airways of humans1,2, but limited presence and divergent biology in the proximal trachea of mice has prevented the use of traditional transgenic models to elucidate ionocyte functions in the airway. Here we describe the creation and use of conditional genetic ferret models to dissect pulmonary ionocyte biology and function by enabling ionocyte lineage tracing (FOXI1-CreERT2::ROSA-TG), ionocyte ablation (FOXI1-KO) and ionocyte-specific deletion of CFTR (FOXI1-CreERT2::CFTRL/L). By comparing these models with cystic fibrosis ferrets3,4, we demonstrate that ionocytes control airway surface liquid absorption, secretion, pH and mucus viscosity-leading to reduced airway surface liquid volume and impaired mucociliary clearance in cystic fibrosis, FOXI1-KO and FOXI1-CreERT2::CFTRL/L ferrets. These processes are regulated by CFTR-dependent ionocyte transport of Cl- and HCO3-. Single-cell transcriptomics and in vivo lineage tracing revealed three subtypes of pulmonary ionocytes and a FOXI1-lineage common rare cell progenitor for ionocytes, tuft cells and neuroendocrine cells during airway development. Thus, rare pulmonary ionocytes perform critical CFTR-dependent functions in the proximal airway that are hallmark features of cystic fibrosis airway disease. These studies provide a road map for using conditional genetics in the first non-rodent mammal to address gene function, cell biology and disease processes that have greater evolutionary conservation between humans and ferrets.


Subject(s)
Cystic Fibrosis , Disease Models, Animal , Ferrets , Lung , Transgenes , Animals , Humans , Animals, Genetically Modified , Cell Lineage , Cystic Fibrosis/genetics , Cystic Fibrosis/metabolism , Cystic Fibrosis/pathology , Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Cystic Fibrosis Transmembrane Conductance Regulator/metabolism , Ferrets/genetics , Ferrets/physiology , Forkhead Transcription Factors/genetics , Forkhead Transcription Factors/metabolism , Lung/cytology , Lung/metabolism , Lung/pathology , Trachea/cytology , Transgenes/genetics
5.
medRxiv ; 2023 Mar 20.
Article in English | MEDLINE | ID: mdl-36993194

ABSTRACT

The human leukocyte antigen (HLA) locus plays a critical role in complex traits spanning autoimmune and infectious diseases, transplantation, and cancer. While coding variation in HLA genes has been extensively documented, regulatory genetic variation modulating HLA expression levels has not been comprehensively investigated. Here, we mapped expression quantitative trait loci (eQTLs) for classical HLA genes across 1,073 individuals and 1,131,414 single cells from three tissues, using personalized reference genomes to mitigate technical confounding. We identified cell-type-specific cis-eQTLs for every classical HLA gene. Modeling eQTLs at single-cell resolution revealed that many eQTL effects are dynamic across cell states even within a cell type. HLA-DQ genes exhibit particularly cell-state-dependent effects within myeloid, B, and T cells. Dynamic HLA regulation may underlie important interindividual variability in immune responses.

6.
bioRxiv ; 2023 Jan 24.
Article in English | MEDLINE | ID: mdl-36747789

ABSTRACT

E3 ligases regulate key processes, but many of their roles remain unknown. Using Perturb-seq, we interrogated the function of 1,130 E3 ligases, partners and substrates in the inflammatory response in primary dendritic cells (DCs). Dozens impacted the balance of DC1, DC2, migratory DC and macrophage states and a gradient of DC maturation. Family members grouped into co-functional modules that were enriched for physical interactions and impacted specific programs through substrate transcription factors. E3s and their adaptors co-regulated the same processes, but partnered with different substrate recognition adaptors to impact distinct aspects of the DC life cycle. Genetic interactions were more prevalent within than between modules, and a deep learning model, comßVAE, predicts the outcome of new combinations by leveraging modularity. The E3 regulatory network was associated with heritable variation and aberrant gene expression in immune cells in human inflammatory diseases. Our study provides a general approach to dissect gene function.

7.
Nature ; 614(7948): 492-499, 2023 02.
Article in English | MEDLINE | ID: mdl-36755099

ABSTRACT

Both common and rare genetic variants influence complex traits and common diseases. Genome-wide association studies have identified thousands of common-variant associations, and more recently, large-scale exome sequencing studies have identified rare-variant associations in hundreds of genes1-3. However, rare-variant genetic architecture is not well characterized, and the relationship between common-variant and rare-variant architecture is unclear4. Here we quantify the heritability explained by the gene-wise burden of rare coding variants across 22 common traits and diseases in 394,783 UK Biobank exomes5. Rare coding variants (allele frequency < 1 × 10-3) explain 1.3% (s.e. = 0.03%) of phenotypic variance on average-much less than common variants-and most burden heritability is explained by ultrarare loss-of-function variants (allele frequency < 1 × 10-5). Common and rare variants implicate the same cell types, with similar enrichments, and they have pleiotropic effects on the same pairs of traits, with similar genetic correlations. They partially colocalize at individual genes and loci, but not to the same extent: burden heritability is strongly concentrated in significant genes, while common-variant heritability is more polygenic, and burden heritability is also more strongly concentrated in constrained genes. Finally, we find that burden heritability for schizophrenia and bipolar disorder6,7 is approximately 2%. Our results indicate that rare coding variants will implicate a tractable number of large-effect genes, that common and rare associations are mechanistically convergent, and that rare coding variants will contribute only modestly to missing heritability and population risk stratification.


Subject(s)
Exome , Gene Frequency , Genetic Variation , Multifactorial Inheritance , Humans , Exome/genetics , Genetic Variation/genetics , Genome-Wide Association Study , Multifactorial Inheritance/genetics , Risk Factors , United Kingdom , Genetic Loci/genetics , Schizophrenia/genetics , Bipolar Disorder/genetics
9.
Front Oncol ; 12: 929950, 2022.
Article in English | MEDLINE | ID: mdl-36185212

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is one of the most treatment refractory and lethal malignancies. The diversity of endothelial cell (EC) lineages in the tumor microenvironment (TME) impacts the efficacy of antineoplastic therapies, which in turn remodel EC states and distributions. Here, we present a single-cell resolution framework of diverse EC lineages in the PDAC TME in the context of neoadjuvant chemotherapy, radiotherapy, and losartan. We analyzed a custom single-nucleus RNA-seq dataset derived from 37 primary PDAC specimens (18 untreated, 14 neoadjuvant FOLFIRINOX + chemoradiotherapy, 5 neoadjuvant FOLFIRINOX + chemoradiotherapy + losartan). A single-nucleus transcriptome analysis of 15,185 EC profiles revealed two state programs (ribosomal, cycling), four lineage programs (capillary, arterial, venous, lymphatic), and one program that did not overlap significantly with prior signatures but was enriched in pathways involved in vasculogenesis, stem-like state, response to wounding and hypoxia, and endothelial-to-mesenchymal transition (reactive EndMT). A bulk transcriptome analysis of two independent cohorts (n = 269 patients) revealed that the lymphatic and reactive EndMT lineage programs were significantly associated with poor clinical outcomes. While losartan and proton therapy were associated with reduced lymphatic ECs, these therapies also correlated with an increase in reactive EndMT. Thus, the development and inclusion of EndMT-inhibiting drugs (e.g., nintedanib) to a neoadjuvant chemoradiotherapy regimen featuring losartan and/or proton therapy may be most effective in depleting both lymphatic and reactive EndMT populations and potentially improving patient outcomes.

10.
Nat Genet ; 54(10): 1572-1580, 2022 10.
Article in English | MEDLINE | ID: mdl-36050550

ABSTRACT

Single-cell RNA sequencing (scRNA-seq) provides unique insights into the pathology and cellular origin of disease. We introduce single-cell disease relevance score (scDRS), an approach that links scRNA-seq with polygenic disease risk at single-cell resolution, independent of annotated cell types. scDRS identifies cells exhibiting excess expression across disease-associated genes implicated by genome-wide association studies (GWASs). We applied scDRS to 74 diseases/traits and 1.3 million single-cell gene-expression profiles across 31 tissues/organs. Cell-type-level results broadly recapitulated known cell-type-disease associations. Individual-cell-level results identified subpopulations of disease-associated cells not captured by existing cell-type labels, including T cell subpopulations associated with inflammatory bowel disease, partially characterized by their effector-like states; neuron subpopulations associated with schizophrenia, partially characterized by their spatial locations; and hepatocyte subpopulations associated with triglyceride levels, partially characterized by their higher ploidy levels. Genes whose expression was correlated with the scDRS score across cells (reflecting coexpression with GWAS disease-associated genes) were strongly enriched for gold-standard drug target and Mendelian disease genes.


Subject(s)
Genome-Wide Association Study , Single-Cell Analysis , Gene Expression Profiling/methods , Multifactorial Inheritance/genetics , RNA-Seq , Single-Cell Analysis/methods , Triglycerides
11.
Nat Genet ; 54(10): 1466-1469, 2022 10.
Article in English | MEDLINE | ID: mdl-36138231

ABSTRACT

Several biobanks, including UK Biobank (UKBB), are generating large-scale sequencing data. An existing method, SAIGE-GENE, performs well when testing variants with minor allele frequency (MAF) ≤ 1%, but inflation is observed in variance component set-based tests when restricting to variants with MAF ≤ 0.1% or 0.01%. Here, we propose SAIGE-GENE+ with greatly improved type I error control and computational efficiency to facilitate rare variant tests in large-scale data. We further show that incorporating multiple MAF cutoffs and functional annotations can improve power and thus uncover new gene-phenotype associations. In the analysis of UKBB whole exome sequencing data for 30 quantitative and 141 binary traits, SAIGE-GENE+ identified 551 gene-phenotype associations.


Subject(s)
Genome-Wide Association Study , Gene Frequency/genetics , Genome-Wide Association Study/methods , Phenotype , Exome Sequencing
12.
Nat Genet ; 54(10): 1479-1492, 2022 10.
Article in English | MEDLINE | ID: mdl-36175791

ABSTRACT

Genome-wide association studies provide a powerful means of identifying loci and genes contributing to disease, but in many cases, the related cell types/states through which genes confer disease risk remain unknown. Deciphering such relationships is important for identifying pathogenic processes and developing therapeutics. In the present study, we introduce sc-linker, a framework for integrating single-cell RNA-sequencing, epigenomic SNP-to-gene maps and genome-wide association study summary statistics to infer the underlying cell types and processes by which genetic variants influence disease. The inferred disease enrichments recapitulated known biology and highlighted notable cell-disease relationships, including γ-aminobutyric acid-ergic neurons in major depressive disorder, a disease-dependent M-cell program in ulcerative colitis and a disease-specific complement cascade process in multiple sclerosis. In autoimmune disease, both healthy and disease-dependent immune cell-type programs were associated, whereas only disease-dependent epithelial cell programs were prominent, suggesting a role in disease response rather than initiation. Our framework provides a powerful approach for identifying the cell types and cellular processes by which genetic variants influence disease.


Subject(s)
Depressive Disorder, Major , Genome-Wide Association Study , Depressive Disorder, Major/genetics , Genetic Predisposition to Disease , Human Genetics , Humans , Polymorphism, Single Nucleotide/genetics , RNA , gamma-Aminobutyric Acid
13.
Nat Genet ; 54(8): 1178-1191, 2022 08.
Article in English | MEDLINE | ID: mdl-35902743

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is a highly lethal and treatment-refractory cancer. Molecular stratification in pancreatic cancer remains rudimentary and does not yet inform clinical management or therapeutic development. Here, we construct a high-resolution molecular landscape of the cellular subtypes and spatial communities that compose PDAC using single-nucleus RNA sequencing and whole-transcriptome digital spatial profiling (DSP) of 43 primary PDAC tumor specimens that either received neoadjuvant therapy or were treatment naive. We uncovered recurrent expression programs across malignant cells and fibroblasts, including a newly identified neural-like progenitor malignant cell program that was enriched after chemotherapy and radiotherapy and associated with poor prognosis in independent cohorts. Integrating spatial and cellular profiles revealed three multicellular communities with distinct contributions from malignant, fibroblast and immune subtypes: classical, squamoid-basaloid and treatment enriched. Our refined molecular and cellular taxonomy can provide a framework for stratification in clinical trials and serve as a roadmap for therapeutic targeting of specific cellular phenotypes and multicellular interactions.


Subject(s)
Carcinoma, Pancreatic Ductal , Pancreatic Neoplasms , Biomarkers, Tumor/genetics , Carcinoma, Pancreatic Ductal/genetics , Carcinoma, Pancreatic Ductal/pathology , Carcinoma, Pancreatic Ductal/therapy , Gene Expression Profiling , Humans , Neoadjuvant Therapy , Pancreatic Neoplasms/drug therapy , Pancreatic Neoplasms/genetics , Prognosis , Transcriptome/genetics , Pancreatic Neoplasms
14.
Nat Genet ; 54(6): 827-836, 2022 06.
Article in English | MEDLINE | ID: mdl-35668300

ABSTRACT

Disease-associated single-nucleotide polymorphisms (SNPs) generally do not implicate target genes, as most disease SNPs are regulatory. Many SNP-to-gene (S2G) linking strategies have been developed to link regulatory SNPs to the genes that they regulate in cis. Here, we developed a heritability-based framework for evaluating and combining different S2G strategies to optimize their informativeness for common disease risk. Our optimal combined S2G strategy (cS2G) included seven constituent S2G strategies and achieved a precision of 0.75 and a recall of 0.33, more than doubling the recall of any individual strategy. We applied cS2G to fine-mapping results for 49 UK Biobank diseases/traits to predict 5,095 causal SNP-gene-disease triplets (with S2G-derived functional interpretation) with high confidence. We further applied cS2G to provide an empirical assessment of disease omnigenicity; we determined that the top 1% of genes explained roughly half of the SNP heritability linked to all genes and that gene-level architectures vary with variant allele frequency.


Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Genome-Wide Association Study/methods , Phenotype , Polymorphism, Single Nucleotide/genetics
15.
bioRxiv ; 2021 Nov 23.
Article in English | MEDLINE | ID: mdl-34845454

ABSTRACT

Genome-wide association studies (GWAS) provide a powerful means to identify loci and genes contributing to disease, but in many cases the related cell types/states through which genes confer disease risk remain unknown. Deciphering such relationships is important for identifying pathogenic processes and developing therapeutics. Here, we introduce sc-linker, a framework for integrating single-cell RNA-seq (scRNA-seq), epigenomic maps and GWAS summary statistics to infer the underlying cell types and processes by which genetic variants influence disease. We analyzed 1.6 million scRNA-seq profiles from 209 individuals spanning 11 tissue types and 6 disease conditions, and constructed gene programs capturing cell types, disease progression, and cellular processes both within and across cell types. We evaluated these gene programs for disease enrichment by transforming them to SNP annotations with tissue-specific epigenomic maps and computing enrichment scores across 60 diseases and complex traits (average N= 297K). Cell type, disease progression, and cellular process programs captured distinct heritability signals even within the same cell type, as we show in multiple complex diseases that affect the brain (Alzheimer’s disease, multiple sclerosis), colon (ulcerative colitis) and lung (asthma, idiopathic pulmonary fibrosis, severe COVID-19). The inferred disease enrichments recapitulated known biology and highlighted novel cell-disease relationships, including GABAergic neurons in major depressive disorder (MDD), a disease progression M cell program in ulcerative colitis, and a disease-specific complement cascade process in multiple sclerosis. In autoimmune disease, both healthy and disease progression immune cell type programs were associated, whereas for epithelial cells, disease progression programs were most prominent, perhaps suggesting a role in disease progression over initiation. Our framework provides a powerful approach for identifying the cell types and cellular processes by which genetic variants influence disease.

16.
Nature ; 595(7865): 107-113, 2021 07.
Article in English | MEDLINE | ID: mdl-33915569

ABSTRACT

COVID-19, which is caused by SARS-CoV-2, can result in acute respiratory distress syndrome and multiple organ failure1-4, but little is known about its pathophysiology. Here we generated single-cell atlases of 24 lung, 16 kidney, 16 liver and 19 heart autopsy tissue samples and spatial atlases of 14 lung samples from donors who died of COVID-19. Integrated computational analysis uncovered substantial remodelling in the lung epithelial, immune and stromal compartments, with evidence of multiple paths of failed tissue regeneration, including defective alveolar type 2 differentiation and expansion of fibroblasts and putative TP63+ intrapulmonary basal-like progenitor cells. Viral RNAs were enriched in mononuclear phagocytic and endothelial lung cells, which induced specific host programs. Spatial analysis in lung distinguished inflammatory host responses in lung regions with and without viral RNA. Analysis of the other tissue atlases showed transcriptional alterations in multiple cell types in heart tissue from donors with COVID-19, and mapped cell types and genes implicated with disease severity based on COVID-19 genome-wide association studies. Our foundational dataset elucidates the biological effect of severe SARS-CoV-2 infection across the body, a key step towards new treatments.


Subject(s)
COVID-19/pathology , COVID-19/virology , Kidney/pathology , Liver/pathology , Lung/pathology , Myocardium/pathology , SARS-CoV-2/pathogenicity , Adult , Aged , Aged, 80 and over , Atlases as Topic , Autopsy , Biological Specimen Banks , COVID-19/genetics , COVID-19/immunology , Endothelial Cells , Epithelial Cells/pathology , Epithelial Cells/virology , Female , Fibroblasts , Genome-Wide Association Study , Heart/virology , Humans , Inflammation/pathology , Inflammation/virology , Kidney/virology , Liver/virology , Lung/virology , Male , Middle Aged , Organ Specificity , Phagocytes , Pulmonary Alveoli/pathology , Pulmonary Alveoli/virology , RNA, Viral/analysis , Regeneration , SARS-CoV-2/immunology , Single-Cell Analysis , Viral Load
17.
Nat Med ; 27(3): 546-559, 2021 03.
Article in English | MEDLINE | ID: mdl-33654293

ABSTRACT

Angiotensin-converting enzyme 2 (ACE2) and accessory proteases (TMPRSS2 and CTSL) are needed for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) cellular entry, and their expression may shed light on viral tropism and impact across the body. We assessed the cell-type-specific expression of ACE2, TMPRSS2 and CTSL across 107 single-cell RNA-sequencing studies from different tissues. ACE2, TMPRSS2 and CTSL are coexpressed in specific subsets of respiratory epithelial cells in the nasal passages, airways and alveoli, and in cells from other organs associated with coronavirus disease 2019 (COVID-19) transmission or pathology. We performed a meta-analysis of 31 lung single-cell RNA-sequencing studies with 1,320,896 cells from 377 nasal, airway and lung parenchyma samples from 228 individuals. This revealed cell-type-specific associations of age, sex and smoking with expression levels of ACE2, TMPRSS2 and CTSL. Expression of entry factors increased with age and in males, including in airway secretory cells and alveolar type 2 cells. Expression programs shared by ACE2+TMPRSS2+ cells in nasal, lung and gut tissues included genes that may mediate viral entry, key immune functions and epithelial-macrophage cross-talk, such as genes involved in the interleukin-6, interleukin-1, tumor necrosis factor and complement pathways. Cell-type-specific expression patterns may contribute to the pathogenesis of COVID-19, and our work highlights putative molecular pathways for therapeutic intervention.


Subject(s)
COVID-19/epidemiology , COVID-19/genetics , Host-Pathogen Interactions/genetics , SARS-CoV-2/physiology , Sequence Analysis, RNA/statistics & numerical data , Single-Cell Analysis/statistics & numerical data , Virus Internalization , Adult , Aged , Aged, 80 and over , Alveolar Epithelial Cells/metabolism , Alveolar Epithelial Cells/virology , Angiotensin-Converting Enzyme 2/genetics , Angiotensin-Converting Enzyme 2/metabolism , COVID-19/pathology , COVID-19/virology , Cathepsin L/genetics , Cathepsin L/metabolism , Datasets as Topic/statistics & numerical data , Demography , Female , Gene Expression Profiling/statistics & numerical data , Humans , Lung/metabolism , Lung/virology , Male , Middle Aged , Organ Specificity/genetics , Respiratory System/metabolism , Respiratory System/virology , Sequence Analysis, RNA/methods , Serine Endopeptidases/genetics , Serine Endopeptidases/metabolism , Single-Cell Analysis/methods
18.
bioRxiv ; 2021 Feb 25.
Article in English | MEDLINE | ID: mdl-33655247

ABSTRACT

The SARS-CoV-2 pandemic has caused over 1 million deaths globally, mostly due to acute lung injury and acute respiratory distress syndrome, or direct complications resulting in multiple-organ failures. Little is known about the host tissue immune and cellular responses associated with COVID-19 infection, symptoms, and lethality. To address this, we collected tissues from 11 organs during the clinical autopsy of 17 individuals who succumbed to COVID-19, resulting in a tissue bank of approximately 420 specimens. We generated comprehensive cellular maps capturing COVID-19 biology related to patients' demise through single-cell and single-nucleus RNA-Seq of lung, kidney, liver and heart tissues, and further contextualized our findings through spatial RNA profiling of distinct lung regions. We developed a computational framework that incorporates removal of ambient RNA and automated cell type annotation to facilitate comparison with other healthy and diseased tissue atlases. In the lung, we uncovered significantly altered transcriptional programs within the epithelial, immune, and stromal compartments and cell intrinsic changes in multiple cell types relative to lung tissue from healthy controls. We observed evidence of: alveolar type 2 (AT2) differentiation replacing depleted alveolar type 1 (AT1) lung epithelial cells, as previously seen in fibrosis; a concomitant increase in myofibroblasts reflective of defective tissue repair; and, putative TP63+ intrapulmonary basal-like progenitor (IPBLP) cells, similar to cells identified in H1N1 influenza, that may serve as an emergency cellular reserve for severely damaged alveoli. Together, these findings suggest the activation and failure of multiple avenues for regeneration of the epithelium in these terminal lungs. SARS-CoV-2 RNA reads were enriched in lung mononuclear phagocytic cells and endothelial cells, and these cells expressed distinct host response transcriptional programs. We corroborated the compositional and transcriptional changes in lung tissue through spatial analysis of RNA profiles in situ and distinguished unique tissue host responses between regions with and without viral RNA, and in COVID-19 donor tissues relative to healthy lung. Finally, we analyzed genetic regions implicated in COVID-19 GWAS with transcriptomic data to implicate specific cell types and genes associated with disease severity. Overall, our COVID-19 cell atlas is a foundational dataset to better understand the biological impact of SARS-CoV-2 infection across the human body and empowers the identification of new therapeutic interventions and prevention strategies.

19.
Nat Comput Sci ; 1(4): 272-279, 2021 Apr.
Article in English | MEDLINE | ID: mdl-38217177

ABSTRACT

DNA profiling has become an essential tool for crime solving and prevention, and CODIS (Combined DNA Index System) criminal investigation databases have flourished at the national, state and even local level. However, reports suggest that the DNA profiles of all suspects searched in these databases are often retained, which could result in racial profiling. Here, we devise an approach to both enable broad DNA profile searches and preserve exonerated citizens' privacy through a real-time privacy-preserving procedure to query CODIS databases. Using our approach, an agent can privately and efficiently query a suspect's DNA profile device in the field, learning only whether the profile matches against any database profile. More importantly, the central database learns nothing about the queried profile, and thus cannot retain it. Our approach paves the way to implement privacy-preserving DNA profile searching in CODIS databases and any CODIS-like system.

20.
Sci Transl Med ; 12(544)2020 05 20.
Article in English | MEDLINE | ID: mdl-32434849

ABSTRACT

The diagnosis of Mendelian disorders requires labor-intensive literature research. Trained clinicians can spend hours looking for the right publication(s) supporting a single gene that best explains a patient's disease. AMELIE (Automatic Mendelian Literature Evaluation) greatly accelerates this process. AMELIE parses all 29 million PubMed abstracts and downloads and further parses hundreds of thousands of full-text articles in search of information supporting the causality and associated phenotypes of most published genetic variants. AMELIE then prioritizes patient candidate variants for their likelihood of explaining any patient's given set of phenotypes. Diagnosis of singleton patients (without relatives' exomes) is the most time-consuming scenario, and AMELIE ranked the causative gene at the very top for 66% of 215 diagnosed singleton Mendelian patients from the Deciphering Developmental Disorders project. Evaluating only the top 11 AMELIE-scored genes of 127 (median) candidate genes per patient resulted in a rapid diagnosis in more than 90% of cases. AMELIE-based evaluation of all cases was 3 to 19 times more efficient than hand-curated database-based approaches. We replicated these results on a retrospective cohort of clinical cases from Stanford Children's Health and the Manton Center for Orphan Disease Research. An analysis web portal with our most recent update, programmatic interface, and code is available at AMELIE.stanford.edu.


Subject(s)
Exome , Child , Genotype , Humans , Phenotype , Probability , Retrospective Studies
SELECTION OF CITATIONS
SEARCH DETAIL
...