RESUMEN
Annotating genetic variants to their target genes is of great importance in unraveling the causal variants and genetic mechanisms that underlie complex diseases. However, disease-associated genetic variants are often located in non-coding regions and manifest context-specific effects, making it challenging to accurately identify the target genes and regulatory mechanisms. Here, we present TargetGene (https://ngdc.cncb.ac.cn/targetgene/), a comprehensive database reporting target genes for human genetic variants from various aspects. Specifically, we collected a comprehensive catalog of multi-omics data at the single-cell and bulk levels and from various human tissues, cell types and developmental stages. To facilitate the identification of Single Nucleotide Polymorphism (SNP)-to-gene connections, we have implemented multiple analytical tools based on chromatin co-accessibility, 3D interaction, enhancer activities and quantitative trait loci, among others. We applied the pipeline to evaluate variants from nearly 1300 Genome-wide association studies (GWAS) and assembled a comprehensive atlas of multiscale regulation of genetic variants. TargetGene is equipped with user-friendly web interfaces that enable intuitive searching, navigation and browsing through the results. Overall, TargetGene provides a unique resource to empower researchers to study the regulatory mechanisms of genetic variants in complex human traits.
Asunto(s)
Bases de Datos Genéticas , Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Humanos , Cromatina/genética , Estudio de Asociación del Genoma Completo/métodos , Polimorfismo de Nucleótido SimpleRESUMEN
Large-scale genome-wide association studies (GWAS) have provided profound insights into complex traits and diseases. Yet, deciphering the fine-scale molecular mechanisms of how genetic variants manifest to cause the phenotypes remains a daunting task. Here, we present COLOCdb (https://ngdc.cncb.ac.cn/colocdb), a comprehensive genetic colocalization database by integrating more than 3000 GWAS summary statistics and 13 types of xQTL to date. By employing two representative approaches for the colocalization analysis, COLOCdb deposits results from three key components: (i) GWAS-xQTL, pair-wise colocalization between GWAS loci and different types of xQTL, (ii) GWAS-GWAS, pair-wise colocalization between the trait-associated genetic loci from GWASs and (iii) xQTL-xQTL, pair-wise colocalization between the genetic loci associated with molecular phenotypes in xQTLs. These results together represent the most comprehensive colocalization analysis, which also greatly expands the list of shared variants with genetic pleiotropy. We expect that COLOCdb can serve as a unique and useful resource in advancing the discovery of new biological mechanisms and benefit future functional studies.
Asunto(s)
Estudio de Asociación del Genoma Completo , Herencia Multifactorial , Estudio de Asociación del Genoma Completo/métodos , Herencia Multifactorial/genética , Sitios de Carácter Cuantitativo , Fenotipo , Pleiotropía Genética , Polimorfismo de Nucleótido SimpleRESUMEN
A broad range of complex phenotypes are related to dysfunctions in brain (hereafter referred to as brain-related traits), including various mental and behavioral disorders and diseases of the nervous system. These traits in general share overlapping symptoms, pathogenesis, and genetic components. Here, we present Brain Catalog (https://ngdc.cncb.ac.cn/braincatalog), a comprehensive database aiming to delineate the genetic components of more than 500 GWAS summary statistics datasets for brain-related traits from multiple aspects. First, Brain Catalog provides results of candidate causal variants, causal genes, and functional tissues and cell types for each trait identified by multiple methods using comprehensive annotation datasets (58 QTL datasets spanning 6 types of QTLs). Second, Brain Catalog estimates the SNP-based heritability, the partitioning heritability based on functional annotations, and genetic correlations among traits. Finally, through bidirectional Mendelian randomization analyses, Brain Catalog presents inference of risk factors that are likely causal to each trait. In conclusion, Brain Catalog presents a one-stop shop for the genetic components of brain-related traits, potentially serving as a valuable resource for worldwide researchers to advance the understanding of how GWAS signals may contribute to the biological etiology of brain-related traits.
Asunto(s)
Encéfalo , Bases de Datos Genéticas , Trastornos Mentales , Encéfalo/fisiopatología , Fenotipo , Sitios de Carácter Cuantitativo , Trastornos Mentales/genéticaRESUMEN
Novel coronary pneumonia (COVID-19) is a respiratory distress syndrome caused by a new type of coronavirus. Understanding the genetic basis of susceptibility and prognosis to COVID-19 is of great significance to disease prevention, molecular typing, prognosis, and treatment. However, so far, there have been only two genome-wide association studies (GWASs) on the susceptibility of COVID-19. Starting with these reported DNA variants, we found the genes regulated by these variants through cis-eQTL and cis-meQTL acting. We further did a series of bioinformatics analysis on these potential risk genes. The analysis shows that the genetic variants on EHF regulate the expression of its neighbor CAT gene via cis-eQTL. There was significant evidence that CAT and the SARS-CoV-2-related S protein binding protein ACE2 interact with each other. Intracellular localization results showed that CAT and ACE2 proteins both exists in the cell membrane and extracellular area and their interaction could have an impact on the cell invasion ability of S protein. In addition, the expression of these three genes showed a significant positive correlation in the lungs. Based on these results, we propose that CAT plays a crucial intermediary role in binding effectiveness of ACE2, thereby affecting the susceptibility to COVID-19.
Asunto(s)
COVID-19 , Catalasa , Regulación Enzimológica de la Expresión Génica , Predisposición Genética a la Enfermedad , Polimorfismo Genético , SARS-CoV-2/metabolismo , Enzima Convertidora de Angiotensina 2/genética , Enzima Convertidora de Angiotensina 2/metabolismo , COVID-19/genética , COVID-19/metabolismo , Catalasa/biosíntesis , Catalasa/genética , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Estudios Retrospectivos , Glicoproteína de la Espiga del Coronavirus/genética , Glicoproteína de la Espiga del Coronavirus/metabolismo , Factores de Transcripción/genética , Factores de Transcripción/metabolismoRESUMEN
Methylation quantitative trait loci (mQTLs) are essential for understanding the role of DNA methylation changes in genetic predisposition, yet they have not been fully characterized in East Asians (EAs). Here we identified mQTLs in whole blood from 3,523 Chinese individuals and replicated them in additional 1,858 Chinese individuals from two cohorts. Over 9% of mQTLs displayed specificity to EAs, facilitating the fine-mapping of EA-specific genetic associations, as shown for variants associated with height. Trans-mQTL hotspots revealed biological pathways contributing to EA-specific genetic associations, including an ERG-mediated 233 trans-mCpG network, implicated in hematopoietic cell differentiation, which likely reflects binding efficiency modulation of the ERG protein complex. More than 90% of mQTLs were shared between different blood cell lineages, with a smaller fraction of lineage-specific mQTLs displaying preferential hypomethylation in the respective lineages. Our study provides new insights into the mQTL landscape across genetic ancestries and their downstream effects on cellular processes and diseases/traits.
Asunto(s)
Metilación de ADN , Pueblos del Este de Asia , Sitios de Carácter Cuantitativo , Femenino , Humanos , Masculino , Pueblos del Este de Asia/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/métodos , Herencia Multifactorial , Polimorfismo de Nucleótido SimpleRESUMEN
Background: The immune responses to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are crucial in maintaining a delicate balance between protective effects and harmful pathological reactions that drive the progression of coronavirus disease 2019 (COVID-19). T cells play a significant role in adaptive antiviral immune responses, making it valuable to investigate the heterogeneity and diversity of SARS-CoV-2-specific T cell responses in COVID-19 patients with varying disease severity. Methods: In this study, we employed high-throughput T cell receptor (TCR) ß repertoire sequencing to analyze TCR profiles in the peripheral blood of 192 patients with COVID-19, including those with moderate, severe, or critical symptoms, and compared them with 81 healthy controls. We specifically focused on SARS-CoV-2-associated TCR clonotypes. Results: We observed a decrease in the diversity of TCR clonotypes in COVID-19 patients compared to healthy controls. However, the overall abundance of dominant clones increased with disease severity. Additionally, we identified significant differences in the genomic rearrangement of variable (V), joining (J), and VJ pairings between the patient groups. Furthermore, the SARS-CoV-2-associated TCRs we identified enabled accurate differentiation between COVID-19 patients and healthy controls (AUC > 0.98) and distinguished those with moderate symptoms from those with more severe forms of the disease (AUC > 0.8). These findings suggest that TCR repertoires can serve as informative biomarkers for monitoring COVID-19 progression. Conclusions: Our study provides valuable insights into TCR repertoire signatures that can be utilized to assess host immunity to COVID-19. These findings have important implications for the use of TCR ß repertoires in monitoring disease development and indicating disease severity.
Asunto(s)
COVID-19 , Humanos , SARS-CoV-2 , Linfocitos T , Receptores de Antígenos de Linfocitos T/genética , Gravedad del PacienteRESUMEN
Amyotrophic lateral sclerosis (ALS) is a fatal progressive multisystem disorder with limited therapeutic options. Although genome-wide association studies (GWASs) have revealed multiple ALS susceptibility loci, the exact identities of causal variants, genes, cell types, tissues, and their functional roles in the development of ALS remain largely unknown. Here, we reported a comprehensive post-GWAS analysis of the recent large ALS GWAS (n = 80,610), including functional mapping and annotation (FUMA), transcriptome-wide association study (TWAS), colocalization (COLOC), and summary data-based Mendelian randomization analyses (SMR) in extensive multi-omics datasets. Gene property analysis highlighted inhibitory neuron 6, oligodendrocytes, and GABAergic neurons (Gad1/Gad2) as functional cell types of ALS and confirmed cerebellum and cerebellar hemisphere as functional tissues of ALS. Functional annotation detected the presence of multiple deleterious variants at three loci (9p21.2, 12q13.3, and 12q14.2) and highlighted a list of SNPs that are potentially functional. TWAS, COLOC, and SMR identified 43 genes at 24 loci, including 23 novel genes and 10 novel loci, showing significant evidence of causality. Integrating multiple lines of evidence, we further proposed that rs2453555 at 9p21.2 and rs229243 at 14q12 functionally contribute to the development of ALS by regulating the expression of C9orf72 in pituitary and SCFD1 in skeletal muscle, respectively. Together, these results advance our understanding of the biological etiology of ALS, feed into new therapies, and provide a guide for subsequent functional experiments.