ABSTRACT
Stroke, characterized by sudden neurological deficits, is the second leading cause of death worldwide. Although genome-wide association studies (GWAS) have successfully identified many genomic regions associated with ischemic stroke (IS), the genes underlying risk and their regulatory mechanisms remain elusive. Here, we integrate a large-scale GWAS (N = 1 296 908) for IS together with molecular QTLs data, including mRNA, splicing, enhancer RNA (eRNA), and protein expression data from up to 50 tissues (total N = 11 588). We identify 136 genes/eRNA/proteins associated with IS risk across 60 independent genomic regions and find IS risk is most enriched for eQTLs in arterial and brain-related tissues. Focusing on IS-relevant tissues, we prioritize 9 genes/proteins using probabilistic fine-mapping TWAS analyses. In addition, we discover that blood cell traits, particularly reticulocyte cells, have shared genetic contributions with IS using TWAS-based pheWAS and genetic correlation analysis. Lastly, we integrate our findings with a large-scale pharmacological database and identify a secondary bile acid, deoxycholic acid, as a potential therapeutic component. Our work highlights IS risk genes/splicing-sites/enhancer activity/proteins with their phenotypic consequences using relevant tissues as well as identify potential therapeutic candidates for IS.
Subject(s)
Ischemic Stroke , Transcriptome , Humans , Genome-Wide Association Study , Ischemic Stroke/genetics , Genomics , Phenotype , Genetic Predisposition to Disease , Polymorphism, Single Nucleotide/geneticsABSTRACT
Genome-wide association studies (GWASs) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra-large-scale biobanks has been challenging. Here, we propose FactorGo, a scalable variational factor analysis model to identify and characterize pleiotropic components using biobank GWAS summary data. In extensive simulations, we observe that FactorGo outperforms the state-of-the-art (model-free) approach tSVD in capturing latent pleiotropic factors across phenotypes while maintaining a similar computational cost. We apply FactorGo to estimate 100 latent pleiotropic factors from GWAS summary data of 2,483 phenotypes measured in European-ancestry Pan-UK BioBank individuals (NĀ = 420,531). Next, we find that factors from FactorGo are more enriched with relevant tissue-specific annotations than those identified by tSVD (pĀ = 2.58E-10) and validate our approach by recapitulating brain-specific enrichment for BMI and the height-related connection between reproductive system and muscular-skeletal growth. Finally, our analyses suggest shared etiologies between rheumatoid arthritis and periodontal condition in addition to alkaline phosphatase as a candidate prognostic biomarker for prostate cancer. Overall, FactorGo improves our biological understanding of shared etiologies across thousands of GWASs.
Subject(s)
Arthritis, Rheumatoid , Genome-Wide Association Study , Male , Humans , Genome-Wide Association Study/methods , Multifactorial Inheritance , Phenotype , Brain , Arthritis, Rheumatoid/genetics , Polymorphism, Single Nucleotide/genetics , Genetic PleiotropyABSTRACT
Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75-10.05, p = 5.41x10-7). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights.
Subject(s)
COVID-19 , Exome , Humans , Exome/genetics , Genome-Wide Association Study , COVID-19/genetics , Genetic Predisposition to Disease , Toll-Like Receptor 7/genetics , SARS-CoV-2/geneticsABSTRACT
BACKGROUND: Researchers have long studied the regulatory processes of genes to uncover their functions. Gene regulatory network analysis is one of the popular approaches for understanding these processes, requiring accurate identification of interactions among the genes to establish the gene regulatory network. Advances in genome-wide association studies and expression quantitative trait loci studies have led to a wealth of genomic data, facilitating more accurate inference of gene-gene interactions. However, unknown confounding factors may influence these interactions, making their interpretation complicated. Mendelian randomization (MR) has emerged as a valuable tool for causal inference in genetics, addressing confounding effects by estimating causal relationships using instrumental variables. In this paper, we propose a new statistical method, MR-GGI, for accurately inferring gene-gene interactions using Mendelian randomization. RESULTS: MR-GGI applies one gene as the exposure and another as the outcome, using causal cis-single-nucleotide polymorphisms as instrumental variables in the inverse-variance weighted MR model. Through simulations, we have demonstrated MR-GGI's ability to control type 1 error and maintain statistical power despite confounding effects. MR-GGI performed the best when compared to other methods using the F1 score on the DREAM5 dataset. Additionally, when applied to yeast genomic data, MR-GGI successfully identified six clusters. Through gene ontology analysis, we have confirmed that each cluster in our study performs distinct functional roles by gathering genes with specific functions. CONCLUSION: These findings demonstrate that MR-GGI accurately inferences gene-gene interactions despite the confounding effects in real biological environments.
Subject(s)
Mendelian Randomization Analysis , Polymorphism, Single Nucleotide , Genome-Wide Association Study/methods , Gene Regulatory Networks/genetics , Epistasis, Genetic/genetics , Quantitative Trait Loci , Humans , Saccharomyces cerevisiae/geneticsABSTRACT
Systemic juvenile idiopathic arthritis (sJIA) is a rare subtype of juvenile idiopathic arthritis, whose clinical features are systemic fever and rash accompanied by painful joints and inflammation. Even though sJIA has been reported to be an autoinflammatory disorder, its exact pathogenesis remains unclear. In this study, we integrated a meta-analysis with a weighted gene co-expression network analysis (WGCNA) using 5 microarray datasets and an RNA sequencing dataset to understand the interconnection of susceptibility genes for sJIA. Using the integrative analysis, we identified a robust sJIA signature that consisted of 2 co-expressed gene sets comprising 103 up-regulated genes and 25 down-regulated genes in sJIA patients compared with healthy controls. Among the 128 sJIA signature genes, we identified an up-regulated cluster of 11 genes and a down-regulated cluster of 4 genes, which may play key roles in the pathogenesis of sJIA. We then detected 10 bioactive molecules targeting the significant gene clusters as potential novel drug candidates for sJIA using an in silico drug repositioning analysis. These findings suggest that the gene clusters may be potential genetic markers of sJIA and 10 drug candidates can contribute to the development of new therapeutic options for sJIA.
Subject(s)
Arthritis, Juvenile/drug therapy , Arthritis, Juvenile/genetics , Genetic Markers/genetics , Transcriptome/genetics , Down-Regulation/genetics , Drug Discovery/methods , Gene Expression Profiling/methods , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Humans , Microarray Analysis/methods , Up-Regulation/geneticsABSTRACT
As major components of spider venoms, neurotoxic peptides exhibit structural diversity, target specificity, and have great pharmaceutical potential. Deep learning may be an alternative to the laborious and time-consuming methods for identifying these peptides. However, the major hurdle in developing a deep learning model is the limited data on neurotoxic peptides. Here, we present a peptide data augmentation method that improves the recognition of neurotoxic peptides via a convolutional neural network model. The neurotoxic peptides were augmented with the known neurotoxic peptides from UniProt database, and the models were trained using a training set with or without the generated sequences to verify the augmented data. The model trained with the augmented dataset outperformed the one with the unaugmented dataset, achieving accuracy of 0.9953, precision of 0.9922, recall of 0.9984, and F1 score of 0.9953 in simulation dataset. From the set of all RNA transcripts of Callobius koreanus spider, we discovered neurotoxic peptides via the model, resulting in 275 putative peptides of which 252 novel sequences and only 23 sequences showing homology with the known peptides by Basic Local Alignment Search Tool. Among these 275 peptides, four were selected and shown to have neuromodulatory effects on the human neuroblastoma cell line SH-SY5Y. The augmentation method presented here may be applied to the identification of other functional peptides from biological resources with insufficient data.
Subject(s)
Databases, Protein , Deep Learning , Neurotoxins , Peptides , Spider Venoms , Spiders , Animals , Neurotoxins/chemistry , Neurotoxins/genetics , Peptides/chemistry , Peptides/genetics , Spider Venoms/chemistry , Spider Venoms/genetics , Spiders/chemistry , Spiders/geneticsABSTRACT
BACKGROUND: Regulatory hotspots are genetic variations that may regulate the expression levels of many genes. It has been of great interest to find those hotspots utilizing expression quantitative trait locus (eQTL) analysis. However, it has been reported that many of the findings are spurious hotspots induced by various unknown confounding factors. Recently, methods utilizing complicated statistical models have been developed that successfully identify genuine hotspots. Next-generation Intersample Correlation Emended (NICE) is one of the methods that show high sensitivity and low false-discovery rate in finding regulatory hotspots. Even though the methods successfully find genuine hotspots, they have not been widely used due to their non-user-friendly interfaces and complex running processes. Furthermore, most of the methods are impractical due to their prohibitively high computational complexity. RESULTS: To overcome the limitations of existing methods, we developed a fully automated web-based tool, referred to as NICER (NICE Renew), which is based on NICE program. First, we dramatically reduced running and installing burden of NICE. Second, we significantly reduced running time by incorporating multi-processing. Third, besides our web-based NICER, users can use NICER on Google Compute Engine and can readily install and run the NICER web service on their local computers. Finally, we provide different input formats and visualizations tools to show results. Utilizing a yeast dataset, we show that NICER can be successfully used in an eQTL analysis to identify many genuine regulatory hotspots, for which more than half of the hotspots were previously reported elsewhere. CONCLUSIONS: Even though many hotspot analysis tools have been proposed, they have not been widely used for many practical reasons. NICER is a fully-automated web-based solution for eQTL mapping and regulatory hotspots analysis. NICER provides a user-friendly interface and has made hotspot analysis more viable by reducing the running time significantly. We believe that NICER will become the method of choice for increasing power of eQTL hotspot analysis.
Subject(s)
Quantitative Trait Loci , Saccharomyces cerevisiae , Chromosome Mapping , Internet , Models, Statistical , Saccharomyces cerevisiae/geneticsABSTRACT
The recent generation of induced neurons by direct lineage conversion holds promise for in vitro modelling of sporadic Alzheimer's disease. Here, we report the generation of induced neuron-based model of sporadic Alzheimer's disease in mice and humans, and used this system to explore the pathogenic mechanisms resulting from the sporadic Alzheimer's disease risk factor apolipoprotein E (APOE) ĆĀ3/4 allele. We show that mouse and human induced neurons overexpressing mutant amyloid precursor protein in the background of APOE ĆĀ3/4 allele exhibit altered amyloid precursor protein (APP) processing, abnormally increased production of amyloid-Ć42 and hyperphosphorylation of tau. Importantly, we demonstrate that APOE ĆĀ3/4 patient induced neuron culture models can faithfully recapitulate molecular signatures seen in APOE ĆĀ3/4-associated sporadic Alzheimer's disease patients. Moreover, analysis of the gene network derived from APOE ĆĀ3/4 patient induced neurons reveals a strong interaction between APOE ĆĀ3/4 and another Alzheimer's disease risk factor, desmoglein 2 (DSG2). Knockdown of DSG2 in APOE ĆĀ3/4 induced neurons effectively rescued defective APP processing, demonstrating the functional importance of this interaction. These data provide a direct connection between APOE ĆĀ3/4 and another Alzheimer's disease susceptibility gene and demonstrate in proof of principle the utility of induced neuron-based modelling of Alzheimer's disease for therapeutic discovery.
Subject(s)
Alleles , Alzheimer Disease/genetics , Alzheimer Disease/metabolism , Apolipoprotein E3/genetics , Apolipoprotein E4/genetics , Neurons/metabolism , Amyloid beta-Peptides/biosynthesis , Amyloid beta-Protein Precursor/biosynthesis , Amyloid beta-Protein Precursor/metabolism , Animals , Cells, Cultured , Cellular Reprogramming Techniques , Desmoglein 2/genetics , Fibroblasts/cytology , Gene Knockdown Techniques , Humans , Mice , Models, Neurological , Peptide Fragments/biosynthesis , Phosphorylation , tau Proteins/metabolismABSTRACT
BACKGROUND: Juvenile idiopathic arthritis (JIA) is one of the most prevalent rheumatic disorders in children and is classified as an autoimmune disease (AID). While a robust genetic contribution to JIA etiology has been established, the exact pathogenesis remains unclear. METHODS: To prioritize biologically interpretable susceptibility genes and proteins for JIA, we conducted transcriptome-wide and proteome-wide association studies (TWAS/PWAS). Then, to understand the genetic architecture of JIA, we systematically analyzed single-nucleotide polymorphism (SNP)-based heritability, a signature of natural selection, and polygenicity. Next, we conducted HLA typing using multi-ethnicity RNA sequencing data. Additionally, we examined the T cell receptor (TCR) repertoire at a single-cell level to explore the potential links between immunity and JIA risk. RESULTS: We have identified 19 TWAS genes and two PWAS proteins associated with JIA risks. Furthermore, we observe that the heritability and cell type enrichment analysis of JIA are enriched in T lymphocytes and HLA regions and that JIA shows higher polygenicity compared to other AIDs. In multi-ancestry HLA typing, B*45:01 is more prevalent in African JIA patients than in European JIA patients, whereas DQA1*01:01, DQA1*03:01, and DRB1*04:01 exhibit a higher frequency in European JIA patients. Using single-cell immune repertoire analysis, we identify clonally expanded T cell subpopulations in JIA patients, including CXCL13+BHLHE40+ TH cells which are significantly associated with JIA risks. CONCLUSION: Our findings shed new light on the pathogenesis of JIA and provide a strong foundation for future mechanistic studies aimed at uncovering the molecular drivers of JIA.
Subject(s)
Arthritis, Juvenile , Child , Humans , Arthritis, Juvenile/genetics , Genetic Predisposition to Disease/genetics , Proteins/genetics , AllelesABSTRACT
Colorectal cancer (CRC) is one of the top five most common and life-threatening malignancies worldwide. Most CRC develops from advanced colorectal adenoma (ACA), a precancerous stage, through the adenoma-carcinoma sequence. However, its underlying mechanisms, including how the tumor microenvironment changes, remain elusive. Therefore, we conducted an integrative analysis comparing RNA-seq data collected from 40 ACA patients who visited Dongguk University Ilsan Hospital with normal adjacent colons and tumor samples from 18 CRC patients collected from a public database. Differential expression analysis identified 21 and 79 sequentially up- or down-regulated genes across the continuum, respectively. The functional centrality of the continuum genes was assessed through network analysis, identifying 11 up- and 13 down-regulated hub-genes. Subsequently, we validated the prognostic effects of hub-genes using the Kaplan-Meier survival analysis. To estimate the immunological transition of the adenoma-carcinoma sequence, single-cell deconvolution and immune repertoire analyses were conducted. Significant composition changes for innate immunity cells and decreased plasma B-cells with immunoglobulin diversity were observed, along with distinctive immunoglobulin recombination patterns. Taken together, we believe our findings suggest underlying transcriptional and immunological changes during the adenoma-carcinoma sequence, contributing to the further development of pre-diagnostic markers for CRC.
Subject(s)
Adenoma , Colorectal Neoplasms , Computational Biology , Gene Expression Regulation, Neoplastic , Humans , Colorectal Neoplasms/genetics , Colorectal Neoplasms/immunology , Colorectal Neoplasms/pathology , Adenoma/genetics , Adenoma/immunology , Adenoma/pathology , Republic of Korea , Computational Biology/methods , Male , Female , Tumor Microenvironment/genetics , Tumor Microenvironment/immunology , Prognosis , Middle Aged , Aged , Biomarkers, Tumor/genetics , Kaplan-Meier Estimate , Gene Expression ProfilingABSTRACT
Accurate identification of human leukocyte antigen (HLA) alleles is essential for various clinical and research applications, such as transplant matching and drug sensitivities. Recent advances in RNA-seq technology have made it possible to impute HLA types from sequencing data, spurring the development of a large number of computational HLA typing tools. However, the relative performance of these tools is unknown, limiting the ability for clinical and biomedical research to make informed choices regarding which tools to use. Here we report the study design of a comprehensive benchmarking of the performance of 12 HLA callers across 682 RNA-seq samples from 8 datasets with molecularly defined gold standard at 5 loci, HLA-A, -B, -C, -DRB1, and -DQB1. For each HLA typing tool, we will comprehensively assess their accuracy, compare default with optimized parameters, and examine for discrepancies in accuracy at the allele and loci levels. We will also evaluate the computational expense of each HLA caller measured in terms of CPU time and RAM. We also plan to evaluate the influence of read length over the HLA region on accuracy for each tool. Most notably, we will examine the performance of HLA callers across European and African groups, to determine discrepancies in accuracy associated with ancestry. We hypothesize that RNA-Seq HLA callers are capable of returning high-quality results, but the tools that offer a good balance between accuracy and computational expensiveness for all ancestry groups are yet to be developed. We believe that our study will provide clinicians and researchers with clear guidance to inform their selection of an appropriate HLA caller.
ABSTRACT
Stroke, characterized by sudden neurological deficits, is the second leading cause of death worldwide. Although genome-wide association studies (GWAS) have successfully identified many genomic regions associated with ischemic stroke (IS), the genes underlying risk and their regulatory mechanisms remain elusive. Here, we integrate a large-scale GWAS (N=1,296,908) for IS together with mRNA, splicing, enhancer RNA (eRNA) and protein expression data (N=11,588) from 50 tissues. We identify 136 genes/eRNA/proteins associated with IS risk across 54 independent genomic regions and find IS risk is most enriched for eQTLs in arterial and brain-related tissues. Focusing on IS-relevant tissues, we prioritize 9 genes/proteins using probabilistic fine-mapping TWAS analyses. In addition, we discover that blood cell traits, particularly reticulocyte cells, have shared genetic contributions with IS using TWAS-based pheWAS and genetic correlation analysis. Lastly, we integrate our findings with a large-scale pharmacological database and identify a secondary bile acid, deoxycholic acid, as a potential therapeutic component. Our work highlights IS risk genes/splicing-sites/enhancer activity/proteins with their phenotypic consequences using relevant tissues as well as identify potential therapeutic candidates for IS.
ABSTRACT
Genome-wide association studies (GWAS) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra large-scale biobanks has been challenging. Here, we propose FactorGo, a scalable variational factor analysis model to identify and characterize pleiotropic components using biobank GWAS summary data. In extensive simulations, we observe that FactorGo outperforms the state-of-the-art (model-free) approach tSVD in capturing latent pleiotropic factors across phenotypes, while maintaining a similar computational cost. We apply FactorGo to estimate 100 latent pleiotropic factors from GWAS summary data of 2,483 phenotypes measured in European-ancestry Pan-UK BioBank individuals (N=420,531). Next, we find that factors from FactorGo are more enriched with relevant tissue-specific annotations than those identified by tSVD (P=2.58E-10), and validate our approach by recapitulating brain-specific enrichment for BMI and the height-related connection between reproductive system and muscular-skeletal growth. Finally, our analyses suggest novel shared etiologies between rheumatoid arthritis and periodontal condition, in addition to alkaline phosphatase as a candidate prognostic biomarker for prostate cancer. Overall, FactorGo improves our biological understanding of shared etiologies across thousands of GWAS.
ABSTRACT
Cerebral adrenoleukodystrophy (cALD) is a rare neurodegenerative disease characterized by inflammatory demyelination in the central nervous system. Another neurodegenerative disease with a high prevalence, Alzheimer's disease (AD), shares many common features with cALD such as cognitive impairment and the alleviation of symptoms by erucic acid. We investigated cALD and AD in parallel to study the shared pathological pathways between a rare disease and a more common disease. The approach may expand the biological understandings and reveal novel therapeutic targets. Gene set enrichment analysis (GSEA) and weighted gene correlation network analysis (WGCNA) were conducted to identify both the resemblance in gene expression patterns and genes that are pathologically relevant in the two diseases. Within differentially expressed genes (DEGs), GSEA identified 266 common genes with similar up- or down-regulation patterns in cALD and AD. Among the interconnected genes in AD data, two gene sets containing 1,486 genes preserved in cALD data were selected by WGCNA that may significantly affect the development and progression of cALD. WGCNA results filtered by functional correlation via protein-protein interaction analysis overlapping with GSEA revealed four genes (annexin A5, beta-2-microglobulin, CD44 molecule, and fibroblast growth factor 2) that showed robust associations with the pathogeneses of cALD and AD, where they were highly involved in inflammation, apoptosis, and the mitogen-activated protein kinase pathway. This study provided an integrated strategy to provide new insights into a rare disease with scant publicly available data (cALD) using a more prevalent disorder with some pathological association (AD), which suggests novel druggable targets and drug candidates.
ABSTRACT
Atopic dermatitis (AD) is one of the most common inflammatory skin diseases, which significantly impact the quality of life. Transcriptome-wide association study (TWAS) was conducted to estimate both transcriptomic and genomic features of AD and detected significant associations between 31 expression quantitative loci and 25 genes. Our results replicated well-known genetic markers for AD, as well as 4 novel associated genes. Next, transcriptome meta-analysis was conducted with 5 studies retrieved from public databases and identified 5 additional novel susceptibility genes for AD. Applying the connectivity map to the results from TWAS and meta-analysis, robustly enriched perturbations were identified and their chemical or functional properties were analyzed. Here, we report the first research on integrative approaches for an AD, combining TWAS and transcriptome meta-analysis. Together, our findings could provide a comprehensive understanding of the pathophysiologic mechanisms of AD and suggest potential drug candidates as alternative treatment options.
Subject(s)
Dermatitis, Atopic , Transcriptome , Dermatitis, Atopic/drug therapy , Dermatitis, Atopic/genetics , Dermatitis, Atopic/metabolism , Drug Repositioning , Genome-Wide Association Study/methods , Humans , Quality of LifeABSTRACT
In standard genome-wide association studies (GWAS), the standard association test is underpowered to detect associations between loci with multiple causal variants with small effect sizes. We propose a statistical method, Model-based Association test Reflecting causal Status (MARS), that finds associations between variants in risk loci and a phenotype, considering the causal status of variants, only requiring the existing summary statistics to detect associated risk loci. Utilizing extensive simulated data and real data, we show that MARS increases the power of detecting true associated risk loci compared to previous approaches that consider multiple variants, while controlling the type I error.
Subject(s)
Alleles , Genetic Association Studies/methods , Genetic Heterogeneity , Models, Genetic , Models, Statistical , Algorithms , Genome-Wide Association Study/methods , HumansABSTRACT
A recent report found that rare predicted loss-of-function (pLOF) variants across 13 candidate genes in TLR3- and IRF7-dependent type I IFN pathways explain up to 3.5% of severe COVID-19 cases. We performed whole-exome or whole-genome sequencing of 1,864 COVID-19 cases (713 with severe and 1,151 with mild disease) and 15,033 ancestry-matched population controls across 4 independent COVID-19 biobanks. We tested whether rare pLOF variants in these 13 genes were associated with severe COVID-19. We identified only 1 rare pLOF mutation across these genes among 713 cases with severe COVID-19 and observed no enrichment of pLOFs in severe cases compared to population controls or mild COVID-19 cases. We found no evidence of association of rare LOF variants in the 13 candidate genes with severe COVID-19 outcomes.
Subject(s)
COVID-19/genetics , COVID-19/immunology , Interferon Type I/genetics , Interferon Type I/immunology , Loss of Function Mutation , SARS-CoV-2 , Adolescent , Adult , Aged , Aged, 80 and over , Case-Control Studies , Child , Child, Preschool , Cohort Studies , Female , Genetic Association Studies , Genetic Predisposition to Disease , Humans , Infant , Infant, Newborn , Interferon Regulatory Factor-7/genetics , Male , Middle Aged , Severity of Illness Index , Toll-Like Receptor 3/genetics , Exome Sequencing , Whole Genome Sequencing , Young AdultABSTRACT
BACKGROUND: Dupuytren's disease (DD) is a fibroproliferative disorder characterized by thickening and contracting palmar fascia. The exact pathogenesis of DD remains unknown. RESULTS: In this study, we identified co-expressed gene set (DD signature) consisting of 753 genes via weighted gene co-expression network analysis. To confirm the robustness of DD signature, module enrichment analysis and meta-analysis were performed. Moreover, this signature effectively classified DD disease samples. The DD signature were significantly enriched in unfolded protein response (UPR) related to endoplasmic reticulum (ER) stress. Next, we conducted multiple-phenotype regression analysis to identify trans-regulatory hotspots regulating expression levels of DD signature using Genotype-Tissue Expression data. Finally, 10 trans-regulatory hotspots and 16 eGenes genes that are significantly associated with at least one cis-eQTL were identified. CONCLUSIONS: Among these eGenes, major histocompatibility complex class II genes and ZFP57 zinc finger protein were closely related to ER stress and UPR, suggesting that these genetic markers might be potential therapeutic targets for DD.
Subject(s)
Dupuytren Contracture/genetics , Gene Expression Profiling , Genetic Markers/genetics , Genomics , Animals , Gene Regulatory Networks , HumansABSTRACT
Gold nanoparticles (AuNPs) with diverse physicochemical properties are reported to affect biological systems differently, but the relationship between the physicochemical properties of AuNPs and their biological effects is not clearly understood. Here, we aimed to elucidate the molecular origins of AuNP-induced cytotoxicity and their mechanisms, focusing on the surface charge and structural properties of modified AuNPs. We prepared a library of well-tailored AuNPs exhibiting various functional groups and surface charges. Through this work, we revealed that the direction or the magnitude of surface charge is not an exclusive factor that determines the cytotoxicity of AuNPs. We, instead, suggested that toxic AuNPs share a common structural characteristics of a hydrophobic moiety neighbouring the positive charge, which can induce lytic interaction with plasma membrane. Mechanistic study showed that the toxic AuNPs interfered with the formation of cytoskeletal structure to slow cell migration, inhibited DNA replication and caused DNA damage via oxidative stress to hinder cell proliferation. Gene expression analysis showed that the toxic AuNPs down-regulated genes associated with cell cycle processes. We discovered structural characteristics that define the cytotoxic AuNPs and suggested the mechanisms of their cytotoxicity. These findings will help us to understand and to predict the biological effects of modified AuNPs based on their physicochemical properties.