RESUMEN
A novel algorithm, AlphaMissense, has been shown to have an improved ability to predict the pathogenicity of rare missense genetic variants. However, it is not known whether AlphaMissense improves the ability of gene-based testing to identify disease-influencing genes. Using whole-exome sequencing data from the UK Biobank, we compared gene-based association analysis strategies including sets of deleterious variants: predicted loss-of-function (pLoF) variants only, pLoF plus AlphaMissense pathogenic variants, pLoF with missense variants predicted to be deleterious by any of five commonly utilized annotation methods (Missense (1/5)) or only variants predicted to be deleterious by all five methods (Missense (5/5)). We measured performance to identify 519 previously identified positive control genes, which can lead to Mendelian diseases, or are the targets of successfully developed medicines. These strategies identified 0.85 million pLoF variants and 5 million deleterious missense variants, including 22,131 likely pathogenic missense variants identified exclusively by AlphaMissense. The gene-based association tests found 608 significant gene associations (at p < 1.25 × 10-7) across 24 common traits and diseases. Compared with pLoFs plus Missense (5/5), tests using pLoFs and AlphaMissense variants found slightly more significant gene-disease and gene-trait associations, albeit with a marginally lower proportion of positive control genes. Nevertheless, their overall performance was similar. Merging AlphaMissense with Missense (5/5), whether through their intersection or union, did not yield any further enhancement in performance. In summary, employing AlphaMissense to select deleterious variants for gene-based testing did not improve the ability to identify genes that are known to influence disease.
Asunto(s)
Predisposición Genética a la Enfermedad , Mutación Missense , Humanos , Mutación Missense/genética , Predisposición Genética a la Enfermedad/genética , Algoritmos , Secuenciación del Exoma/métodos , Estudio de Asociación del Genoma Completo/métodos , Biología Computacional/métodosRESUMEN
DNA methylation plays an essential role in regulating gene activity, modulating disease risk, and determining treatment response. We can obtain insight into methylation patterns at a single-nucleotide level via next-generation sequencing technologies. However, complex features inherent in the data obtained via these technologies pose challenges beyond the typical big data problems. Identifying differentially methylated cytosines (dmc) or regions is one such challenge. We have developed DMCFB, an efficient dmc identification method based on Bayesian functional regression, to tackle these challenges. Using simulations, we establish that DMCFB outperforms current methods and results in better smoothing and efficient imputation. We analyzed a dataset of patients with acute promyelocytic leukemia and control samples. With DMCFB, we discovered many new dmcs and, more importantly, exhibited enhanced consistency of differential methylation within islands and their adjacent shores. Additionally, we detected differential methylation at more of the binding sites of the fused gene involved in this cancer.
Asunto(s)
Teorema de Bayes , Metilación de ADN , Epigénesis Genética , Metilación de ADN/genética , Humanos , Leucemia Promielocítica Aguda/genéticaRESUMEN
Motivated by a DNA methylation application, this article addresses the problem of fitting and inferring a multivariate binomial regression model for outcomes that are contaminated by errors and exhibit extra-parametric variations, also known as dispersion. While dispersion in univariate binomial regression has been extensively studied, addressing dispersion in the context of multivariate outcomes remains a complex and relatively unexplored task. The complexity arises from a noteworthy data characteristic observed in our motivating dataset: non-constant yet correlated dispersion across outcomes. To address this challenge and account for possible measurement error, we propose a novel hierarchical quasi-binomial varying coefficient mixed model, which enables flexible dispersion patterns through a combination of additive and multiplicative dispersion components. To maximize the Laplace-approximated quasi-likelihood of our model, we further develop a specialized two-stage expectation-maximization (EM) algorithm, where a plug-in estimate for the multiplicative scale parameter enhances the speed and stability of the EM iterations. Simulations demonstrated that our approach yields accurate inference for smooth covariate effects and exhibits excellent power in detecting non-zero effects. Additionally, we applied our proposed method to investigate the association between DNA methylation, measured across the genome through targeted custom capture sequencing of whole blood, and levels of anti-citrullinated protein antibodies (ACPA), a preclinical marker for rheumatoid arthritis (RA) risk. Our analysis revealed 23 significant genes that potentially contribute to ACPA-related differential methylation, highlighting the relevance of cell signaling and collagen metabolism in RA. We implemented our method in the R Bioconductor package called "SOMNiBUS."
Asunto(s)
Algoritmos , Simulación por Computador , Metilación de ADN , Modelos Estadísticos , Humanos , Análisis Multivariante , Artritis Reumatoide/genética , Funciones de Verosimilitud , Sulfitos/química , Análisis de Secuencia de ADN/métodosRESUMEN
BACKGROUND: Preventive measures and treatments for psychiatric disorders are limited. Circulating metabolites are potential candidates for biomarker and therapeutic target identification, given their measurability and essential roles in biological processes. METHODS: Leveraging large-scale genome-wide association studies, we conducted Mendelian randomization analyses to assess the associations between circulating metabolite abundances and the risks of bipolar disorder, schizophrenia, and depression. Genetic instruments were selected for 94 metabolites measured in the Canadian Longitudinal Study on Aging cohort (N = 8299). We repeated Mendelian randomization analyses based on the UK Biobank, INTERVAL, and EPIC (European Prospective Investigation into Cancer)-Norfolk studies. RESULTS: After validating Mendelian randomization assumptions and colocalization evidence, we found that a 1 SD increase in genetically predicted circulating abundances of eicosapentaenoate and docosapentaenoate was associated with odds ratios of 0.72 (95% CI, 0.65-0.79) and 0.63 (95% CI, 0.55-0.72), respectively, for bipolar disorder. Genetically increased Ω-3 unsaturated fatty acids abundance and Ω-3-to-total fatty acids ratio, as well as genetically decreased Ω-6-to-Ω-3 ratio, were negatively associated with the risk of bipolar disorder in the UK Biobank. Genetically increased circulating abundances of 3 N-acetyl-amino acids were associated with an increased risk of schizophrenia with a maximum odds ratio of 1.31 (95% CI, 1.18-1.44) per 1 SD increase. Furthermore, a 1 SD increase in genetically predicted circulating abundance of hypotaurine was associated with an odds ratio of 0.85 (95% CI, 0.78-0.93) for depression. CONCLUSIONS: The biological mechanisms that underlie Ω-3 unsaturated fatty acids, NAT8-catalyzed N-acetyl-amino acids, and hypotaurine warrant exploration to identify new biomarkers and potential therapeutic targets.
Asunto(s)
Trastorno Bipolar , Estudio de Asociación del Genoma Completo , Análisis de la Aleatorización Mendeliana , Esquizofrenia , Humanos , Trastorno Bipolar/genética , Trastorno Bipolar/sangre , Esquizofrenia/genética , Esquizofrenia/sangre , Esquizofrenia/epidemiología , Masculino , Femenino , Biomarcadores/sangre , Estudios Longitudinales , Persona de Mediana Edad , Canadá/epidemiología , AncianoRESUMEN
The HostSeq initiative recruited 10,059 Canadians infected with SARS-CoV-2 between March 2020 and March 2023, obtained clinical information on their disease experience and whole genome sequenced (WGS) their DNA. We analyzed the WGS data for genetic contributors to severe COVID-19 (considering 3,499 hospitalized cases and 4,975 non-hospitalized after quality control). We investigated the evidence for replication of loci reported by the International Host Genetics Initiative (HGI); analyzed the X chromosome; conducted rare variant gene-based analysis and polygenic risk score testing. Population stratification was adjusted for using meta-analysis across ancestry groups. We replicated two loci identified by the HGI for COVID-19 severity: the LZTFL1/SLC6A20 locus on chromosome 3 and the FOXP4 locus on chromosome 6 (the latter with a variant significant at P < 5E-8). We found novel significant associations with MRAS and WDR89 in gene-based analyses, and constructed a polygenic risk score that explained 1.01% of the variance in severe COVID-19. This study provides independent evidence confirming the robustness of previously identified COVID-19 severity loci by the HGI and identifies novel genes for further investigation.
Asunto(s)
COVID-19 , Pueblos de América del Norte , Humanos , COVID-19/genética , SARS-CoV-2/genética , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Canadá/epidemiología , Estudio de Asociación del Genoma Completo , Proteínas de Transporte de Membrana , Factores de Transcripción ForkheadRESUMEN
This study explored the interactions among prenatal stress, child sex, and polygenic risk scores (PGS) for attention-deficit/hyperactivity disorder (ADHD) on structural developmental changes of brain regions implicated in ADHD. We used data from two population-based birth cohorts: Growing Up in Singapore Towards healthy Outcomes (GUSTO) from Singapore (n = 113) and Generation R from Rotterdam, the Netherlands (n = 433). Prenatal stress was assessed using questionnaires. We obtained latent constructs of prenatal adversity and prenatal mood problems using confirmatory factor analyses. The participants were genotyped using genome-wide single nucleotide polymorphism arrays, and ADHD PGSs were computed. Magnetic resonance imaging scans were acquired at 4.5 and 6 years (GUSTO), and at 10 and 14 years (Generation R). We estimated the age-related rate of change for brain outcomes related to ADHD and performed (1) prenatal stress by sex interaction models, (2) prenatal stress by ADHD PGS interaction models, and (3) 3-way interaction models, including prenatal stress, sex, and ADHD PGS. We observed an interaction between prenatal stress and ADHD PGS on mean cortical thickness annual rate of change in Generation R (i.e., in individuals with higher ADHD PGS, higher prenatal stress was associated with a lower rate of cortical thinning, whereas in individuals with lower ADHD PGS, higher prenatal stress was associated with a higher rate of cortical thinning). None of the other tested interactions were statistically significant. Higher prenatal stress may promote a slower brain developmental rate during adolescence in individuals with higher ADHD genetic vulnerability, whereas it may promote a faster brain developmental rate in individuals with lower ADHD genetic vulnerability.
Asunto(s)
Trastorno por Déficit de Atención con Hiperactividad , Niño , Adolescente , Humanos , Trastorno por Déficit de Atención con Hiperactividad/genética , Adelgazamiento de la Corteza Cerebral , Encéfalo/diagnóstico por imagen , Puntuación de Riesgo Genético , Herencia MultifactorialRESUMEN
DNA methylation (DNAm) is a dynamic, age-dependent epigenetic modification that can be used to study interactions between genetic and environmental factors. Environmental exposures during critical periods of growth and development may alter DNAm patterns, leading to increased susceptibility to diseases such as asthma and allergies. One method to study the role of DNAm is the epigenetic clock-an algorithm that uses DNAm levels at select age-informative Cytosine-phosphate-Guanine (CpG) dinucleotides to predict epigenetic age (EA). The difference between EA and calendar age (CA) is termed epigenetic age acceleration (EAA) and reveals information about the biological capacity of an individual. Associations between EAA and disease susceptibility have been demonstrated for a variety of age-related conditions and, more recently, phenotypes such as asthma and allergic diseases, which often begin in childhood and progress throughout the lifespan. In this review, we explore different epigenetic clocks and how they have been applied, particularly as related to childhood asthma. We delve into how in utero and early life exposures (e.g., smoking, air pollution, maternal BMI) result in methylation changes. Furthermore, we explore the potential for EAA to be used as a biomarker for asthma and allergic diseases and identify areas for further study.
Asunto(s)
Contaminación del Aire , Asma , Hipersensibilidad , Humanos , Hipersensibilidad/genética , Asma/genética , Biomarcadores , Epigénesis GenéticaRESUMEN
Background: Both genetic and early life risk factors play important roles in the pathogenesis and progression of adult depression. However, the interplay between these risk factors and their added value to risk prediction models have not been fully elucidated. Methods: Leveraging a meta-analysis of major depressive disorder genome-wide association studies (N = 45,591 cases and 97,674 controls), we developed and optimized a polygenic risk score for depression using LDpred in a model selection dataset from the UK Biobank (N = 130,092 European ancestry individuals). In a UK Biobank test dataset (N = 278,730 European ancestry individuals), we tested whether the polygenic risk score and early life risk factors were associated with each other and compared their associations with depression phenotypes. Finally, we conducted joint predictive modeling to combine this polygenic risk score with early life risk factors by stepwise regression, and assessed the model performance in identifying individuals at high risk of depression. Results: In the UK Biobank test dataset, the polygenic risk score for depression was moderately associated with multiple early life risk factors. For instance, a one standard deviation increase in the polygenic risk score was associated with 1.16-fold increased odds of frequent domestic violence (95% CI: 1.14-1.19) and 1.09-fold increased odds of not having access to medical care as a child (95% CI: 1.05-1.14). However, the polygenic risk score was more strongly associated with depression phenotypes than most early life risk factors. A joint predictive model integrating the polygenic risk score, early life risk factors, age and sex achieved an AUROC of 0.6766 for predicting strictly defined major depressive disorder, while a model without the polygenic risk score and a model without any early life risk factors had an AUROC of 0.6593 and 0.6318, respectively. Conclusion: We have developed a polygenic risk score to partly capture the genetic liability to depression. Although genetic and early life risk factors can be correlated, joint predictive models improved risk stratification despite limited improvement in magnitude, and may be explored as tools to better identify individuals at high risk of depression.
RESUMEN
In this study, we leveraged the combined evidence of rare coding variants and common alleles to identify therapeutic targets for osteoporosis. We undertook a large-scale multiancestry exome-wide association study for estimated bone mineral density, which showed that the burden of rare coding alleles in 19 genes was associated with estimated bone mineral density (P < 3.6 × 10-7). These genes were highly enriched for a set of known causal genes for osteoporosis (65-fold; P = 2.5 × 10-5). Exome-wide significant genes had 96-fold increased odds of being the top ranked effector gene at a given GWAS locus (P = 1.8 × 10-10). By integrating proteomics Mendelian randomization evidence, we prioritized CD109 (cluster of differentiation 109) as a gene for which heterozygous loss of function is associated with higher bone density. CRISPR-Cas9 editing of CD109 in SaOS-2 osteoblast-like cell lines showed that partial CD109 knockdown led to increased mineralization. This study demonstrates that the convergence of common and rare variants, proteomics and CRISPR can highlight new bone biology to guide therapeutic development.
Asunto(s)
Predisposición Genética a la Enfermedad , Osteoporosis , Humanos , Secuenciación del Exoma , Osteoporosis/genética , Densidad Ósea/genética , Alelos , Factores de Transcripción/genética , Estudio de Asociación del Genoma CompletoRESUMEN
CONTEXT: Effects of modest alcohol consumption remain controversial. Mendelian randomization (MR) can help to mitigate biases due to confounding and reverse causation in observational studies, and evaluate the potential causal role of alcohol consumption. OBJECTIVE: This work aimed to evaluate dose-dependent effect of alcohol consumption on obesity and type 2 diabetes. METHODS: Assessing 408 540 participants of European ancestry in the UK Biobank, we first tested the association between self-reported alcohol intake frequency and 10 anthropometric measurements, obesity, and type 2 diabetes. We then conducted MR analyses both in the overall population and in subpopulations stratified by alcohol intake frequency. RESULTS: Among individuals having more than 14 drinks per week, a 1-drink-per-week increase in genetically predicted alcohol intake frequency was associated with a 0.36-kg increase in fat mass (SD = 0.03 kg), a 1.08-fold increased odds of obesity (95% CI, 1.06-1.10), and a 1.10-fold increased odds of type 2 diabetes (95% CI, 1.06-1.13). These associations were stronger in women than in men. Furthermore, no evidence was found supporting the association between genetically increased alcohol intake frequency and improved health outcomes among individuals having 7 or fewer drinks per week, as MR estimates largely overlapped with the null. These results withstood multiple sensitivity analyses assessing the validity of MR assumptions. CONCLUSION: As opposed to observational associations, MR results suggest there may not be protective effects of modest alcohol consumption on obesity traits and type 2 diabetes. Heavy alcohol consumption could lead to increased measures of obesity as well as increased risk of type 2 diabetes.
Asunto(s)
Diabetes Mellitus Tipo 2 , Masculino , Humanos , Femenino , Diabetes Mellitus Tipo 2/etiología , Diabetes Mellitus Tipo 2/genética , Análisis de la Aleatorización Mendeliana , Consumo de Bebidas Alcohólicas/efectos adversos , Consumo de Bebidas Alcohólicas/epidemiología , Obesidad/epidemiología , Obesidad/genética , Causalidad , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido SimpleRESUMEN
BACKGROUND: Abnormal DNA methylation is thought to contribute to the onset and progression of systemic sclerosis. Currently, the most comprehensive assay for profiling DNA methylation is whole-genome bisulfite sequencing (WGBS), but its precision depends on read depth and it may be subject to sequencing errors. SOMNiBUS, a method for regional analysis, attempts to overcome some of these limitations. Using SOMNiBUS, we re-analyzed WGBS data previously analyzed using bumphunter, an approach that initially fits single CpG associations, to contrast DNA methylation estimates by both methods. METHODS: Purified CD4+ T lymphocytes of 9 SSc and 4 control females were sequenced using WGBS. We separated the resulting sequencing data into regions with dense CpG data, and differentially methylated regions (DMRs) were inferred with the SOMNiBUS region-level test, adjusted for age. Pathway enrichment analysis was performed with ingenuity pathway analysis (IPA). We compared the results obtained by SOMNiBUS and bumphunter. RESULTS: Of 8268 CpG regions of ≥ 60 CpGs eligible for analysis with SOMNiBUS, we identified 131 DMRs and 125 differentially methylated genes (DMGs; p-values less than Bonferroni-corrected threshold of 6.05-06 controlling family-wise error rate at 0.05; 1.6% of the regions). In comparison, bumphunter identified 821,929 CpG regions, 599 DMRs (of which none had ≥ 60 CpGs) and 340 DMGs (q-value of 0.05; 0.04% of all regions). The top ranked gene identified by SOMNiBUS was FLT4, a lymphangiogenic orchestrator, and the top ranked gene on chromosome X was CHST7, known to catalyze the sulfation of glycosaminoglycans in the extracellular matrix. The top networks identified by IPA included connective tissue disorders. CONCLUSIONS: SOMNiBUS is a complementary method of analyzing WGBS data that enhances biological insights into SSc and provides novel avenues of investigation into its pathogenesis.
Asunto(s)
Metilación de ADN , Esclerodermia Sistémica , Femenino , Humanos , Islas de CpG , Secuenciación Completa del Genoma/métodos , Esclerodermia Sistémica/genéticaRESUMEN
Background: Ovarian cancer (OC) is the deadliest gynecological cancer, often diagnosed at advanced stages. A fast and accurate diagnostic method for early-stage OC is needed. The tumor marker gangliosides, GD2 and GD3, exhibit properties that make them ideal potential diagnostic biomarkers, but they have never before been quantified in OC. We investigated the diagnostic utility of GD2 and GD3 for diagnosis of all subtypes and stages of OC. Methods: This retrospective study evaluated GD2 and GD3 expression in biobanked tissue and serum samples from patients with invasive epithelial OC, healthy donors, non-malignant gynecological conditions, and other cancers. GD2 and GD3 levels were evaluated in tissue samples by immunohistochemistry (n=299) and in two cohorts of serum samples by quantitative ELISA. A discovery cohort (n=379) showed feasibility of GD2 and GD3 quantitative ELISA for diagnosing OC, and a subsequent model cohort (n=200) was used to train and cross-validate a diagnostic model. Results: GD2 and GD3 were expressed in tissues of all OC subtypes and FIGO stages but not in surrounding healthy tissue or other controls. In serum, GD2 and GD3 were elevated in patients with OC. A diagnostic model that included serum levels of GD2+GD3+age was superior to the standard of care (CA125, p<0.001) in diagnosing OC and early-stage (I/II) OC. Conclusion: GD2 and GD3 expression was associated with high rates of selectivity and specificity for OC. A diagnostic model combining GD2 and GD3 quantification in serum had diagnostic power for all subtypes and all stages of OC, including early stage. Further research exploring the utility of GD2 and GD3 for diagnosis of OC is warranted.
RESUMEN
Not all familial ovarian cancer (OC) cases are explained by pathogenic germline variants in known risk genes. A candidate gene approach involving DNA repair pathway genes was applied to identify rare recurring pathogenic variants in familial OC cases not associated with known OC risk genes from a population exhibiting genetic drift. Whole exome sequencing (WES) data of 15 OC cases from 13 families tested negative for pathogenic variants in known OC risk genes were investigated for candidate variants in 468 DNA repair pathway genes. Filtering and prioritization criteria were applied to WES data to select top candidates for further analyses. Candidates were genotyped in ancestry defined study groups of 214 familial and 998 sporadic OC or breast cancer (BC) cases and 1025 population-matched controls and screened for additional carriers in 605 population-matched OC cases. The candidate genes were also analyzed in WES data from 937 familial or sporadic OC cases of diverse ancestries. Top candidate variants in ERCC5, EXO1, FANCC, NEIL1 and NTHL1 were identified in 5/13 (39%) OC families. Collectively, candidate variants were identified in 7/435 (1.6%) sporadic OC cases and 1/566 (0.2%) sporadic BC cases versus 1/1025 (0.1%) controls. Additional carriers were identified in 6/605 (0.9%) OC cases. Tumour DNA from ERCC5, NEIL1 and NTHL1 variant carriers exhibited loss of the wild-type allele. Carriers of various candidate variants in these genes were identified in 31/937 (3.3%) OC cases of diverse ancestries versus 0-0.004% in cancer-free controls. The strategy of applying a candidate gene approach in a population exhibiting genetic drift identified new candidate OC predisposition variants in DNA repair pathway genes.
RESUMEN
OBJECTIVE: The primary purpose of this study was to investigate the contribution of genetic predispositions to depression and inflammation, as measured through polygenic risk scores, on symptom burden (physical and psychological) in patients with head and neck cancer in the immediate post-treatment period (i.e., at three months post-diagnosis), as well as on 3-, 6-, 12-, 24- and 36-month survival. METHODS: Prospective longitudinal study of 223 adults (72 % participation) newly diagnosed with a first occurrence of primary head and neck cancer, paired with genetic data (Illumina PsychArray), validated psychometric measures, Structured Clinical Interviews for DSM Disorders (SCID-I), and medical chart reviews. RESULTS: Symptom burden at 3 months was predicted by (R2 adj. = 0.38, p < 0.001): a baseline SCID-I Anxiety Disorder (b = 1.69, B = 0.23, 95%CI = 0.43-2.94; p = 0.009), baseline levels of HADS anxiety (b = 0.20, B = 0.29, 95%CI = 0.07-0.34; p = 0.003), the polygenic risk score (PRS) for depression (b = 0.66, B = 0.18, 95%CI = 0.003-1.32; p = 0.049), and cumulated dose of radiotherapy (b = 0.002, B = 0.46, 95%CI = 0.001-0.003; p < 0.001). When controlling for factors known to be associated with cancer survival, patients with a higher PRS associated with depression and inflammation, respectively, presented higher risk of death within 36 months (b = 1.75, Exp(B) = 5.75, 95%CI = 1.55-21.27, p = 0.009 and b = 0.14, Exp(B) = 1.15, 95%CI = 1.01-1.30, p = 0.03). CONCLUSIONS: Our results outline three potential pathways of symptom burden in patients with head and neck cancer: a genetic predisposition towards depression; an initial anxiety disorder upon being diagnosed with cancer or high levels of anxiety upon diagnosis; and a dose-related response to radiotherapy. One may want to investigate early interventions in these areas to alleviate symptom burden in patients faced with a life-threatening disease, as well as consider targeting genetic predisposition towards depression and inflammation implicated in survival. The high prevalence of distress in patients with head and neck cancer is an opportunity to study genetic predispositions, which could potentially be broadly generalized to other cancers and diseases.
Asunto(s)
Predisposición Genética a la Enfermedad , Neoplasias de Cabeza y Cuello , Adulto , Humanos , Estudios Longitudinales , Predisposición Genética a la Enfermedad/genética , Estudios Prospectivos , Depresión/genética , Depresión/diagnóstico , Ansiedad/genética , Ansiedad/psicología , Neoplasias de Cabeza y Cuello/genética , Inflamación/genéticaRESUMEN
FANCI was recently identified as a new candidate ovarian cancer (OC)-predisposing gene from the genetic analysis of carriers of FANCI c.1813C>T; p.L605F in OC families. Here, we aimed to investigate the molecular genetic characteristics of FANCI, as they have not been described in the context of cancer. We first investigated the germline genetic landscape of two sisters with OC from the discovery FANCI c.1813C>T; p.L605F family (F1528) to re-affirm the plausibility of this candidate. As we did not find other conclusive candidates, we then performed a candidate gene approach to identify other candidate variants in genes involved in the FANCI protein interactome in OC families negative for pathogenic variants in BRCA1, BRCA2, BRIP1, RAD51C, RAD51D, and FANCI, which identified four candidate variants. We then investigated FANCI in high-grade serous ovarian carcinoma (HGSC) from FANCI c.1813C>T carriers and found evidence of loss of the wild-type allele in tumour DNA from some of these cases. The somatic genetic landscape of OC tumours from FANCI c.1813C>T carriers was investigated for mutations in selected genes, copy number alterations, and mutational signatures, which determined that the profiles of tumours from carriers were characteristic of features exhibited by HGSC cases. As other OC-predisposing genes such as BRCA1 and BRCA2 are known to increase the risk of other cancers including breast cancer, we investigated the carrier frequency of germline FANCI c.1813C>T in various cancer types and found overall more carriers among cancer cases compared to cancer-free controls (p = 0.007). In these different tumour types, we also identified a spectrum of somatic variants in FANCI that were not restricted to any specific region within the gene. Collectively, these findings expand on the characteristics described for OC cases carrying FANCI c.1813C>T; p.L605F and suggest the possible involvement of FANCI in other cancer types at the germline and/or somatic level.
Asunto(s)
Proteínas del Grupo de Complementación de la Anemia de Fanconi , Predisposición Genética a la Enfermedad , Neoplasias Ováricas , Femenino , Humanos , Proteínas del Grupo de Complementación de la Anemia de Fanconi/genética , Genes BRCA2 , Biología Molecular , Mutación , Neoplasias Ováricas/genéticaRESUMEN
Metabolic processes can influence disease risk and provide therapeutic targets. By conducting genome-wide association studies of 1,091 blood metabolites and 309 metabolite ratios, we identified associations with 690 metabolites at 248 loci and associations with 143 metabolite ratios at 69 loci. Integrating metabolite-gene and gene expression information identified 94 effector genes for 109 metabolites and 48 metabolite ratios. Using Mendelian randomization (MR), we identified 22 metabolites and 20 metabolite ratios having estimated causal effect on 12 traits and diseases, including orotate for estimated bone mineral density, α-hydroxyisovalerate for body mass index and ergothioneine for inflammatory bowel disease and asthma. We further measured the orotate level in a separate cohort and demonstrated that, consistent with MR, orotate levels were positively associated with incident hip fractures. This study provides a valuable resource describing the genetic architecture of metabolites and delivers insights into their roles in common diseases, thereby offering opportunities for therapeutic targets.
Asunto(s)
Estudio de Asociación del Genoma Completo , Metaboloma , Humanos , Metaboloma/genética , Fenotipo , Densidad Ósea/genética , Genómica , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
BACKGROUND: There is a pressing need for novel drug targets for psychiatric disorders. Circulating proteins are potential candidates because they are relatively easy to measure and modulate and play important roles in signaling. METHODS: We performed two-sample Mendelian randomization analyses to estimate the associations between circulating protein abundances and risk of 10 psychiatric disorders. Genetic variants associated with 1611 circulating protein abundances identified in 6 large-scale proteomic studies were used as genetic instruments. Effects of the circulating proteins on psychiatric disorders were estimated by Wald ratio or inverse variance-weighted ratio tests. Horizontal pleiotropy, colocalization, and protein-altering effects were examined to validate the assumptions of Mendelian randomization. RESULTS: Nine circulating protein-to-disease associations withstood multiple sensitivity analyses. Among them, 2 circulating proteins had associations replicated in 3 proteomic studies. A 1 standard deviation increase in the genetically predicted circulating TIMP4 level was associated with a reduced risk of anorexia nervosa (minimum odds ratio [OR] = 0.83; 95% CI, 0.76-0.91) and bipolar disorder (minimum OR = 0.88; 95% CI, 0.82-0.94). A 1 standard deviation increase in the genetically predicted circulating ESAM level was associated with an increased risk of schizophrenia (maximum OR = 1.32; 95% CI, 1.22-1.43). In addition, 58 suggestive protein-to-disease associations warrant validation with observational or experimental evidence. For instance, a 1 standard deviation increase in the ERLEC1-201-to-ERLEC1-202 splice variant ratio was associated with a reduced risk of schizophrenia (OR = 0.94; 95% CI, 0.90-0.97). CONCLUSIONS: Prioritized circulating proteins appear to influence the risk of psychiatric disease and may be explored as intervention targets.
Asunto(s)
Trastornos Mentales , Esquizofrenia , Humanos , Análisis de la Aleatorización Mendeliana , Estudio de Asociación del Genoma Completo , Proteómica , Trastornos Mentales/genética , Esquizofrenia/genética , Polimorfismo de Nucleótido SimpleRESUMEN
Dementia with Lewy bodies (DLB) is a common form of dementia with known genetic and environmental interactions. However, the underlying epigenetic mechanisms which reflect these gene-environment interactions are poorly studied. Herein, we measure genome-wide DNA methylation profiles of post-mortem brain tissue (Broadmann area 7) from 15 pathologically confirmed DLB brains and compare them with 16 cognitively normal controls using Illumina MethylationEPIC arrays. We identify 17 significantly differentially methylated CpGs (DMCs) and 17 differentially methylated regions (DMRs) between the groups. The DMCs are mainly located at the CpG islands, promoter and first exon regions. Genes associated with the DMCs are linked to "Parkinson's disease" and "metabolic pathway", as well as the diseases of "severe intellectual disability" and "mood disorders". Overall, our study highlights previously unreported DMCs offering insights into DLB pathogenesis with the possibility that some of these could be used as biomarkers of DLB in the future.
Asunto(s)
Enfermedad por Cuerpos de Lewy , Humanos , Enfermedad por Cuerpos de Lewy/genética , Autopsia , Biomarcadores , Encéfalo , Islas de CpGRESUMEN
Genomic risk prediction is on the emerging path toward personalized medicine. However, the accuracy of polygenic prediction varies strongly in different individuals. Based on up to 352,277 European ancestry participants in the UK Biobank, we constructed polygenic risk scores for 15 physiological and biochemical quantitative traits. We identified a total of 185 polygenic prediction variability quantitative trait loci for 11 traits by Levene's test among 254,376 unrelated individuals. We validated the effects of prediction variability quantitative trait loci using an independent test set of 58,927 individuals. For instance, a score aggregating 51 prediction variability quantitative trait locus variants for triglycerides had the strongest Spearman correlation of 0.185 (P-value <1.0 × 10-300) with the squared prediction errors. We found a strong enrichment of complex genetic effects conferred by prediction variability quantitative trait loci compared to risk loci identified in genome-wide association studies, including 89 prediction variability quantitative trait loci exhibiting dominance effects. Incorporation of dominance effects into polygenic risk scores significantly improved polygenic prediction for triglycerides, low-density lipoprotein cholesterol, vitamin D, and platelet. In conclusion, we have discovered and profiled genetic determinants of polygenic prediction variability for 11 quantitative biomarkers. These findings may assist interpretation of genomic risk prediction in various contexts and encourage novel approaches for constructing polygenic risk scores with complex genetic effects.