1.
Genes (Basel) ; 15(5)2024 Apr 28.
Article En | MEDLINE | ID: mdl-38790197

Currently, more than 55 million people around the world suffer from dementia, and Alzheimer's Disease and Related Dementias (ADRD) accounts for nearly 60-70% of all those cases. The spread of Alzheimer's Disease (AD) pathology and progressive neurodegeneration in the hippocampus and cerebral cortex is strongly correlated with cognitive decline in AD patients; however, the molecular underpinning of ADRD's causality is still unclear. Studies of postmortem AD brains and animal models of AD suggest that elevated endoplasmic reticulum (ER) stress may have a role in ADRD pathology through altered neurocellular homeostasis in brain regions associated with learning and memory. To study the ER stress-associated neurocellular response and its effects on neurocellular homeostasis and neurogenesis, we modeled an ER stress challenge using thapsigargin (TG), a specific inhibitor of sarco/endoplasmic reticulum Ca2+ ATPase (SERCA), in the induced pluripotent stem cell (iPSC)-derived neural stem cells (NSCs) of two individuals from our Mexican American Family Study (MAFS). High-content screening and transcriptomic analysis of the control and ER stress-challenged NSCs showed that the NSCs' ER stress response resulted in a significant decline in NSC self-renewal and an increase in apoptosis and cellular oxidative stress. A total of 2300 genes were significantly (moderated t statistics FDR-corrected p-value ≤ 0.05 and fold change absolute ≥ 2.0) differentially expressed (DE). The pathway enrichment and gene network analysis of DE genes suggests that all three unfolded protein response (UPR) pathways, protein kinase RNA-like ER kinase (PERK), activating transcription factor-6 (ATF-6), and inositol-requiring enzyme-1 (IRE1), were significantly activated and cooperatively regulated the NSCs' transcriptional response to ER stress. 
Our results show that IRE1/X-box binding protein 1 (XBP1) mediated transcriptional regulation of the E2F transcription factor 1 (E2F1) gene, and its downstream targets have a dominant role in inducing G1/S-phase cell cycle arrest in ER stress-challenged NSCs. The ER stress-challenged NSCs also showed the activation of C/EBP homologous protein (CHOP)-mediated apoptosis and the dysregulation of synaptic plasticity and neurotransmitter homeostasis-associated genes. Overall, our results suggest that the ER stress-associated attenuation of NSC self-renewal, increased apoptosis, and dysregulated synaptic plasticity and neurotransmitter homeostasis plausibly play a role in the causation of ADRD.
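
The differential-expression criterion quoted in this abstract (FDR-corrected p-value ≤ 0.05 and absolute fold change ≥ 2.0) can be sketched as a small filter. This is a minimal illustration, not the authors' pipeline: it assumes Benjamini-Hochberg as the FDR correction and log2 fold changes as input, neither of which the abstract specifies, and the gene labels are placeholders.

```python
from typing import List, Sequence


def bh_adjust(pvalues: Sequence[float]) -> List[float]:
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so each adjusted value is the
    # minimum of p * m / rank over all ranks at or above the current one.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_de_genes(
    genes: Sequence[str],
    pvalues: Sequence[float],
    log2_fold_changes: Sequence[float],
    fdr_cutoff: float = 0.05,          # threshold quoted in the abstract
    min_abs_fold_change: float = 2.0,  # threshold quoted in the abstract
) -> List[str]:
    """Return genes passing both the FDR and the absolute fold-change cutoffs."""
    adjusted = bh_adjust(pvalues)
    selected = []
    for gene, q, lfc in zip(genes, adjusted, log2_fold_changes):
        abs_fold_change = 2.0 ** abs(lfc)  # log2 ratio -> linear fold change
        if q <= fdr_cutoff and abs_fold_change >= min_abs_fold_change:
            selected.append(gene)
    return selected


# Placeholder input: four hypothetical genes, not data from the study.
print(select_de_genes(
    ["gene_a", "gene_b", "gene_c", "gene_d"],
    [0.001, 0.01, 0.03, 0.5],
    [1.5, -2.0, 0.5, 3.0],
))
```

Applying both cutoffs together keeps only genes that are statistically supported and also change by at least two-fold in either direction.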


Alzheimer Disease , Endoplasmic Reticulum Stress , Humans , Alzheimer Disease/genetics , Alzheimer Disease/metabolism , Alzheimer Disease/pathology , Neural Stem Cells/metabolism , Neural Stem Cells/pathology , Protein Serine-Threonine Kinases/genetics , Protein Serine-Threonine Kinases/metabolism , Endoribonucleases/genetics , Endoribonucleases/metabolism , Induced Pluripotent Stem Cells/metabolism , Thapsigargin/pharmacology , Dementia/genetics , Dementia/metabolism , Dementia/pathology , eIF-2 Kinase/genetics , eIF-2 Kinase/metabolism , Male , Activating Transcription Factor 6/metabolism , Activating Transcription Factor 6/genetics , Neurogenesis , X-Box Binding Protein 1/metabolism , X-Box Binding Protein 1/genetics , Female , Unfolded Protein Response , Transcription Factor CHOP
2.
bioRxiv ; 2024 May 14.
Article En | MEDLINE | ID: mdl-38798596

Reconstructing the DNA of ancestors from their descendants has the potential to empower phenotypic analyses (including association and genetic nurture studies), improve pedigree reconstruction, and shed light on the ancestral population and phenotypes of ancestors. We developed HAPI-RECAP, a method that reconstructs the DNA of parents from full siblings and their relatives. This tool leverages HAPI2's output, a new phasing approach that applies to siblings (and optionally one or both parents) and reliably infers parent haplotypes but does not link the ungenotyped parents' DNA across chromosomes or between segments flanking ambiguities. By combining IBD between the reconstructed parents and the relatives, HAPI-RECAP resolves the source parent of these segments. Moreover, the method exploits crossovers the children inherited and sex-specific genetic maps to infer the reconstructed parents' sexes. We validated these methods on research participants from both 23andMe, Inc. and the San Antonio Mexican American Family Studies. Given data for one parent, HAPI2 reconstructs large fractions of the missing parent's DNA, between 77.6% and 99.97% among all families, and 90.3% on average in three- and four-child families. When reconstructing both parents, HAPI-RECAP inferred between 33.2% and 96.6% of the parents' genotypes, averaging 70.6% in four-child families. Reconstructed genotypes have average error rates < 10⁻³, or comparable to those from direct genotyping. HAPI-RECAP inferred the parent sexes 100% correctly given IBD-linked segments and can also reconstruct parents without any IBD. As datasets grow in size, more families will be implicitly collected; HAPI-RECAP holds promise to enable high quality parent genotype reconstruction.

3.
Front Genet ; 15: 1240462, 2024.
Article En | MEDLINE | ID: mdl-38495670

Background: Socioeconomic Status (SES) is a potent environmental determinant of health. To our knowledge, no assessment of genotype-environment interaction has been conducted to consider the joint effects of socioeconomic status and genetics on risk for metabolic disease. We analyzed data from the Mexican American Family Studies (MAFS) to evaluate the hypothesis that genotype-by-environment interaction (GxE) is an essential determinant of variation in risk factors for metabolic syndrome (MS). Methods: We employed a maximum likelihood estimation of the decomposition of variance components to detect GxE interaction. After excluding individuals with diabetes and individuals on medication for diabetes, hypertension, or dyslipidemia, we analyzed 12 MS risk factors: fasting glucose (FG), fasting insulin (FI), 2-h glucose (2G), 2-h insulin (2I), body mass index (BMI), waist circumference (WC), leptin (LP), high-density lipoprotein-cholesterol (HDL-C), triglycerides (TG), total serum cholesterol (TSC), systolic blood pressure (SBP), and diastolic blood pressure (DBP). Our SES variable used a combined score of Duncan's socioeconomic index and education years. Heterogeneity in the additive genetic variance across the SES continuum and a departure from unity in the genetic correlation coefficient were taken as evidence of GxE interaction. Hypothesis tests were conducted using standard likelihood ratio tests. Results: We found evidence of GxE for fasting glucose, 2-h glucose, 2-h insulin, BMI, and triglycerides. The genetic effects underlying the insulin/glucose metabolism component of MS are upregulated at the lower end of the SES spectrum. We also determined that the household variance for systolic blood pressure decreased with increasing SES. Conclusion: These results show a significant change in the GxE interaction underlying the major components of MS in response to changes in socioeconomic status. 
Further mRNA sequencing studies will identify genes and canonical gene pathways to support our molecular-level hypotheses.

4.
Cells ; 13(5)2024 Feb 21.
Article En | MEDLINE | ID: mdl-38474333

A large portion of the heterogeneity in coronavirus disease 2019 (COVID-19) susceptibility and severity of illness (SOI) remains poorly understood. Recent evidence suggests that SARS-CoV-2 infection-associated damage to alveolar epithelial type 2 cells (AT2s) in the distal lung may directly contribute to disease severity and poor prognosis in COVID-19 patients. Our in vitro modeling of SARS-CoV-2 infection in induced pluripotent stem cell (iPSC)-derived AT2s from 10 different individuals showed interindividual variability in infection susceptibility and the postinfection cellular viral load. To understand the underlying mechanism of the AT2's capacity to regulate SARS-CoV-2 infection and cellular viral load, a genome-wide differential gene expression analysis between the mock and SARS-CoV-2 infection-challenged AT2s was performed. The 1393 genes, which were significantly (one-way ANOVA FDR-corrected p ≤ 0.05; FC abs ≥ 2.0) differentially expressed (DE), suggest significant upregulation of viral infection-related cellular innate immune response pathways (p-value ≤ 0.05; activation z-score ≥ 3.5), and significant downregulation of the cholesterol- and xenobiotic-related metabolic pathways (p-value ≤ 0.05; activation z-score ≤ -3.5). Whilst the effect of post-SARS-CoV-2 infection response on the infection susceptibility and postinfection viral load in AT2s is not clear, interestingly, pre-infection (mock-challenged) expression of 238 DE genes showed a high correlation with the postinfection SARS-CoV-2 viral load (FDR-corrected p-value ≤ 0.05 and r²-absolute ≥ 0.57). The 85 genes whose expression was negatively correlated with the viral load showed significant enrichment in viral recognition and cytokine-mediated innate immune GO biological processes (p-value range: 4.65 × 10⁻¹⁰ to 2.24 × 10⁻⁶).
The 153 genes whose expression was positively correlated with the viral load showed significant enrichment in cholesterol homeostasis, extracellular matrix, and MAPK/ERK pathway-related GO biological processes (p-value range: 5.06 × 10⁻⁵ to 6.53 × 10⁻⁴). Overall, our results strongly suggest that AT2s' pre-infection innate immunity and metabolic state affect their susceptibility to SARS-CoV-2 infection and viral load.
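
The split of correlated genes into negatively and positively associated groups described in this abstract can be illustrated with a short sketch. It assumes a Pearson correlation between each gene's pre-infection expression and the postinfection viral load across individuals; the abstract does not name the correlation type, and the FDR step it mentions is omitted here. All gene labels and values are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / math.sqrt(sxx * syy)


def split_by_correlation(
    expression_by_gene: Dict[str, Sequence[float]],
    viral_load: Sequence[float],
    min_r_squared: float = 0.57,  # cutoff quoted in the abstract
) -> Tuple[List[str], List[str]]:
    """Return (negatively, positively) correlated genes with r^2 >= cutoff."""
    negative, positive = [], []
    for gene, expression in expression_by_gene.items():
        r = pearson_r(expression, viral_load)
        if r * r >= min_r_squared:
            (negative if r < 0 else positive).append(gene)
    return negative, positive


# Hypothetical values for five individuals, not data from the study.
viral_load = [1.0, 2.0, 3.0, 4.0, 5.0]
expression = {
    "gene_up": [2.0, 4.0, 6.0, 8.0, 10.0],
    "gene_down": [5.0, 4.0, 3.0, 2.0, 1.0],
    "gene_unrelated": [1.0, 3.0, 2.0, 3.0, 1.0],
}
print(split_by_correlation(expression, viral_load))
```

Thresholding on r² while keeping the sign of r separately is what allows one cutoff to serve both the negatively and the positively correlated gene sets.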


COVID-19 , Induced Pluripotent Stem Cells , Humans , SARS-CoV-2 , Viral Load , Immunity, Innate , Cholesterol
5.
Nat Commun ; 15(1): 1540, 2024 Feb 20.
Article En | MEDLINE | ID: mdl-38378775

Recent advancements in plasma lipidomic profiling methodology have significantly increased specificity and accuracy of lipid measurements. This evolution, driven by improved chromatographic and mass spectrometric resolution of newer platforms, has made it challenging to align datasets created at different times, or on different platforms. Here we present a framework for harmonising such plasma lipidomic datasets with different levels of granularity in their lipid measurements. Our method utilises elastic-net prediction models, constructed from high-resolution lipidomics reference datasets, to predict unmeasured lipid species in lower-resolution studies. The approach involves (1) constructing composite lipid measures in the reference dataset that map to less resolved lipids in the target dataset, (2) addressing discrepancies between aligned lipid species, (3) generating prediction models, (4) assessing their transferability into the target dataset, and (5) evaluating their prediction accuracy. To demonstrate our approach, we used the AusDiab population-based cohort (747 lipid species) as the reference to impute unmeasured lipid species into the LIPID study (342 lipid species). Furthermore, we compared measured and imputed lipids in terms of parameter estimation and predictive performance, and validated imputations in an independent study. Our method for harmonising plasma lipidomic datasets will facilitate model validation and data integration efforts.


Lipidomics , Plasma , Humans , Mass Spectrometry , Lipids
6.
Blood ; 143(18): 1845-1855, 2024 May 02.
Article En | MEDLINE | ID: mdl-38320121

ABSTRACT: Coagulation factor VIII (FVIII) and its carrier protein von Willebrand factor (VWF) are critical to coagulation and platelet aggregation. We leveraged whole-genome sequence data from the Trans-Omics for Precision Medicine (TOPMed) program along with TOPMed-based imputation of genotypes in additional samples to identify genetic associations with circulating FVIII and VWF levels in a single-variant meta-analysis, including up to 45 289 participants. Gene-based aggregate tests were implemented in TOPMed. We identified 3 candidate causal genes and tested their functional effect on FVIII release from human liver endothelial cells (HLECs) and VWF release from human umbilical vein endothelial cells. Mendelian randomization was also performed to provide evidence for causal associations of FVIII and VWF with thrombotic outcomes. We identified associations (P < 5 × 10⁻⁹) at 7 new loci for FVIII (ST3GAL4, CLEC4M, B3GNT2, ASGR1, F12, KNG1, and TREM1/NCR2) and 1 for VWF (B3GNT2). VWF, ABO, and STAB2 were associated with FVIII and VWF in gene-based analyses. Multiphenotype analysis of FVIII and VWF identified another 3 new loci, including PDIA3. Silencing of B3GNT2 and the previously reported CD36 gene decreased release of FVIII by HLECs, whereas silencing of B3GNT2, CD36, and PDIA3 decreased release of VWF by HVECs. Mendelian randomization supports causal association of higher FVIII and VWF with increased risk of thrombotic outcomes. Seven new loci were identified for FVIII and 1 for VWF, with evidence supporting causal associations of FVIII and VWF with thrombotic outcomes. B3GNT2, CD36, and PDIA3 modulate the release of FVIII and/or VWF in vitro.


Cell Adhesion Molecules , Factor VIII , Kininogens , Lectins, C-Type , Receptors, Cell Surface , von Willebrand Factor , Humans , von Willebrand Factor/genetics , von Willebrand Factor/metabolism , Factor VIII/genetics , Factor VIII/metabolism , Polymorphism, Single Nucleotide , Human Umbilical Vein Endothelial Cells/metabolism , Mendelian Randomization Analysis , Genome-Wide Association Study , Thrombosis/genetics , Thrombosis/blood , Genetic Association Studies , Male , Endothelial Cells/metabolism , Female
7.
bioRxiv ; 2023 Nov 02.
Article En | MEDLINE | ID: mdl-37961350

Large-scale whole-genome sequencing (WGS) studies have improved our understanding of the contributions of coding and noncoding rare variants to complex human traits. Leveraging association effect sizes across multiple traits in WGS rare variant association analysis can improve statistical power over single-trait analysis, and also detect pleiotropic genes and regions. Existing multi-trait methods have limited ability to perform rare variant analysis of large-scale WGS data. We propose MultiSTAAR, a statistical framework and computationally-scalable analytical pipeline for functionally-informed multi-trait rare variant analysis in large-scale WGS studies. MultiSTAAR accounts for relatedness, population structure and correlation among phenotypes by jointly analyzing multiple traits, and further empowers rare variant association analysis by incorporating multiple functional annotations. We applied MultiSTAAR to jointly analyze three lipid traits (low-density lipoprotein cholesterol, high-density lipoprotein cholesterol and triglycerides) in 61,861 multi-ethnic samples from the Trans-Omics for Precision Medicine (TOPMed) Program. We discovered new associations with lipid traits missed by single-trait analysis, including rare variants within an enhancer of NIPSNAP3A and an intergenic region on chromosome 1.

8.
Circ Genom Precis Med ; 16(6): e004176, 2023 Dec.
Article En | MEDLINE | ID: mdl-38014529

BACKGROUND: Individuals with type 2 diabetes (T2D) have an increased risk of coronary artery disease (CAD), but questions remain about the underlying pathology. Identifying which CAD loci are modified by T2D in the development of subclinical atherosclerosis (coronary artery calcification [CAC], carotid intima-media thickness, or carotid plaque) may improve our understanding of the mechanisms leading to the increased CAD in T2D. METHODS: We compared the common and rare variant associations of known CAD loci from the literature on CAC, carotid intima-media thickness, and carotid plaque in up to 29 670 participants, including up to 24 157 normoglycemic controls and 5513 T2D cases leveraging whole-genome sequencing data from the Trans-Omics for Precision Medicine program. We included first-order T2D interaction terms in each model to determine whether CAD loci were modified by T2D. The genetic main and interaction effects were assessed using a joint test to determine whether a CAD variant, or gene-based rare variant set, was associated with the respective subclinical atherosclerosis measures and then further determined whether these loci had a significant interaction test. RESULTS: Using a Bonferroni-corrected significance threshold of P < 1.6 × 10⁻⁴, we identified 3 genes (ATP1B1, ARVCF, and LIPG) associated with CAC and 2 genes (ABCG8 and EIF2B2) associated with carotid intima-media thickness and carotid plaque, respectively, through gene-based rare variant set analysis. Both ATP1B1 and ARVCF also had significantly different associations for CAC in T2D cases versus controls. No significant interaction tests were identified through the candidate single-variant analysis. CONCLUSIONS: These results highlight T2D as an important modifier of rare variant associations in CAD loci with CAC.


Atherosclerosis , Coronary Artery Disease , Diabetes Mellitus, Type 2 , Plaque, Atherosclerotic , Humans , Coronary Artery Disease/genetics , Diabetes Mellitus, Type 2/complications , Diabetes Mellitus, Type 2/genetics , Carotid Intima-Media Thickness , Risk Factors , Atherosclerosis/genetics , Genomics
9.
Am J Hum Genet ; 110(10): 1704-1717, 2023 10 05.
Article En | MEDLINE | ID: mdl-37802043

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions in lipid metabolism. Large-scale whole-genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess more associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with measurement of blood lipids and lipoproteins (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare-variant aggregate association tests using the STAAR (variant-set test for association using annotation information) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare-coding variants in nearby protein-coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500-kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variation and rare protein-coding variation at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNAs.
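
The locus definition quoted in this abstract (a ±500-kb window around an index variant) can be sketched as an interval check. This is an illustrative reading only: it assumes a variant set counts as lying in a known locus when its span overlaps the window on the same chromosome, and the coordinates shown are hypothetical rather than real index variants.

```python
from typing import Iterable, Tuple


def overlaps_known_locus(
    chrom: str,
    start: int,
    end: int,
    index_variants: Iterable[Tuple[str, int]],
    window_bp: int = 500_000,  # ±500 kb, as quoted in the abstract
) -> bool:
    """True if [start, end] overlaps a ±window_bp window around any index variant."""
    if start > end:
        raise ValueError("start must not exceed end")
    for index_chrom, index_pos in index_variants:
        if index_chrom != chrom:
            continue  # windows never span chromosomes
        # Closed-interval overlap test against [index_pos - w, index_pos + w].
        if start <= index_pos + window_bp and end >= index_pos - window_bp:
            return True
    return False


# Hypothetical index variant on chromosome 1 at position 1,000,000.
index_variants = [("1", 1_000_000)]
print(overlaps_known_locus("1", 1_400_000, 1_450_000, index_variants))  # inside the window
print(overlaps_known_locus("1", 1_600_000, 1_700_000, index_variants))  # beyond the window
```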


RNA, Long Noncoding , Humans , RNA, Long Noncoding/genetics , Genome-Wide Association Study , Precision Medicine , Whole Genome Sequencing/methods , Lipids/genetics , Polymorphism, Single Nucleotide/genetics
10.
Front Genet ; 14: 1132110, 2023.
Article En | MEDLINE | ID: mdl-37795246

Background: Socioeconomic status (SES) is a potent environmental determinant of health. To our knowledge, no assessment of genotype-environment interaction has been conducted to consider the joint effects of socioeconomic status and genetics on risk for cardiovascular disease (CVD). We analyzed Mexican American Family Studies (MAFS) data to evaluate the hypothesis that genotype-by-environment interaction (GxE) is an important determinant of variation in CVD risk factors. Methods: We employed a linear mixed model to investigate GxE in Mexican American extended families. We studied two proxies for CVD [Pooled Cohort Equation Risk Scores/Framingham Risk Scores (FRS/PCRS) and carotid artery intima-media thickness (CA-IMT)] in relation to socioeconomic status as determined by Duncan's Socioeconomic Index (SEI), years of education, and household income. Results: We calculated heritability for FRS/PCRS and carotid artery intima-media thickness. There was evidence of GxE due to additive genetic variance heterogeneity and genetic correlation for FRS, PCRS, and CA-IMT measures for education (environment) but not for household income or SEI. Conclusion: The genetic effects underlying CVD are dynamically modulated at the lower end of the SES spectrum. There is a significant change in the genetic architecture underlying the major components of CVD in response to changes in education.

11.
medRxiv ; 2023 Aug 16.
Article En | MEDLINE | ID: mdl-37645892

Background: The CCL2/CCR2 axis governs monocyte trafficking and recruitment to atherosclerotic lesions. Human genetic analyses and population-based studies support an association between circulating CCL2 levels and atherosclerosis. Still, it remains unknown whether pharmacological targeting of CCR2, the main CCL2 receptor, would provide protection against human atherosclerotic disease. Methods: In whole-exome sequencing data from 454,775 UK Biobank participants (40-69 years), we identified predicted loss-of-function (LoF) or damaging missense (REVEL score >0.5) variants within the CCR2 gene. We prioritized variants associated with lower monocyte count (p<0.05) and tested associations with vascular risk factors and risk of atherosclerotic disease over a mean follow-up of 14 years. The results were replicated in a pooled cohort of three independent datasets (TOPMed, deCODE and Penn Medicine BioBank; total n=441,445) and the effect of the most frequent damaging variant was experimentally validated. Results: A total of 45 predicted LoF or damaging missense variants were identified in the CCR2 gene, 4 of which were also significantly associated with lower monocyte count, but not with other white blood cell counts. Heterozygous carriers of these variants were at a lower risk of a combined atherosclerosis outcome, showed a lower burden of atherosclerosis across four vascular beds, and were at a lower lifetime risk of coronary artery disease and myocardial infarction. There was no evidence of association with vascular risk factors including LDL-cholesterol, blood pressure, glycemic status, or C-reactive protein. Using a cAMP assay, we found that cells transfected with the most frequent CCR2 damaging variant (3:46358273:T:A, M249K, 547 carriers, frequency: 0.14%) show a decrease in signaling in response to CCL2. 
The associations of the M249K variant with myocardial infarction were consistent across cohorts (OR_UKB: 0.62, 95% CI: 0.39-0.96; OR_external: 0.64, 95% CI: 0.34-1.19; OR_pooled: 0.64, 95% CI: 0.45-0.90). In a phenome-wide association study, we found no evidence for higher risk of common infections or mortality among carriers of damaging CCR2 variants. Conclusions: Heterozygous carriers of damaging CCR2 variants have a lower burden of atherosclerosis and lower lifetime risk of myocardial infarction. In conjunction with previous evidence from experimental and epidemiological studies, our findings highlight the translational potential of CCR2-targeting as an atheroprotective approach.

12.
medRxiv ; 2023 Jun 29.
Article En | MEDLINE | ID: mdl-37425772

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions. Large-scale whole genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess the associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with blood lipid levels (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare variant aggregate association tests using the STAAR (variant-Set Test for Association using Annotation infoRmation) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare coding variants in nearby protein coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500 kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variations and rare protein coding variations at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNA, implicating new therapeutic opportunities.

13.
Front Neurol ; 14: 1071766, 2023.
Article En | MEDLINE | ID: mdl-36970519

Introduction: The cocktail-party problem refers to the difficulty listeners face when trying to attend to relevant sounds that are mixed with irrelevant ones. Previous studies have shown that solving these problems relies on perceptual as well as cognitive processes. Previously, we showed that speech-reception thresholds (SRTs) on a cocktail-party listening task were influenced by genetic factors. Here, we estimated the degree to which these genetic factors overlapped with those influencing cognitive abilities. Methods: We measured SRTs and hearing thresholds (HTs) in 493 listeners, who ranged in age from 18 to 91 years old. The same individuals completed a cognitive test battery comprising 18 measures of various cognitive domains. Individuals belonged to large extended pedigrees, which allowed us to use variance component models to estimate the narrow-sense heritability of each trait, followed by phenotypic and genetic correlations between pairs of traits. Results: All traits were heritable. The phenotypic and genetic correlations between SRTs and HTs were modest, and only the phenotypic correlation was significant. By contrast, all genetic SRT-cognition correlations were strong and significantly different from 0. For some of these genetic correlations, the hypothesis of complete pleiotropy could not be rejected. Discussion: Overall, the results suggest that there was substantial genetic overlap between SRTs and a wide range of cognitive abilities, including abilities without a major auditory or verbal component. The findings highlight the important, yet sometimes overlooked, contribution of higher-order processes to solving the cocktail-party problem, raising an important caveat for future studies aiming to identify specific genetic factors that influence cocktail-party listening.

14.
Neurology ; 100(18): e1930-e1943, 2023 05 02.
Article En | MEDLINE | ID: mdl-36927883

BACKGROUND AND OBJECTIVES: Previous studies suggest that lower mitochondrial DNA (mtDNA) copy number (CN) is associated with neurodegenerative diseases. However, whether mtDNA CN in whole blood is related to endophenotypes of Alzheimer disease (AD) and AD-related dementia (AD/ADRD) needs further investigation. We assessed the association of mtDNA CN with cognitive function and MRI measures in community-based samples of middle-aged to older adults. METHODS: We included dementia-free participants from 9 diverse community-based cohorts with whole-genome sequencing in the Trans-Omics for Precision Medicine (TOPMed) program. Circulating mtDNA CN was estimated as twice the ratio of the average coverage of mtDNA to nuclear DNA. Brain MRI markers included total brain, hippocampal, and white matter hyperintensity volumes. General cognitive function was derived from distinct cognitive domains. We performed cohort-specific association analyses of mtDNA CN with AD/ADRD endophenotypes assessed within ±5 years (i.e., cross-sectional analyses) or 5-20 years after blood draw (i.e., prospective analyses) adjusting for potential confounders. We further explored associations stratified by sex and age (<60 vs ≥60 years). Fixed-effects or sample size-weighted meta-analyses were performed to combine results. Finally, we performed mendelian randomization (MR) analyses to assess causality. RESULTS: We included up to 19,152 participants (mean age 59 years, 57% women). Higher mtDNA CN was cross-sectionally associated with better general cognitive function (β = 0.04; 95% CI 0.02-0.06) independent of age, sex, batch effects, race/ethnicity, time between blood draw and cognitive evaluation, cohort-specific variables, and education. Additional adjustment for blood cell counts or cardiometabolic traits led to slightly attenuated results. We observed similar significant associations with cognition in prospective analyses, although of reduced magnitude.
We found no significant associations between mtDNA CN and brain MRI measures in meta-analyses. MR analyses did not reveal a causal relation between mtDNA CN in blood and cognition. DISCUSSION: Higher mtDNA CN in blood is associated with better current and future general cognitive function in large and diverse communities across the United States. Although MR analyses did not support a causal role, additional research is needed to assess causality. Circulating mtDNA CN could serve nevertheless as a biomarker of current and future cognitive function in the community.
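
The copy-number estimate stated in the METHODS of this abstract (twice the ratio of average mtDNA coverage to average nuclear DNA coverage) is simple enough to write out. The sketch below follows that sentence directly; the function name and the example coverage values are illustrative assumptions, and the factor of two reflects the two copies of the nuclear genome per diploid cell.

```python
def mtdna_copy_number(mean_mt_coverage: float, mean_nuclear_coverage: float) -> float:
    """Estimate mtDNA copies per cell as 2 * (mtDNA coverage / nuclear coverage)."""
    if mean_nuclear_coverage <= 0:
        raise ValueError("nuclear coverage must be positive")
    if mean_mt_coverage < 0:
        raise ValueError("mtDNA coverage must be non-negative")
    return 2.0 * mean_mt_coverage / mean_nuclear_coverage


# Hypothetical sequencing depths: 3000x over mtDNA, 30x over the nuclear genome.
print(mtdna_copy_number(3000.0, 30.0))  # 200.0
```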


Alzheimer Disease , DNA, Mitochondrial , Middle Aged , Humans , Female , Aged , Male , DNA, Mitochondrial/genetics , DNA Copy Number Variations , Prospective Studies , Cross-Sectional Studies , Magnetic Resonance Imaging , Cognition , Brain
15.
bioRxiv ; 2023 Jan 25.
Article En | MEDLINE | ID: mdl-36747810

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and various important cardio-metabolic and hematologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

16.
Res Sq ; 2023 Feb 03.
Article En | MEDLINE | ID: mdl-36778386

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and various important cardio-metabolic and hematologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

17.
Nat Genet ; 55(2): 291-300, 2023 02.
Article En | MEDLINE | ID: mdl-36702996

Most transcriptome-wide association studies (TWASs) so far focus on European ancestry and lack diversity. To overcome this limitation, we aggregated genome-wide association study (GWAS) summary statistics, whole-genome sequences and expression quantitative trait locus (eQTL) data from diverse ancestries. We developed a new approach, TESLA (multi-ancestry integrative study using an optimal linear combination of association statistics), to integrate an eQTL dataset with a multi-ancestry GWAS. By exploiting shared phenotypic effects between ancestries and accommodating potential effect heterogeneities, TESLA improves power over other TWAS methods. When applied to tobacco use phenotypes, TESLA identified 273 new genes, up to 55% more compared with alternative TWAS methods. These hits and subsequent fine mapping using TESLA point to target genes with biological relevance. In silico drug-repurposing analyses highlight several drugs with known efficacy, including dextromethorphan and galantamine, and new drugs such as muscle relaxants that may be repurposed for treating nicotine addiction.


Drug Repositioning , Transcriptome , Humans , Transcriptome/genetics , Genome-Wide Association Study/methods , Tobacco Use , Biology , Polymorphism, Single Nucleotide/genetics , Genetic Predisposition to Disease
18.
Nat Genet ; 55(1): 154-164, 2023 01.
Article En | MEDLINE | ID: mdl-36564505

Meta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples.


Genome-Wide Association Study , Lipids , Genome-Wide Association Study/methods , Whole Genome Sequencing/methods , Exome Sequencing , Phenotype , Lipids/genetics
19.
Nat Commun ; 13(1): 7592, 2022 12 08.
Article En | MEDLINE | ID: mdl-36481753

Genome-wide association studies have identified thousands of single nucleotide variants and small indels that contribute to variation in hematologic traits. While structural variants are known to cause rare blood or hematopoietic disorders, the genome-wide contribution of structural variants to quantitative blood cell trait variation is unknown. Here we utilized whole genome sequencing data in ancestrally diverse participants of the NHLBI Trans Omics for Precision Medicine program (N = 50,675) to detect structural variants associated with hematologic traits. Using single variant tests, we assessed the association of common and rare structural variants with red cell-, white cell-, and platelet-related quantitative traits and observed 21 independent signals (12 common and 9 rare) reaching genome-wide significance. The majority of these associations (N = 18) replicated in independent datasets. In genome-editing experiments, we provide evidence that a deletion associated with lower monocyte counts leads to disruption of an S1PR3 monocyte enhancer and decreased S1PR3 expression.


Blood Cells , Genome-Wide Association Study , Humans , Whole Genome Sequencing
20.
Nature ; 612(7941): 720-724, 2022 12.
Article En | MEDLINE | ID: mdl-36477530

Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury [1-4]. These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries [5]. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.


Alcohol Drinking , Genetic Predisposition to Disease , Genetic Variation , Internationality , Multifactorial Inheritance , Tobacco Use , Humans , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Genome-Wide Association Study/methods , Multifactorial Inheritance/genetics , Risk Factors , Tobacco Use/genetics , Alcohol Drinking/genetics , Transcriptome , Sample Size , Genetic Loci/genetics , Europe/ethnology
...