Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 343
Filtrar
1.
Diabetes ; 2024 Feb 23.
Artigo em Inglês | MEDLINE | ID: mdl-38394643

RESUMO

Diabetic kidney disease (DKD) is the leading cause of end-stage kidney disease. As many genes associate with DKD, multi-omics approaches were employed to narrow the list of functional genes, gene products and related pathways providing insights into the pathophysiological mechanisms of DKD. The Kidney Precision Medicine Project human kidney single-cell RNA-sequencing (scRNAseq) dataset and Mendeley Data on human kidney cortex biopsy proteomics were utilized. R package Seurat was used to analyze scRNAseq and subset proximal tubule cells. PathfindR was applied for pathway analysis in cell type-specific differentially expressed genes and R limma package was used to analyze differential protein expression in kidney cortex. A total of 790 differentially expressed genes were identified in proximal tubule cells, including 530 upregulated and 260 downregulated transcripts. Compared with differentially expressed proteins, 24 genes/proteins were in common. An integrated analysis combining protein quantitative trait loci (pQTL), GWAS hits (estimated glomerular filtration rate) and a plasma metabolomics analysis was performed using baseline metabolites predictive of DKD progression in our longitudinal Diabetes Heart Study samples. Aldo-keto reductase family 1 member A1 gene (AKR1A1) was revealed as a potential molecular hub for DKD cellular dysfunction in several cross-linked pathways featured by deficiency of this enzyme.

2.
bioRxiv ; 2023 Nov 02.
Artigo em Inglês | MEDLINE | ID: mdl-37961350

RESUMO

Large-scale whole-genome sequencing (WGS) studies have improved our understanding of the contributions of coding and noncoding rare variants to complex human traits. Leveraging association effect sizes across multiple traits in WGS rare variant association analysis can improve statistical power over single-trait analysis, and also detect pleiotropic genes and regions. Existing multi-trait methods have limited ability to perform rare variant analysis of large-scale WGS data. We propose MultiSTAAR, a statistical framework and computationally-scalable analytical pipeline for functionally-informed multi-trait rare variant analysis in large-scale WGS studies. MultiSTAAR accounts for relatedness, population structure and correlation among phenotypes by jointly analyzing multiple traits, and further empowers rare variant association analysis by incorporating multiple functional annotations. We applied MultiSTAAR to jointly analyze three lipid traits (low-density lipoprotein cholesterol, high-density lipoprotein cholesterol and triglycerides) in 61,861 multi-ethnic samples from the Trans-Omics for Precision Medicine (TOPMed) Program. We discovered new associations with lipid traits missed by single-trait analysis, including rare variants within an enhancer of NIPSNAP3A and an intergenic region on chromosome 1.

3.
Circ Genom Precis Med ; 16(6): e004176, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38014529

RESUMO

BACKGROUND: Individuals with type 2 diabetes (T2D) have an increased risk of coronary artery disease (CAD), but questions remain about the underlying pathology. Identifying which CAD loci are modified by T2D in the development of subclinical atherosclerosis (coronary artery calcification [CAC], carotid intima-media thickness, or carotid plaque) may improve our understanding of the mechanisms leading to the increased CAD in T2D. METHODS: We compared the common and rare variant associations of known CAD loci from the literature on CAC, carotid intima-media thickness, and carotid plaque in up to 29 670 participants, including up to 24 157 normoglycemic controls and 5513 T2D cases leveraging whole-genome sequencing data from the Trans-Omics for Precision Medicine program. We included first-order T2D interaction terms in each model to determine whether CAD loci were modified by T2D. The genetic main and interaction effects were assessed using a joint test to determine whether a CAD variant, or gene-based rare variant set, was associated with the respective subclinical atherosclerosis measures and then further determined whether these loci had a significant interaction test. RESULTS: Using a Bonferroni-corrected significance threshold of P<1.6×10-4, we identified 3 genes (ATP1B1, ARVCF, and LIPG) associated with CAC and 2 genes (ABCG8 and EIF2B2) associated with carotid intima-media thickness and carotid plaque, respectively, through gene-based rare variant set analysis. Both ATP1B1 and ARVCF also had significantly different associations for CAC in T2D cases versus controls. No significant interaction tests were identified through the candidate single-variant analysis. CONCLUSIONS: These results highlight T2D as an important modifier of rare variant associations in CAD loci with CAC.


Assuntos
Aterosclerose , Doença da Artéria Coronariana , Diabetes Mellitus Tipo 2 , Placa Aterosclerótica , Humanos , Doença da Artéria Coronariana/genética , Diabetes Mellitus Tipo 2/complicações , Diabetes Mellitus Tipo 2/genética , Espessura Intima-Media Carotídea , Fatores de Risco , Aterosclerose/genética , Genômica
4.
Am J Hum Genet ; 110(10): 1704-1717, 2023 10 05.
Artigo em Inglês | MEDLINE | ID: mdl-37802043

RESUMO

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions in lipid metabolism. Large-scale whole-genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess more associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with measurement of blood lipids and lipoproteins (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare-variant aggregate association tests using the STAAR (variant-set test for association using annotation information) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare-coding variants in nearby protein-coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500-kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variation and rare protein-coding variation at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNAs.


Assuntos
RNA Longo não Codificante , Humanos , RNA Longo não Codificante/genética , Estudo de Associação Genômica Ampla , Medicina de Precisão , Sequenciamento Completo do Genoma/métodos , Lipídeos/genética , Polimorfismo de Nucleotídeo Único/genética
5.
medRxiv ; 2023 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-37662265

RESUMO

Obesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data from European individuals. This study leveraged whole-genome sequencing (WGS) data from 88,873 participants from the Trans-Omics for Precision Medicine (TOPMed) Program, of which 51% were of non-European population groups. We discovered 18 BMI-associated signals (P < 5 × 10-9). Notably, we identified and replicated a novel low frequency single nucleotide polymorphism (SNP) in MTMR3 that was common in individuals of African descent. Using a diverse study population, we further identified two novel secondary signals in known BMI loci and pinpointed two likely causal variants in the POC5 and DMD loci. Our work demonstrates the benefits of combining WGS and diverse cohorts in expanding current catalog of variants and genes confer risk for obesity, bringing us one step closer to personalized medicine.

6.
Nat Genet ; 55(10): 1651-1664, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37770635

RESUMO

Coronary artery calcification (CAC), a measure of subclinical atherosclerosis, predicts future symptomatic coronary artery disease (CAD). Identifying genetic risk factors for CAC may point to new therapeutic avenues for prevention. Currently, there are only four known risk loci for CAC identified from genome-wide association studies (GWAS) in the general population. Here we conducted the largest multi-ancestry GWAS meta-analysis of CAC to date, which comprised 26,909 individuals of European ancestry and 8,867 individuals of African ancestry. We identified 11 independent risk loci, of which eight were new for CAC and five had not been reported for CAD. These new CAC loci are related to bone mineralization, phosphate catabolism and hormone metabolic pathways. Several new loci harbor candidate causal genes supported by multiple lines of functional evidence and are regulators of smooth muscle cell-mediated calcification ex vivo and in vitro. Together, these findings help refine the genetic architecture of CAC and extend our understanding of the biological and potential druggable pathways underlying CAC.


Assuntos
Aterosclerose , Doença da Artéria Coronariana , Humanos , Aterosclerose/genética , População Negra/genética , Doença da Artéria Coronariana/genética , Estudo de Associação Genômica Ampla , Fatores de Risco , População Europeia/genética
7.
Nat Genet ; 55(10): 1640-1650, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37709864

RESUMO

Nonalcoholic fatty liver disease (NAFLD) is common and partially heritable and has no effective treatments. We carried out a genome-wide association study (GWAS) meta-analysis of imaging (n = 66,814) and diagnostic code (3,584 cases versus 621,081 controls) measured NAFLD across diverse ancestries. We identified NAFLD-associated variants at torsin family 1 member B (TOR1B), fat mass and obesity associated (FTO), cordon-bleu WH2 repeat protein like 1 (COBLL1)/growth factor receptor-bound protein 14 (GRB14), insulin receptor (INSR), sterol regulatory element-binding transcription factor 1 (SREBF1) and patatin-like phospholipase domain-containing protein 2 (PNPLA2), as well as validated NAFLD-associated variants at patatin-like phospholipase domain-containing protein 3 (PNPLA3), transmembrane 6 superfamily 2 (TM6SF2), apolipoprotein E (APOE), glucokinase regulator (GCKR), tribbles homolog 1 (TRIB1), glycerol-3-phosphate acyltransferase (GPAM), mitochondrial amidoxime-reducing component 1 (MARC1), microsomal triglyceride transfer protein large subunit (MTTP), alcohol dehydrogenase 1B (ADH1B), transmembrane channel like 4 (TMC4)/membrane-bound O-acyltransferase domain containing 7 (MBOAT7) and receptor-type tyrosine-protein phosphatase δ (PTPRD). Implicated genes highlight mitochondrial, cholesterol and de novo lipogenesis as causally contributing to NAFLD predisposition. Phenome-wide association study (PheWAS) analyses suggest at least seven subtypes of NAFLD. Individuals in the top 10% and 1% of genetic risk have a 2.5-fold to 6-fold increased risk of NAFLD, cirrhosis and hepatocellular carcinoma. These genetic variants identify subtypes of NAFLD, improve estimates of disease risk and can guide the development of targeted therapeutics.


Assuntos
Hepatopatia Gordurosa não Alcoólica , Humanos , Hepatopatia Gordurosa não Alcoólica/genética , Hepatopatia Gordurosa não Alcoólica/complicações , Hepatopatia Gordurosa não Alcoólica/metabolismo , Estudo de Associação Genômica Ampla , Cirrose Hepática/genética , Aciltransferases/genética , Aciltransferases/metabolismo , Fosfolipases/genética , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Fígado/metabolismo , Proteínas Serina-Treonina Quinases/genética , Peptídeos e Proteínas de Sinalização Intracelular/genética , Dioxigenase FTO Dependente de alfa-Cetoglutarato/genética , Dioxigenase FTO Dependente de alfa-Cetoglutarato/metabolismo
8.
Metabolomics ; 19(8): 72, 2023 08 09.
Artigo em Inglês | MEDLINE | ID: mdl-37558891

RESUMO

CONTEXT: Insulin resistance is associated with multiple complex diseases; however, precise measures of insulin resistance are invasive, expensive, and time-consuming. OBJECTIVE: Develop estimation models for measures of insulin resistance, including insulin sensitivity index (SI) and homeostatic model assessment of insulin resistance (HOMA-IR) from metabolomics data. DESIGN: Insulin Resistance Atherosclerosis Family Study (IRASFS). SETTING: Community based. PARTICIPANTS: Mexican Americans (MA) and African Americans (AA). MAIN OUTCOME: Estimation models for measures of insulin resistance, i.e. SI and HOMA-IR. RESULTS: Least Absolute Shrinkage and Selection Operator (LASSO) and Elastic Net regression were used to build insulin resistance estimation models from 1274 metabolites combined with clinical data, e.g. age, sex, body mass index (BMI). Metabolite data were transformed using three approaches, i.e. inverse normal transformation, standardization, and Box Cox transformation. The analysis was performed in one MA recruitment site (San Luis Valley, Colorado (SLV); N = 450) and tested in another MA recruitment site (San Antonio, Texas (SA); N = 473). In addition, the two MA recruitment sites were combined and estimation models tested in the AA recruitment sample (Los Angeles, California; N = 495). Estimated and empiric SI were correlated in the SA (r2 = 0.77) and AA (r2 = 0.74) testing datasets. Further, estimated and empiric SI were consistently associated with BMI, low-density lipoprotein cholesterol (LDL), and triglycerides. We applied similar approaches to estimate HOMA-IR with similar results. CONCLUSIONS: We have developed a method for estimating insulin resistance with metabolomics data that has the potential for application to a wide range of biomedical studies and conditions.


Assuntos
Aterosclerose , Resistência à Insulina , Humanos , Metabolômica , Aterosclerose/metabolismo
9.
medRxiv ; 2023 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-37645892

RESUMO

Background: The CCL2/CCR2 axis governs monocyte trafficking and recruitment to atherosclerotic lesions. Human genetic analyses and population-based studies support an association between circulating CCL2 levels and atherosclerosis. Still, it remains unknown whether pharmacological targeting of CCR2, the main CCL2 receptor, would provide protection against human atherosclerotic disease. Methods: In whole-exome sequencing data from 454,775 UK Biobank participants (40-69 years), we identified predicted loss-of-function (LoF) or damaging missense (REVEL score >0.5) variants within the CCR2 gene. We prioritized variants associated with lower monocyte count (p<0.05) and tested associations with vascular risk factors and risk of atherosclerotic disease over a mean follow-up of 14 years. The results were replicated in a pooled cohort of three independent datasets (TOPMed, deCODE and Penn Medicine BioBank; total n=441,445) and the effect of the most frequent damaging variant was experimentally validated. Results: A total of 45 predicted LoF or damaging missense variants were identified in the CCR2 gene, 4 of which were also significantly associated with lower monocyte count, but not with other white blood cell counts. Heterozygous carriers of these variants were at a lower risk of a combined atherosclerosis outcome, showed a lower burden of atherosclerosis across four vascular beds, and were at a lower lifetime risk of coronary artery disease and myocardial infarction. There was no evidence of association with vascular risk factors including LDL-cholesterol, blood pressure, glycemic status, or C-reactive protein. Using a cAMP assay, we found that cells transfected with the most frequent CCR2 damaging variant (3:46358273:T:A, M249K, 547 carriers, frequency: 0.14%) show a decrease in signaling in response to CCL2. The associations of the M249K variant with myocardial infarction were consistent across cohorts (ORUKB: 0.62 95%CI: 0.39-0.96; ORexternal: 0.64 95%CI: 0.34-1.19; ORpooled: 0.64 95%CI: 0.450.90). In a phenome-wide association study, we found no evidence for higher risk of common infections or mortality among carriers of damaging CCR2 variants. Conclusions: Heterozygous carriers of damaging CCR2 variants have a lower burden of atherosclerosis and lower lifetime risk of myocardial infarction. In conjunction with previous evidence from experimental and epidemiological studies, our findings highlight the translational potential of CCR2-targeting as an atheroprotective approach.

10.
medRxiv ; 2023 Jun 29.
Artigo em Inglês | MEDLINE | ID: mdl-37425772

RESUMO

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions. Large-scale whole genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess the associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with blood lipid levels (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare variant aggregate association tests using the STAAR (variant-Set Test for Association using Annotation infoRmation) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare coding variants in nearby protein coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500 kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variations and rare protein coding variations at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNA, implicating new therapeutic opportunities.

11.
Sci Adv ; 9(17): eabm4945, 2023 04 28.
Artigo em Inglês | MEDLINE | ID: mdl-37126548

RESUMO

Nononcogenic somatic mutations are thought to be uncommon and inconsequential. To test this, we analyzed 43,693 National Heart, Lung and Blood Institute Trans-Omics for Precision Medicine blood whole genomes from 37 cohorts and identified 7131 non-missense somatic mutations that are recurrently mutated in at least 50 individuals. These recurrent non-missense somatic mutations (RNMSMs) are not clearly explained by other clonal phenomena such as clonal hematopoiesis. RNMSM prevalence increased with age, with an average 50-year-old having 27 RNMSMs. Inherited germline variation associated with RNMSM acquisition. These variants were found in genes involved in adaptive immune function, proinflammatory cytokine production, and lymphoid lineage commitment. In addition, the presence of eight specific RNMSMs associated with blood cell traits at effect sizes comparable to Mendelian genetic mutations. Overall, we found that somatic mutations in blood are an unexpectedly common phenomenon with ancestry-specific determinants and human health consequences.


Assuntos
Mutação em Linhagem Germinativa , Hematopoese , Humanos , Pessoa de Meia-Idade , Mutação , Mutação de Sentido Incorreto , Fenótipo
12.
Metabolomics ; 19(4): 35, 2023 04 02.
Artigo em Inglês | MEDLINE | ID: mdl-37005925

RESUMO

INTRODUCTION: African Americans are at increased risk for type 2 diabetes. OBJECTIVES: This work aimed to examine metabolomic signature of glucose homeostasis in African Americans. METHODS: We used an untargeted liquid chromatography-mass spectrometry metabolomic approach to comprehensively profile 727 plasma metabolites among 571 African Americans from the Insulin Resistance Atherosclerosis Family Study (IRAS-FS) and investigate the associations between these metabolites and both the dynamic (SI, insulin sensitivity; AIR, acute insulin response; DI, disposition index; and SG, glucose effectiveness) and basal (HOMA-IR and HOMA-B) measures of glucose homeostasis using univariate and regularized regression models. We also compared the results with our previous findings in the IRAS-FS Mexican Americans. RESULTS: We confirmed increased plasma metabolite levels of branched-chain amino acids and their metabolic derivatives, 2-aminoadipate, 2-hydroxybutyrate, glutamate, arginine and its metabolic derivatives, carbohydrate metabolites, and medium- and long-chain fatty acids were associated with insulin resistance, while increased plasma metabolite levels in the glycine, serine and threonine metabolic pathway were associated with insulin sensitivity. We also observed a differential ancestral effect of glutamate on glucose homeostasis with significantly stronger effects observed in African Americans than those previously observed in Mexican Americans. CONCLUSION: We extended the observations that metabolites are useful biomarkers in the identification of prediabetes in individuals at risk of type 2 diabetes in African Americans. We revealed, for the first time, differential ancestral effect of certain metabolites (i.e., glutamate) on glucose homeostasis traits. Our study highlights the need for additional comprehensive metabolomic studies in well-characterized multiethnic cohorts.


Assuntos
Aterosclerose , Diabetes Mellitus Tipo 2 , Resistência à Insulina , Humanos , Aterosclerose/metabolismo , Negro ou Afro-Americano , Diabetes Mellitus Tipo 2/metabolismo , Glucose , Glutamatos , Homeostase/fisiologia , Metabolômica
13.
Nature ; 616(7958): 755-763, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-37046083

RESUMO

Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis1. These lesions are precursors for blood cancers2-6, but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.


Assuntos
Hematopoiese Clonal , Células-Tronco Hematopoéticas , Animais , Humanos , Camundongos , Alelos , Hematopoiese Clonal/genética , Estudo de Associação Genômica Ampla , Hematopoese/genética , Células-Tronco Hematopoéticas/citologia , Células-Tronco Hematopoéticas/metabolismo , Mutação , Regiões Promotoras Genéticas
14.
Res Sq ; 2023 Feb 03.
Artigo em Inglês | MEDLINE | ID: mdl-36778386

RESUMO

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hematologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

15.
bioRxiv ; 2023 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-36747810

RESUMO

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hemotologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

16.
Nat Genet ; 55(1): 154-164, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36564505

RESUMO

Meta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples.


Assuntos
Estudo de Associação Genômica Ampla , Lipídeos , Estudo de Associação Genômica Ampla/métodos , Sequenciamento Completo do Genoma/métodos , Sequenciamento do Exoma , Fenótipo , Lipídeos/genética
17.
Nature ; 612(7941): 720-724, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36477530

RESUMO

Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury1-4. These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries5. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.


Assuntos
Consumo de Bebidas Alcoólicas , Predisposição Genética para Doença , Variação Genética , Internacionalidade , Herança Multifatorial , Uso de Tabaco , Humanos , Predisposição Genética para Doença/genética , Variação Genética/genética , Estudo de Associação Genômica Ampla/métodos , Herança Multifatorial/genética , Fatores de Risco , Uso de Tabaco/genética , Consumo de Bebidas Alcoólicas/genética , Transcriptoma , Tamanho da Amostra , Loci Gênicos/genética , Europa (Continente)/etnologia
18.
Nat Methods ; 19(12): 1599-1611, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36303018

RESUMO

Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.


Assuntos
Estudo de Associação Genômica Ampla , Genoma , Humanos , Estudo de Associação Genômica Ampla/métodos , Sequenciamento Completo do Genoma/métodos , Fenótipo , Variação Genética
19.
Nat Hum Behav ; 6(11): 1577-1586, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-35927319

RESUMO

Common genetic variants explain less variation in complex phenotypes than inferred from family-based studies, and there is a debate on the source of this 'missing heritability'. We investigated the contribution of rare genetic variants to tobacco use with whole-genome sequences from up to 26,257 unrelated individuals of European ancestries and 11,743 individuals of African ancestries. Across four smoking traits, single-nucleotide-polymorphism-based heritability ([Formula: see text]) was estimated from 0.13 to 0.28 (s.e., 0.10-0.13) in European ancestries, with 35-74% of it attributable to rare variants with minor allele frequencies between 0.01% and 1%. These heritability estimates are 1.5-4 times higher than past estimates based on common variants alone and accounted for 60% to 100% of our pedigree-based estimates of narrow-sense heritability ([Formula: see text], 0.18-0.34). In the African ancestry samples, [Formula: see text] was estimated from 0.03 to 0.33 (s.e., 0.09-0.14) across the four smoking traits. These results suggest that rare variants are important contributors to the heritability of smoking.


Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Frequência do Gene , Polimorfismo de Nucleotídeo Único/genética , Fenótipo , Fumar/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA