Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 143
Filtrar
1.
Nature ; 627(8003): 347-357, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38374256

RESUMO

Type 2 diabetes (T2D) is a heterogeneous disease that develops through diverse pathophysiological processes1,2 and molecular mechanisms that are often specific to cell type3,4. Here, to characterize the genetic contribution to these processes across ancestry groups, we aggregate genome-wide association study data from 2,535,601 individuals (39.7% not of European ancestry), including 428,452 cases of T2D. We identify 1,289 independent association signals at genome-wide significance (P < 5 × 10-8) that map to 611 loci, of which 145 loci are, to our knowledge, previously unreported. We define eight non-overlapping clusters of T2D signals that are characterized by distinct profiles of cardiometabolic trait associations. These clusters are differentially enriched for cell-type-specific regions of open chromatin, including pancreatic islets, adipocytes, endothelial cells and enteroendocrine cells. We build cluster-specific partitioned polygenic scores5 in a further 279,552 individuals of diverse ancestry, including 30,288 cases of T2D, and test their association with T2D-related vascular outcomes. Cluster-specific partitioned polygenic scores are associated with coronary artery disease, peripheral artery disease and end-stage diabetic nephropathy across ancestry groups, highlighting the importance of obesity-related processes in the development of vascular outcomes. Our findings show the value of integrating multi-ancestry genome-wide association study data with single-cell epigenomics to disentangle the aetiological heterogeneity that drives the development and progression of T2D. This might offer a route to optimize global access to genetically informed diabetes care.


Assuntos
Diabetes Mellitus Tipo 2 , Progressão da Doença , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Adipócitos/metabolismo , Cromatina/genética , Cromatina/metabolismo , Doença da Artéria Coronariana/complicações , Doença da Artéria Coronariana/genética , Diabetes Mellitus Tipo 2/classificação , Diabetes Mellitus Tipo 2/complicações , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/patologia , Diabetes Mellitus Tipo 2/fisiopatologia , Nefropatias Diabéticas/complicações , Nefropatias Diabéticas/genética , Células Endoteliais/metabolismo , Células Enteroendócrinas , Epigenômica , Predisposição Genética para Doença/genética , Ilhotas Pancreáticas/metabolismo , Herança Multifatorial/genética , Doença Arterial Periférica/complicações , Doença Arterial Periférica/genética , Análise de Célula Única
2.
Front Genet ; 14: 1235337, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38028628

RESUMO

Introduction: Educational attainment, widely used in epidemiologic studies as a surrogate for socioeconomic status, is a predictor of cardiovascular health outcomes. Methods: A two-stage genome-wide meta-analysis of low-density lipoprotein cholesterol (LDL), high-density lipoprotein cholesterol (HDL), and triglyceride (TG) levels was performed while accounting for gene-educational attainment interactions in up to 226,315 individuals from five population groups. We considered two educational attainment variables: "Some College" (yes/no, for any education beyond high school) and "Graduated College" (yes/no, for completing a 4-year college degree). Genome-wide significant (p < 5 × 10-8) and suggestive (p < 1 × 10-6) variants were identified in Stage 1 (in up to 108,784 individuals) through genome-wide analysis, and those variants were followed up in Stage 2 studies (in up to 117,531 individuals). Results: In combined analysis of Stages 1 and 2, we identified 18 novel lipid loci (nine for LDL, seven for HDL, and two for TG) by two degree-of-freedom (2 DF) joint tests of main and interaction effects. Four loci showed significant interaction with educational attainment. Two loci were significant only in cross-population analyses. Several loci include genes with known or suggested roles in adipose (FOXP1, MBOAT4, SKP2, STIM1, STX4), brain (BRI3, FILIP1, FOXP1, LINC00290, LMTK2, MBOAT4, MYO6, SENP6, SRGAP3, STIM1, TMEM167A, TMEM30A), and liver (BRI3, FOXP1) biology, highlighting the potential importance of brain-adipose-liver communication in the regulation of lipid metabolism. An investigation of the potential druggability of genes in identified loci resulted in five gene targets shown to interact with drugs approved by the Food and Drug Administration, including genes with roles in adipose and brain tissue. Discussion: Genome-wide interaction analysis of educational attainment identified novel lipid loci not previously detected by analyses limited to main genetic effects.

3.
Circ Genom Precis Med ; 16(6): e004176, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38014529

RESUMO

BACKGROUND: Individuals with type 2 diabetes (T2D) have an increased risk of coronary artery disease (CAD), but questions remain about the underlying pathology. Identifying which CAD loci are modified by T2D in the development of subclinical atherosclerosis (coronary artery calcification [CAC], carotid intima-media thickness, or carotid plaque) may improve our understanding of the mechanisms leading to the increased CAD in T2D. METHODS: We compared the common and rare variant associations of known CAD loci from the literature on CAC, carotid intima-media thickness, and carotid plaque in up to 29 670 participants, including up to 24 157 normoglycemic controls and 5513 T2D cases leveraging whole-genome sequencing data from the Trans-Omics for Precision Medicine program. We included first-order T2D interaction terms in each model to determine whether CAD loci were modified by T2D. The genetic main and interaction effects were assessed using a joint test to determine whether a CAD variant, or gene-based rare variant set, was associated with the respective subclinical atherosclerosis measures and then further determined whether these loci had a significant interaction test. RESULTS: Using a Bonferroni-corrected significance threshold of P<1.6×10-4, we identified 3 genes (ATP1B1, ARVCF, and LIPG) associated with CAC and 2 genes (ABCG8 and EIF2B2) associated with carotid intima-media thickness and carotid plaque, respectively, through gene-based rare variant set analysis. Both ATP1B1 and ARVCF also had significantly different associations for CAC in T2D cases versus controls. No significant interaction tests were identified through the candidate single-variant analysis. CONCLUSIONS: These results highlight T2D as an important modifier of rare variant associations in CAD loci with CAC.


Assuntos
Aterosclerose , Doença da Artéria Coronariana , Diabetes Mellitus Tipo 2 , Placa Aterosclerótica , Humanos , Doença da Artéria Coronariana/genética , Diabetes Mellitus Tipo 2/complicações , Diabetes Mellitus Tipo 2/genética , Espessura Intima-Media Carotídea , Fatores de Risco , Aterosclerose/genética , Genômica
4.
J Am Heart Assoc ; 12(20): e029090, 2023 10 17.
Artigo em Inglês | MEDLINE | ID: mdl-37804200

RESUMO

Background The relationship between mitochondrial DNA copy number (mtDNA CN) and cardiovascular disease remains elusive. Methods and Results We performed cross-sectional and prospective association analyses of blood-derived mtDNA CN and cardiovascular disease outcomes in 27 316 participants in 8 cohorts of multiple racial and ethnic groups with whole-genome sequencing. We also performed Mendelian randomization to explore causal relationships of mtDNA CN with coronary heart disease (CHD) and cardiometabolic risk factors (obesity, diabetes, hypertension, and hyperlipidemia). P<0.01 was used for significance. We validated most of the previously reported associations between mtDNA CN and cardiovascular disease outcomes. For example, 1-SD unit lower level of mtDNA CN was associated with 1.08 (95% CI, 1.04-1.12; P<0.001) times the hazard for developing incident CHD, adjusting for covariates. Mendelian randomization analyses showed no causal effect from a lower level of mtDNA CN to a higher CHD risk (ß=0.091; P=0.11) or in the reverse direction (ß=-0.012; P=0.076). Additional bidirectional Mendelian randomization analyses revealed that low-density lipoprotein cholesterol had a causal effect on mtDNA CN (ß=-0.084; P<0.001), but the reverse direction was not significant (P=0.059). No causal associations were observed between mtDNA CN and obesity, diabetes, and hypertension, in either direction. Multivariable Mendelian randomization analyses showed no causal effect of CHD on mtDNA CN, controlling for low-density lipoprotein cholesterol level (P=0.52), whereas there was a strong direct causal effect of higher low-density lipoprotein cholesterol on lower mtDNA CN, adjusting for CHD status (ß=-0.092; P<0.001). Conclusions Our findings indicate that high low-density lipoprotein cholesterol may underlie the complex relationships between mtDNA CN and vascular atherosclerosis.


Assuntos
Doenças Cardiovasculares , Doença das Coronárias , Diabetes Mellitus , Hipertensão , Humanos , DNA Mitocondrial/genética , Fatores de Risco , Doenças Cardiovasculares/epidemiologia , Doenças Cardiovasculares/genética , LDL-Colesterol , Variações do Número de Cópias de DNA , Estudos Transversais , Doença das Coronárias/genética , HDL-Colesterol , Hipertensão/epidemiologia , Hipertensão/genética , Obesidade
5.
medRxiv ; 2023 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-37662265

RESUMO

Obesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data from European individuals. This study leveraged whole-genome sequencing (WGS) data from 88,873 participants from the Trans-Omics for Precision Medicine (TOPMed) Program, of which 51% were of non-European population groups. We discovered 18 BMI-associated signals (P < 5 × 10-9). Notably, we identified and replicated a novel low frequency single nucleotide polymorphism (SNP) in MTMR3 that was common in individuals of African descent. Using a diverse study population, we further identified two novel secondary signals in known BMI loci and pinpointed two likely causal variants in the POC5 and DMD loci. Our work demonstrates the benefits of combining WGS and diverse cohorts in expanding current catalog of variants and genes confer risk for obesity, bringing us one step closer to personalized medicine.

6.
Nat Genet ; 55(10): 1640-1650, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37709864

RESUMO

Nonalcoholic fatty liver disease (NAFLD) is common and partially heritable and has no effective treatments. We carried out a genome-wide association study (GWAS) meta-analysis of imaging (n = 66,814) and diagnostic code (3,584 cases versus 621,081 controls) measured NAFLD across diverse ancestries. We identified NAFLD-associated variants at torsin family 1 member B (TOR1B), fat mass and obesity associated (FTO), cordon-bleu WH2 repeat protein like 1 (COBLL1)/growth factor receptor-bound protein 14 (GRB14), insulin receptor (INSR), sterol regulatory element-binding transcription factor 1 (SREBF1) and patatin-like phospholipase domain-containing protein 2 (PNPLA2), as well as validated NAFLD-associated variants at patatin-like phospholipase domain-containing protein 3 (PNPLA3), transmembrane 6 superfamily 2 (TM6SF2), apolipoprotein E (APOE), glucokinase regulator (GCKR), tribbles homolog 1 (TRIB1), glycerol-3-phosphate acyltransferase (GPAM), mitochondrial amidoxime-reducing component 1 (MARC1), microsomal triglyceride transfer protein large subunit (MTTP), alcohol dehydrogenase 1B (ADH1B), transmembrane channel like 4 (TMC4)/membrane-bound O-acyltransferase domain containing 7 (MBOAT7) and receptor-type tyrosine-protein phosphatase δ (PTPRD). Implicated genes highlight mitochondrial, cholesterol and de novo lipogenesis as causally contributing to NAFLD predisposition. Phenome-wide association study (PheWAS) analyses suggest at least seven subtypes of NAFLD. Individuals in the top 10% and 1% of genetic risk have a 2.5-fold to 6-fold increased risk of NAFLD, cirrhosis and hepatocellular carcinoma. These genetic variants identify subtypes of NAFLD, improve estimates of disease risk and can guide the development of targeted therapeutics.


Assuntos
Hepatopatia Gordurosa não Alcoólica , Humanos , Hepatopatia Gordurosa não Alcoólica/genética , Hepatopatia Gordurosa não Alcoólica/complicações , Hepatopatia Gordurosa não Alcoólica/metabolismo , Estudo de Associação Genômica Ampla , Cirrose Hepática/genética , Aciltransferases/genética , Aciltransferases/metabolismo , Fosfolipases/genética , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Fígado/metabolismo , Proteínas Serina-Treonina Quinases/genética , Peptídeos e Proteínas de Sinalização Intracelular/genética , Dioxigenase FTO Dependente de alfa-Cetoglutarato/genética , Dioxigenase FTO Dependente de alfa-Cetoglutarato/metabolismo
7.
Nat Commun ; 14(1): 4646, 2023 08 02.
Artigo em Inglês | MEDLINE | ID: mdl-37532724

RESUMO

Resting heart rate is associated with cardiovascular diseases and mortality in observational and Mendelian randomization studies. The aims of this study are to extend the number of resting heart rate associated genetic variants and to obtain further insights in resting heart rate biology and its clinical consequences. A genome-wide meta-analysis of 100 studies in up to 835,465 individuals reveals 493 independent genetic variants in 352 loci, including 68 genetic variants outside previously identified resting heart rate associated loci. We prioritize 670 genes and in silico annotations point to their enrichment in cardiomyocytes and provide insights in their ECG signature. Two-sample Mendelian randomization analyses indicate that higher genetically predicted resting heart rate increases risk of dilated cardiomyopathy, but decreases risk of developing atrial fibrillation, ischemic stroke, and cardio-embolic stroke. We do not find evidence for a linear or non-linear genetic association between resting heart rate and all-cause mortality in contrast to our previous Mendelian randomization study. Systematic alteration of key differences between the current and previous Mendelian randomization study indicates that the most likely cause of the discrepancy between these studies arises from false positive findings in previous one-sample MR analyses caused by weak-instrument bias at lower P-value thresholds. The results extend our understanding of resting heart rate biology and give additional insights in its role in cardiovascular disease development.


Assuntos
Fibrilação Atrial , Doenças Cardiovasculares , Humanos , Doenças Cardiovasculares/genética , Fatores de Risco , Frequência Cardíaca/genética , Predisposição Genética para Doença , Análise da Randomização Mendeliana/métodos , Estudo de Associação Genômica Ampla/métodos , Polimorfismo de Nucleotídeo Único
8.
medRxiv ; 2023 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-37645892

RESUMO

Background: The CCL2/CCR2 axis governs monocyte trafficking and recruitment to atherosclerotic lesions. Human genetic analyses and population-based studies support an association between circulating CCL2 levels and atherosclerosis. Still, it remains unknown whether pharmacological targeting of CCR2, the main CCL2 receptor, would provide protection against human atherosclerotic disease. Methods: In whole-exome sequencing data from 454,775 UK Biobank participants (40-69 years), we identified predicted loss-of-function (LoF) or damaging missense (REVEL score >0.5) variants within the CCR2 gene. We prioritized variants associated with lower monocyte count (p<0.05) and tested associations with vascular risk factors and risk of atherosclerotic disease over a mean follow-up of 14 years. The results were replicated in a pooled cohort of three independent datasets (TOPMed, deCODE and Penn Medicine BioBank; total n=441,445) and the effect of the most frequent damaging variant was experimentally validated. Results: A total of 45 predicted LoF or damaging missense variants were identified in the CCR2 gene, 4 of which were also significantly associated with lower monocyte count, but not with other white blood cell counts. Heterozygous carriers of these variants were at a lower risk of a combined atherosclerosis outcome, showed a lower burden of atherosclerosis across four vascular beds, and were at a lower lifetime risk of coronary artery disease and myocardial infarction. There was no evidence of association with vascular risk factors including LDL-cholesterol, blood pressure, glycemic status, or C-reactive protein. Using a cAMP assay, we found that cells transfected with the most frequent CCR2 damaging variant (3:46358273:T:A, M249K, 547 carriers, frequency: 0.14%) show a decrease in signaling in response to CCL2. The associations of the M249K variant with myocardial infarction were consistent across cohorts (ORUKB: 0.62 95%CI: 0.39-0.96; ORexternal: 0.64 95%CI: 0.34-1.19; ORpooled: 0.64 95%CI: 0.450.90). In a phenome-wide association study, we found no evidence for higher risk of common infections or mortality among carriers of damaging CCR2 variants. Conclusions: Heterozygous carriers of damaging CCR2 variants have a lower burden of atherosclerosis and lower lifetime risk of myocardial infarction. In conjunction with previous evidence from experimental and epidemiological studies, our findings highlight the translational potential of CCR2-targeting as an atheroprotective approach.

9.
medRxiv ; 2023 Jun 12.
Artigo em Inglês | MEDLINE | ID: mdl-37398003

RESUMO

Genetic studies have identified numerous regions associated with plasma fibrinogen levels in Europeans, yet missing heritability and limited inclusion of non-Europeans necessitates further studies with improved power and sensitivity. Compared with array-based genotyping, whole genome sequencing (WGS) data provides better coverage of the genome and better representation of non-European variants. To better understand the genetic landscape regulating plasma fibrinogen levels, we meta-analyzed WGS data from the NHLBI's Trans-Omics for Precision Medicine (TOPMed) program (n=32,572), with array-based genotype data from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium (n=131,340) imputed to the TOPMed or Haplotype Reference Consortium panel. We identified 18 loci that have not been identified in prior genetic studies of fibrinogen. Of these, four are driven by common variants of small effect with reported MAF at least 10% higher in African populations. Three ( SERPINA1, ZFP36L2 , and TLR10) signals contain predicted deleterious missense variants. Two loci, SOCS3 and HPN , each harbor two conditionally distinct, non-coding variants. The gene region encoding the protein chain subunits ( FGG;FGB;FGA ), contains 7 distinct signals, including one novel signal driven by rs28577061, a variant common (MAF=0.180) in African reference panels but extremely rare (MAF=0.008) in Europeans. Through phenome-wide association studies in the VA Million Veteran Program, we found associations between fibrinogen polygenic risk scores and thrombotic and inflammatory disease phenotypes, including an association with gout. Our findings demonstrate the utility of WGS to augment genetic discovery in diverse populations and offer new insights for putative mechanisms of fibrinogen regulation. Key Points: Largest and most diverse genetic study of plasma fibrinogen identifies 54 regions (18 novel), housing 69 conditionally distinct variants (20 novel).Sufficient power achieved to identify signal driven by African population variant.Links to (1) liver enzyme, blood cell and lipid genetic signals, (2) liver regulatory elements, and (3) thrombotic and inflammatory disease.

11.
medRxiv ; 2023 Mar 31.
Artigo em Inglês | MEDLINE | ID: mdl-37034649

RESUMO

Type 2 diabetes (T2D) is a heterogeneous disease that develops through diverse pathophysiological processes. To characterise the genetic contribution to these processes across ancestry groups, we aggregate genome-wide association study (GWAS) data from 2,535,601 individuals (39.7% non-European ancestry), including 428,452 T2D cases. We identify 1,289 independent association signals at genome-wide significance (P<5×10-8) that map to 611 loci, of which 145 loci are previously unreported. We define eight non-overlapping clusters of T2D signals characterised by distinct profiles of cardiometabolic trait associations. These clusters are differentially enriched for cell-type specific regions of open chromatin, including pancreatic islets, adipocytes, endothelial, and enteroendocrine cells. We build cluster-specific partitioned genetic risk scores (GRS) in an additional 137,559 individuals of diverse ancestry, including 10,159 T2D cases, and test their association with T2D-related vascular outcomes. Cluster-specific partitioned GRS are more strongly associated with coronary artery disease and end-stage diabetic nephropathy than an overall T2D GRS across ancestry groups, highlighting the importance of obesity-related processes in the development of vascular outcomes. Our findings demonstrate the value of integrating multi-ancestry GWAS with single-cell epigenomics to disentangle the aetiological heterogeneity driving the development and progression of T2D, which may offer a route to optimise global access to genetically-informed diabetes care.

12.
Nat Hum Behav ; 7(5): 790-801, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-36864135

RESUMO

Identifying genetic determinants of reproductive success may highlight mechanisms underlying fertility and identify alleles under present-day selection. Using data in 785,604 individuals of European ancestry, we identified 43 genomic loci associated with either number of children ever born (NEB) or childlessness. These loci span diverse aspects of reproductive biology, including puberty timing, age at first birth, sex hormone regulation, endometriosis and age at menopause. Missense variants in ARHGAP27 were associated with higher NEB but shorter reproductive lifespan, suggesting a trade-off at this locus between reproductive ageing and intensity. Other genes implicated by coding variants include PIK3IP1, ZFP82 and LRP4, and our results suggest a new role for the melanocortin 1 receptor (MC1R) in reproductive biology. As NEB is one component of evolutionary fitness, our identified associations indicate loci under present-day natural selection. Integration with data from historical selection scans highlighted an allele in the FADS1/2 gene locus that has been under selection for thousands of years and remains so today. Collectively, our findings demonstrate that a broad range of biological mechanisms contribute to reproductive success.


Assuntos
Fertilidade , Reprodução , Criança , Feminino , Humanos , Envelhecimento/fisiologia , Fertilidade/genética , Menopausa/genética , Reprodução/genética , Seleção Genética
13.
J Mol Endocrinol ; 70(3)2023 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-36748836

RESUMO

Human genome-wide association studies found single-nucleotide polymorphisms (SNPs) near LYPLAL1 (Lysophospholipase-like protein 1) that have sex-specific effects on fat distribution and metabolic traits. To determine whether altering LYPLAL1 affects obesity and metabolic disease, we created and characterized a mouse knockout (KO) of Lyplal1. We fed the experimental group of mice a high-fat, high-sucrose (HFHS) diet for 23 weeks, and the controls were fed regular chow diet. Here, we show that CRISPR-Cas9 whole-body Lyplal1 KO mice fed an HFHS diet showed sex-specific differences in weight gain and fat accumulation as compared to chow diet. Female, not male, KO mice weighed less than WT mice, had reduced body fat percentage, had white fat mass, and had adipocyte diameter not accounted for by changes in the metabolic rate. Female, but not male, KO mice had increased serum triglycerides, decreased aspartate, and decreased alanine aminotransferase. Lyplal1 KO mice of both sexes have reduced liver triglycerides and steatosis. These diet-specific effects resemble the effects of SNPs near LYPLAL1 in humans, suggesting that LYPLAL1 has an evolutionary conserved sex-specific effect on adiposity. This murine model can be used to study this novel gene-by-sex-by-diet interaction to elucidate the metabolic effects of LYPLAL1 on human obesity.


Assuntos
Estudo de Associação Genômica Ampla , Lisofosfolipase , Obesidade , Animais , Feminino , Humanos , Masculino , Camundongos , Dieta Hiperlipídica/efeitos adversos , Camundongos Endogâmicos C57BL , Camundongos Knockout , Obesidade/genética , Obesidade/metabolismo , Triglicerídeos , Lisofosfolipase/genética
14.
Nat Genet ; 55(2): 291-300, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36702996

RESUMO

Most transcriptome-wide association studies (TWASs) so far focus on European ancestry and lack diversity. To overcome this limitation, we aggregated genome-wide association study (GWAS) summary statistics, whole-genome sequences and expression quantitative trait locus (eQTL) data from diverse ancestries. We developed a new approach, TESLA (multi-ancestry integrative study using an optimal linear combination of association statistics), to integrate an eQTL dataset with a multi-ancestry GWAS. By exploiting shared phenotypic effects between ancestries and accommodating potential effect heterogeneities, TESLA improves power over other TWAS methods. When applied to tobacco use phenotypes, TESLA identified 273 new genes, up to 55% more compared with alternative TWAS methods. These hits and subsequent fine mapping using TESLA point to target genes with biological relevance. In silico drug-repurposing analyses highlight several drugs with known efficacy, including dextromethorphan and galantamine, and new drugs such as muscle relaxants that may be repurposed for treating nicotine addiction.


Assuntos
Reposicionamento de Medicamentos , Transcriptoma , Humanos , Transcriptoma/genética , Estudo de Associação Genômica Ampla/métodos , Uso de Tabaco , Biologia , Polimorfismo de Nucleotídeo Único/genética , Predisposição Genética para Doença
15.
Nat Genet ; 55(1): 154-164, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36564505

RESUMO

Meta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples.


Assuntos
Estudo de Associação Genômica Ampla , Lipídeos , Estudo de Associação Genômica Ampla/métodos , Sequenciamento Completo do Genoma/métodos , Sequenciamento do Exoma , Fenótipo , Lipídeos/genética
16.
Nature ; 612(7941): 720-724, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36477530

RESUMO

Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury1-4. These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries5. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.


Assuntos
Consumo de Bebidas Alcoólicas , Predisposição Genética para Doença , Variação Genética , Internacionalidade , Herança Multifatorial , Uso de Tabaco , Humanos , Predisposição Genética para Doença/genética , Variação Genética/genética , Estudo de Associação Genômica Ampla/métodos , Herança Multifatorial/genética , Fatores de Risco , Uso de Tabaco/genética , Consumo de Bebidas Alcoólicas/genética , Transcriptoma , Tamanho da Amostra , Loci Gênicos/genética , Europa (Continente)/etnologia
17.
Nature ; 610(7933): 704-712, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-36224396

RESUMO

Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.


Assuntos
Estatura , Mapeamento Cromossômico , Polimorfismo de Nucleotídeo Único , Humanos , Estatura/genética , Frequência do Gene/genética , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Haplótipos/genética , Desequilíbrio de Ligação/genética , Polimorfismo de Nucleotídeo Único/genética , Europa (Continente)/etnologia , Tamanho da Amostra , Fenótipo
18.
Nat Methods ; 19(12): 1599-1611, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36303018

RESUMO

Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.


Assuntos
Estudo de Associação Genômica Ampla , Genoma , Humanos , Estudo de Associação Genômica Ampla/métodos , Sequenciamento Completo do Genoma/métodos , Fenótipo , Variação Genética
19.
Nat Genet ; 54(9): 1332-1344, 2022 09.
Artigo em Inglês | MEDLINE | ID: mdl-36071172

RESUMO

Although physical activity and sedentary behavior are moderately heritable, little is known about the mechanisms that influence these traits. Combining data for up to 703,901 individuals from 51 studies in a multi-ancestry meta-analysis of genome-wide association studies yields 99 loci that associate with self-reported moderate-to-vigorous intensity physical activity during leisure time (MVPA), leisure screen time (LST) and/or sedentary behavior at work. Loci associated with LST are enriched for genes whose expression in skeletal muscle is altered by resistance training. A missense variant in ACTN3 makes the alpha-actinin-3 filaments more flexible, resulting in lower maximal force in isolated type IIA muscle fibers, and possibly protection from exercise-induced muscle damage. Finally, Mendelian randomization analyses show that beneficial effects of lower LST and higher MVPA on several risk factors and diseases are mediated or confounded by body mass index (BMI). Our results provide insights into physical activity mechanisms and its role in disease prevention.


Assuntos
Estudo de Associação Genômica Ampla , Comportamento Sedentário , Actinina/genética , Estudos Transversais , Exercício Físico/fisiologia , Humanos , Atividades de Lazer
20.
Commun Biol ; 5(1): 756, 2022 07 28.
Artigo em Inglês | MEDLINE | ID: mdl-35902682

RESUMO

The genetic determinants of fasting glucose (FG) and fasting insulin (FI) have been studied mostly through genome arrays, resulting in over 100 associated variants. We extended this work with high-coverage whole genome sequencing analyses from fifteen cohorts in NHLBI's Trans-Omics for Precision Medicine (TOPMed) program. Over 23,000 non-diabetic individuals from five race-ethnicities/populations (African, Asian, European, Hispanic and Samoan) were included. Eight variants were significantly associated with FG or FI across previously identified regions MTNR1B, G6PC2, GCK, GCKR and FOXA2. We additionally characterize suggestive associations with FG or FI near previously identified SLC30A8, TCF7L2, and ADCY5 regions as well as APOB, PTPRT, and ROBO1. Functional annotation resources including the Diabetes Epigenome Atlas were compiled for each signal (chromatin states, annotation principal components, and others) to elucidate variant-to-function hypotheses. We provide a catalog of nucleotide-resolution genomic variation spanning intergenic and intronic regions creating a foundation for future sequencing-based investigations of glycemic traits.


Assuntos
Diabetes Mellitus Tipo 2 , Jejum , Diabetes Mellitus Tipo 2/genética , Glucose , Humanos , Insulina/genética , National Heart, Lung, and Blood Institute (U.S.) , Proteínas do Tecido Nervoso/genética , Polimorfismo de Nucleotídeo Único , Medicina de Precisão , Receptores Imunológicos/genética , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...