Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 371
Filtrar
Más filtros

Base de datos
Tipo del documento
Intervalo de año de publicación
1.
Diabetes ; 73(7): 1188-1195, 2024 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-38394643

RESUMEN

Diabetic kidney disease (DKD) is the leading cause of end-stage kidney disease. Because many genes associate with DKD, multiomics approaches were used to narrow the list of functional genes, gene products, and related pathways providing insights into the pathophysiological mechanisms of DKD. The Kidney Precision Medicine Project human kidney single-cell RNA-sequencing (scRNA-seq) data set and Mendeley Data on human kidney cortex biopsy proteomics were used. The R package Seurat was used to analyze scRNA-seq data and data from a subset of proximal tubule cells. PathfindR was applied for pathway analysis in cell type-specific differentially expressed genes and the R limma package was used to analyze differential protein expression in kidney cortex. A total of 790 differentially expressed genes were identified in proximal tubule cells, including 530 upregulated and 260 downregulated transcripts. Compared with differentially expressed proteins, 24 genes or proteins were in common. An integrated analysis combining protein quantitative trait loci, genome-wide association study hits (namely, estimated glomerular filtration rate), and a plasma metabolomics analysis was performed using baseline metabolites predictive of DKD progression in our longitudinal Diabetes Heart Study samples. The aldo-keto reductase family 1 member A1 gene (AKR1A1) was revealed as a potential molecular hub for DKD cellular dysfunction in several cross-linked pathways featured by deficiency of this enzyme.


Asunto(s)
Aldehído Reductasa , Biomarcadores , Nefropatías Diabéticas , Nefropatías Diabéticas/metabolismo , Nefropatías Diabéticas/genética , Humanos , Biomarcadores/metabolismo , Aldehído Reductasa/genética , Aldehído Reductasa/metabolismo , Proteómica/métodos , Estudio de Asociación del Genoma Completo , Masculino , Túbulos Renales Proximales/metabolismo , Femenino , Persona de Mediana Edad , Multiómica
2.
bioRxiv ; 2023 Nov 02.
Artículo en Inglés | MEDLINE | ID: mdl-37961350

RESUMEN

Large-scale whole-genome sequencing (WGS) studies have improved our understanding of the contributions of coding and noncoding rare variants to complex human traits. Leveraging association effect sizes across multiple traits in WGS rare variant association analysis can improve statistical power over single-trait analysis, and also detect pleiotropic genes and regions. Existing multi-trait methods have limited ability to perform rare variant analysis of large-scale WGS data. We propose MultiSTAAR, a statistical framework and computationally-scalable analytical pipeline for functionally-informed multi-trait rare variant analysis in large-scale WGS studies. MultiSTAAR accounts for relatedness, population structure and correlation among phenotypes by jointly analyzing multiple traits, and further empowers rare variant association analysis by incorporating multiple functional annotations. We applied MultiSTAAR to jointly analyze three lipid traits (low-density lipoprotein cholesterol, high-density lipoprotein cholesterol and triglycerides) in 61,861 multi-ethnic samples from the Trans-Omics for Precision Medicine (TOPMed) Program. We discovered new associations with lipid traits missed by single-trait analysis, including rare variants within an enhancer of NIPSNAP3A and an intergenic region on chromosome 1.

3.
Circ Genom Precis Med ; 16(6): e004176, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38014529

RESUMEN

BACKGROUND: Individuals with type 2 diabetes (T2D) have an increased risk of coronary artery disease (CAD), but questions remain about the underlying pathology. Identifying which CAD loci are modified by T2D in the development of subclinical atherosclerosis (coronary artery calcification [CAC], carotid intima-media thickness, or carotid plaque) may improve our understanding of the mechanisms leading to the increased CAD in T2D. METHODS: We compared the common and rare variant associations of known CAD loci from the literature on CAC, carotid intima-media thickness, and carotid plaque in up to 29 670 participants, including up to 24 157 normoglycemic controls and 5513 T2D cases leveraging whole-genome sequencing data from the Trans-Omics for Precision Medicine program. We included first-order T2D interaction terms in each model to determine whether CAD loci were modified by T2D. The genetic main and interaction effects were assessed using a joint test to determine whether a CAD variant, or gene-based rare variant set, was associated with the respective subclinical atherosclerosis measures and then further determined whether these loci had a significant interaction test. RESULTS: Using a Bonferroni-corrected significance threshold of P<1.6×10-4, we identified 3 genes (ATP1B1, ARVCF, and LIPG) associated with CAC and 2 genes (ABCG8 and EIF2B2) associated with carotid intima-media thickness and carotid plaque, respectively, through gene-based rare variant set analysis. Both ATP1B1 and ARVCF also had significantly different associations for CAC in T2D cases versus controls. No significant interaction tests were identified through the candidate single-variant analysis. CONCLUSIONS: These results highlight T2D as an important modifier of rare variant associations in CAD loci with CAC.


Asunto(s)
Aterosclerosis , Enfermedad de la Arteria Coronaria , Diabetes Mellitus Tipo 2 , Placa Aterosclerótica , Humanos , Enfermedad de la Arteria Coronaria/genética , Diabetes Mellitus Tipo 2/complicaciones , Diabetes Mellitus Tipo 2/genética , Grosor Intima-Media Carotídeo , Factores de Riesgo , Aterosclerosis/genética , Genómica
4.
Am J Hum Genet ; 110(10): 1704-1717, 2023 10 05.
Artículo en Inglés | MEDLINE | ID: mdl-37802043

RESUMEN

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions in lipid metabolism. Large-scale whole-genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess more associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with measurement of blood lipids and lipoproteins (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare-variant aggregate association tests using the STAAR (variant-set test for association using annotation information) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare-coding variants in nearby protein-coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500-kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variation and rare protein-coding variation at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNAs.


Asunto(s)
ARN Largo no Codificante , Humanos , ARN Largo no Codificante/genética , Estudio de Asociación del Genoma Completo , Medicina de Precisión , Secuenciación Completa del Genoma/métodos , Lípidos/genética , Polimorfismo de Nucleótido Simple/genética
5.
Nat Genet ; 55(10): 1640-1650, 2023 10.
Artículo en Inglés | MEDLINE | ID: mdl-37709864

RESUMEN

Nonalcoholic fatty liver disease (NAFLD) is common and partially heritable and has no effective treatments. We carried out a genome-wide association study (GWAS) meta-analysis of imaging (n = 66,814) and diagnostic code (3,584 cases versus 621,081 controls) measured NAFLD across diverse ancestries. We identified NAFLD-associated variants at torsin family 1 member B (TOR1B), fat mass and obesity associated (FTO), cordon-bleu WH2 repeat protein like 1 (COBLL1)/growth factor receptor-bound protein 14 (GRB14), insulin receptor (INSR), sterol regulatory element-binding transcription factor 1 (SREBF1) and patatin-like phospholipase domain-containing protein 2 (PNPLA2), as well as validated NAFLD-associated variants at patatin-like phospholipase domain-containing protein 3 (PNPLA3), transmembrane 6 superfamily 2 (TM6SF2), apolipoprotein E (APOE), glucokinase regulator (GCKR), tribbles homolog 1 (TRIB1), glycerol-3-phosphate acyltransferase (GPAM), mitochondrial amidoxime-reducing component 1 (MARC1), microsomal triglyceride transfer protein large subunit (MTTP), alcohol dehydrogenase 1B (ADH1B), transmembrane channel like 4 (TMC4)/membrane-bound O-acyltransferase domain containing 7 (MBOAT7) and receptor-type tyrosine-protein phosphatase δ (PTPRD). Implicated genes highlight mitochondrial, cholesterol and de novo lipogenesis as causally contributing to NAFLD predisposition. Phenome-wide association study (PheWAS) analyses suggest at least seven subtypes of NAFLD. Individuals in the top 10% and 1% of genetic risk have a 2.5-fold to 6-fold increased risk of NAFLD, cirrhosis and hepatocellular carcinoma. These genetic variants identify subtypes of NAFLD, improve estimates of disease risk and can guide the development of targeted therapeutics.


Asunto(s)
Enfermedad del Hígado Graso no Alcohólico , Humanos , Enfermedad del Hígado Graso no Alcohólico/genética , Enfermedad del Hígado Graso no Alcohólico/complicaciones , Enfermedad del Hígado Graso no Alcohólico/metabolismo , Estudio de Asociación del Genoma Completo , Cirrosis Hepática/genética , Aciltransferasas/genética , Aciltransferasas/metabolismo , Fosfolipasas/genética , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Hígado/metabolismo , Proteínas Serina-Treonina Quinasas/genética , Péptidos y Proteínas de Señalización Intracelular/genética , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato/genética , Dioxigenasa FTO Dependiente de Alfa-Cetoglutarato/metabolismo
6.
Nat Genet ; 55(10): 1651-1664, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37770635

RESUMEN

Coronary artery calcification (CAC), a measure of subclinical atherosclerosis, predicts future symptomatic coronary artery disease (CAD). Identifying genetic risk factors for CAC may point to new therapeutic avenues for prevention. Currently, there are only four known risk loci for CAC identified from genome-wide association studies (GWAS) in the general population. Here we conducted the largest multi-ancestry GWAS meta-analysis of CAC to date, which comprised 26,909 individuals of European ancestry and 8,867 individuals of African ancestry. We identified 11 independent risk loci, of which eight were new for CAC and five had not been reported for CAD. These new CAC loci are related to bone mineralization, phosphate catabolism and hormone metabolic pathways. Several new loci harbor candidate causal genes supported by multiple lines of functional evidence and are regulators of smooth muscle cell-mediated calcification ex vivo and in vitro. Together, these findings help refine the genetic architecture of CAC and extend our understanding of the biological and potential druggable pathways underlying CAC.


Asunto(s)
Aterosclerosis , Enfermedad de la Arteria Coronaria , Humanos , Aterosclerosis/genética , Población Negra/genética , Enfermedad de la Arteria Coronaria/genética , Estudio de Asociación del Genoma Completo , Factores de Riesgo , Pueblo Europeo/genética
7.
medRxiv ; 2023 Aug 22.
Artículo en Inglés | MEDLINE | ID: mdl-37662265

RESUMEN

Obesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data from European individuals. This study leveraged whole-genome sequencing (WGS) data from 88,873 participants from the Trans-Omics for Precision Medicine (TOPMed) Program, of which 51% were of non-European population groups. We discovered 18 BMI-associated signals (P < 5 × 10-9). Notably, we identified and replicated a novel low frequency single nucleotide polymorphism (SNP) in MTMR3 that was common in individuals of African descent. Using a diverse study population, we further identified two novel secondary signals in known BMI loci and pinpointed two likely causal variants in the POC5 and DMD loci. Our work demonstrates the benefits of combining WGS and diverse cohorts in expanding current catalog of variants and genes confer risk for obesity, bringing us one step closer to personalized medicine.

8.
Metabolomics ; 19(8): 72, 2023 08 09.
Artículo en Inglés | MEDLINE | ID: mdl-37558891

RESUMEN

CONTEXT: Insulin resistance is associated with multiple complex diseases; however, precise measures of insulin resistance are invasive, expensive, and time-consuming. OBJECTIVE: Develop estimation models for measures of insulin resistance, including insulin sensitivity index (SI) and homeostatic model assessment of insulin resistance (HOMA-IR) from metabolomics data. DESIGN: Insulin Resistance Atherosclerosis Family Study (IRASFS). SETTING: Community based. PARTICIPANTS: Mexican Americans (MA) and African Americans (AA). MAIN OUTCOME: Estimation models for measures of insulin resistance, i.e. SI and HOMA-IR. RESULTS: Least Absolute Shrinkage and Selection Operator (LASSO) and Elastic Net regression were used to build insulin resistance estimation models from 1274 metabolites combined with clinical data, e.g. age, sex, body mass index (BMI). Metabolite data were transformed using three approaches, i.e. inverse normal transformation, standardization, and Box Cox transformation. The analysis was performed in one MA recruitment site (San Luis Valley, Colorado (SLV); N = 450) and tested in another MA recruitment site (San Antonio, Texas (SA); N = 473). In addition, the two MA recruitment sites were combined and estimation models tested in the AA recruitment sample (Los Angeles, California; N = 495). Estimated and empiric SI were correlated in the SA (r2 = 0.77) and AA (r2 = 0.74) testing datasets. Further, estimated and empiric SI were consistently associated with BMI, low-density lipoprotein cholesterol (LDL), and triglycerides. We applied similar approaches to estimate HOMA-IR with similar results. CONCLUSIONS: We have developed a method for estimating insulin resistance with metabolomics data that has the potential for application to a wide range of biomedical studies and conditions.


Asunto(s)
Aterosclerosis , Resistencia a la Insulina , Humanos , Metabolómica , Aterosclerosis/metabolismo
9.
medRxiv ; 2023 Aug 16.
Artículo en Inglés | MEDLINE | ID: mdl-37645892

RESUMEN

Background: The CCL2/CCR2 axis governs monocyte trafficking and recruitment to atherosclerotic lesions. Human genetic analyses and population-based studies support an association between circulating CCL2 levels and atherosclerosis. Still, it remains unknown whether pharmacological targeting of CCR2, the main CCL2 receptor, would provide protection against human atherosclerotic disease. Methods: In whole-exome sequencing data from 454,775 UK Biobank participants (40-69 years), we identified predicted loss-of-function (LoF) or damaging missense (REVEL score >0.5) variants within the CCR2 gene. We prioritized variants associated with lower monocyte count (p<0.05) and tested associations with vascular risk factors and risk of atherosclerotic disease over a mean follow-up of 14 years. The results were replicated in a pooled cohort of three independent datasets (TOPMed, deCODE and Penn Medicine BioBank; total n=441,445) and the effect of the most frequent damaging variant was experimentally validated. Results: A total of 45 predicted LoF or damaging missense variants were identified in the CCR2 gene, 4 of which were also significantly associated with lower monocyte count, but not with other white blood cell counts. Heterozygous carriers of these variants were at a lower risk of a combined atherosclerosis outcome, showed a lower burden of atherosclerosis across four vascular beds, and were at a lower lifetime risk of coronary artery disease and myocardial infarction. There was no evidence of association with vascular risk factors including LDL-cholesterol, blood pressure, glycemic status, or C-reactive protein. Using a cAMP assay, we found that cells transfected with the most frequent CCR2 damaging variant (3:46358273:T:A, M249K, 547 carriers, frequency: 0.14%) show a decrease in signaling in response to CCL2. The associations of the M249K variant with myocardial infarction were consistent across cohorts (ORUKB: 0.62 95%CI: 0.39-0.96; ORexternal: 0.64 95%CI: 0.34-1.19; ORpooled: 0.64 95%CI: 0.450.90). In a phenome-wide association study, we found no evidence for higher risk of common infections or mortality among carriers of damaging CCR2 variants. Conclusions: Heterozygous carriers of damaging CCR2 variants have a lower burden of atherosclerosis and lower lifetime risk of myocardial infarction. In conjunction with previous evidence from experimental and epidemiological studies, our findings highlight the translational potential of CCR2-targeting as an atheroprotective approach.

10.
medRxiv ; 2023 Jun 29.
Artículo en Inglés | MEDLINE | ID: mdl-37425772

RESUMEN

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions. Large-scale whole genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess the associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with blood lipid levels (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare variant aggregate association tests using the STAAR (variant-Set Test for Association using Annotation infoRmation) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare coding variants in nearby protein coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500 kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variations and rare protein coding variations at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNA, implicating new therapeutic opportunities.

11.
Sci Adv ; 9(17): eabm4945, 2023 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-37126548

RESUMEN

Nononcogenic somatic mutations are thought to be uncommon and inconsequential. To test this, we analyzed 43,693 National Heart, Lung and Blood Institute Trans-Omics for Precision Medicine blood whole genomes from 37 cohorts and identified 7131 non-missense somatic mutations that are recurrently mutated in at least 50 individuals. These recurrent non-missense somatic mutations (RNMSMs) are not clearly explained by other clonal phenomena such as clonal hematopoiesis. RNMSM prevalence increased with age, with an average 50-year-old having 27 RNMSMs. Inherited germline variation associated with RNMSM acquisition. These variants were found in genes involved in adaptive immune function, proinflammatory cytokine production, and lymphoid lineage commitment. In addition, the presence of eight specific RNMSMs associated with blood cell traits at effect sizes comparable to Mendelian genetic mutations. Overall, we found that somatic mutations in blood are an unexpectedly common phenomenon with ancestry-specific determinants and human health consequences.


Asunto(s)
Mutación de Línea Germinal , Hematopoyesis , Humanos , Persona de Mediana Edad , Mutación , Mutación Missense , Fenotipo
12.
Nature ; 616(7958): 755-763, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-37046083

RESUMEN

Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis1. These lesions are precursors for blood cancers2-6, but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.


Asunto(s)
Hematopoyesis Clonal , Células Madre Hematopoyéticas , Animales , Humanos , Ratones , Alelos , Hematopoyesis Clonal/genética , Estudio de Asociación del Genoma Completo , Hematopoyesis/genética , Células Madre Hematopoyéticas/citología , Células Madre Hematopoyéticas/metabolismo , Mutación , Regiones Promotoras Genéticas
13.
Metabolomics ; 19(4): 35, 2023 04 02.
Artículo en Inglés | MEDLINE | ID: mdl-37005925

RESUMEN

INTRODUCTION: African Americans are at increased risk for type 2 diabetes. OBJECTIVES: This work aimed to examine metabolomic signature of glucose homeostasis in African Americans. METHODS: We used an untargeted liquid chromatography-mass spectrometry metabolomic approach to comprehensively profile 727 plasma metabolites among 571 African Americans from the Insulin Resistance Atherosclerosis Family Study (IRAS-FS) and investigate the associations between these metabolites and both the dynamic (SI, insulin sensitivity; AIR, acute insulin response; DI, disposition index; and SG, glucose effectiveness) and basal (HOMA-IR and HOMA-B) measures of glucose homeostasis using univariate and regularized regression models. We also compared the results with our previous findings in the IRAS-FS Mexican Americans. RESULTS: We confirmed increased plasma metabolite levels of branched-chain amino acids and their metabolic derivatives, 2-aminoadipate, 2-hydroxybutyrate, glutamate, arginine and its metabolic derivatives, carbohydrate metabolites, and medium- and long-chain fatty acids were associated with insulin resistance, while increased plasma metabolite levels in the glycine, serine and threonine metabolic pathway were associated with insulin sensitivity. We also observed a differential ancestral effect of glutamate on glucose homeostasis with significantly stronger effects observed in African Americans than those previously observed in Mexican Americans. CONCLUSION: We extended the observations that metabolites are useful biomarkers in the identification of prediabetes in individuals at risk of type 2 diabetes in African Americans. We revealed, for the first time, differential ancestral effect of certain metabolites (i.e., glutamate) on glucose homeostasis traits. Our study highlights the need for additional comprehensive metabolomic studies in well-characterized multiethnic cohorts.


Asunto(s)
Aterosclerosis , Diabetes Mellitus Tipo 2 , Resistencia a la Insulina , Humanos , Aterosclerosis/metabolismo , Negro o Afroamericano , Diabetes Mellitus Tipo 2/metabolismo , Glucosa , Glutamatos , Homeostasis/fisiología , Metabolómica
14.
Meteorit Planet Sci ; 58(1): 41-62, 2023 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-37082523

RESUMEN

Askival is a light-toned, coarsely crystalline float rock, which was identified near the base of Vera Rubin Ridge in Gale crater. We have studied Askival, principally with the ChemCam instrument but also using APXS compositional data and MAHLI images. Askival and an earlier identified sample, Bindi, represent two rare examples of feldspathic cumulate float rocks in Gale crater with >65% relict plagioclase. Bindi appears unaltered whereas Askival shows textural and compositional signatures of silicification, along with alkali remobilization and hydration. Askival likely experienced multiple stages of alteration, occurring first through acidic hydrolysis of metal cations, followed by deposition of silica and possible phyllosilicates at low T and neutral-alkaline pH. Through laser-induced breakdown spectroscopy compositional analyses and normative calculations, we suggest that an assemblage of Fe-Mg silicates including amphibole and pyroxene, Fe phases, and possibly Mg-rich phyllosilicate are present. Thermodynamic modeling of the more pristine Bindi composition predicts that amphibole and feldspar are stable within an upper crustal setting. This is consistent with the presence of amphibole in the parent igneous rocks of Askival and suggests that the paucity of amphiboles in other known Martian samples reflects the lack of representative samples of the Martian crust rather than their absence on Mars.

15.
Res Sq ; 2023 Feb 03.
Artículo en Inglés | MEDLINE | ID: mdl-36778386

RESUMEN

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hematologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

16.
bioRxiv ; 2023 Jan 25.
Artículo en Inglés | MEDLINE | ID: mdl-36747810

RESUMEN

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hemotologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

17.
Nat Genet ; 55(1): 154-164, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36564505

RESUMEN

Meta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples.


Asunto(s)
Estudio de Asociación del Genoma Completo , Lípidos , Estudio de Asociación del Genoma Completo/métodos , Secuenciación Completa del Genoma/métodos , Secuenciación del Exoma , Fenotipo , Lípidos/genética
18.
Nature ; 612(7941): 720-724, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36477530

RESUMEN

Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury1-4. These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries5. Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.


Asunto(s)
Consumo de Bebidas Alcohólicas , Predisposición Genética a la Enfermedad , Variación Genética , Internacionalidad , Herencia Multifactorial , Uso de Tabaco , Humanos , Predisposición Genética a la Enfermedad/genética , Variación Genética/genética , Estudio de Asociación del Genoma Completo/métodos , Herencia Multifactorial/genética , Factores de Riesgo , Uso de Tabaco/genética , Consumo de Bebidas Alcohólicas/genética , Transcriptoma , Tamaño de la Muestra , Sitios Genéticos/genética , Europa (Continente)/etnología
19.
Nat Methods ; 19(12): 1599-1611, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36303018

RESUMEN

Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.


Asunto(s)
Estudio de Asociación del Genoma Completo , Genoma , Humanos , Estudio de Asociación del Genoma Completo/métodos , Secuenciación Completa del Genoma/métodos , Fenotipo , Variación Genética
20.
Nat Hum Behav ; 6(11): 1577-1586, 2022 11.
Artículo en Inglés | MEDLINE | ID: mdl-35927319

RESUMEN

Common genetic variants explain less variation in complex phenotypes than inferred from family-based studies, and there is a debate on the source of this 'missing heritability'. We investigated the contribution of rare genetic variants to tobacco use with whole-genome sequences from up to 26,257 unrelated individuals of European ancestries and 11,743 individuals of African ancestries. Across four smoking traits, single-nucleotide-polymorphism-based heritability ([Formula: see text]) was estimated from 0.13 to 0.28 (s.e., 0.10-0.13) in European ancestries, with 35-74% of it attributable to rare variants with minor allele frequencies between 0.01% and 1%. These heritability estimates are 1.5-4 times higher than past estimates based on common variants alone and accounted for 60% to 100% of our pedigree-based estimates of narrow-sense heritability ([Formula: see text], 0.18-0.34). In the African ancestry samples, [Formula: see text] was estimated from 0.03 to 0.33 (s.e., 0.09-0.14) across the four smoking traits. These results suggest that rare variants are important contributors to the heritability of smoking.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Frecuencia de los Genes , Polimorfismo de Nucleótido Simple/genética , Fenotipo , Fumar/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA