Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 64
Filter
Add more filters

Publication year range
1.
Am J Hum Genet ; 110(11): 1853-1862, 2023 11 02.
Article in English | MEDLINE | ID: mdl-37875120

ABSTRACT

The heritability explained by local ancestry markers in an admixed population (hγ2) provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of hγ2 can be susceptible to biases due to population structure in ancestral populations. Here, we present heritability estimation from admixture mapping summary statistics (HAMSTA), an approach that uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA hγ2 estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of ∼5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe hˆγ2 in the 20 phenotypes range from 0.0025 to 0.033 (mean hˆγ2 = 0.012 ± 9.2 × 10-4), which translates to hˆ2 ranging from 0.062 to 0.85 (mean hˆ2 = 0.30 ± 0.023). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 ± 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.


Subject(s)
Black or African American , Genetics, Population , Humans , Chromosome Mapping , Phenotype , Polymorphism, Single Nucleotide/genetics
2.
Proc Natl Acad Sci U S A ; 120(1): e2207544120, 2023 01 03.
Article in English | MEDLINE | ID: mdl-36574663

ABSTRACT

A growing body of work has addressed human adaptations to diverse environments using genomic data, but few studies have connected putatively selected alleles to phenotypes, much less among underrepresented populations such as Amerindians. Studies of natural selection and genotype-phenotype relationships in underrepresented populations hold potential to uncover previously undescribed loci underlying evolutionarily and biomedically relevant traits. Here, we worked with the Tsimane and the Moseten, two Amerindian populations inhabiting the Bolivian lowlands. We focused most intensively on the Tsimane, because long-term anthropological work with this group has shown that they have a high burden of both macro and microparasites, as well as minimal cardiometabolic disease or dementia. We therefore generated genome-wide genotype data for Tsimane individuals to study natural selection, and paired this with blood mRNA-seq as well as cardiometabolic and immune biomarker data generated from a larger sample that included both populations. In the Tsimane, we identified 21 regions that are candidates for selective sweeps, as well as 5 immune traits that show evidence for polygenic selection (e.g., C-reactive protein levels and the response to coronaviruses). Genes overlapping candidate regions were strongly enriched for known involvement in immune-related traits, such as abundance of lymphocytes and eosinophils. Importantly, we were also able to draw on extensive phenotype information for the Tsimane and Moseten and link five regions (containing PSD4, MUC21 and MUC22, TOX2, ANXA6, and ABCA1) with biomarkers of immune and metabolic function. Together, our work highlights the utility of pairing evolutionary analyses with anthropological and biomedical data to gain insight into the genetic basis of health-related traits.


Subject(s)
Genetics, Population , Health Status , Humans , Biomarkers , Bolivia , Genomics , Genotype , Phenotype , Polymorphism, Single Nucleotide , Selection, Genetic , Genome, Human
3.
Am J Hum Genet ; 109(4): 669-679, 2022 04 07.
Article in English | MEDLINE | ID: mdl-35263625

ABSTRACT

One mechanism by which genetic factors influence complex traits and diseases is altering gene expression. Direct measurement of gene expression in relevant tissues is rarely tenable; however, genetically regulated gene expression (GReX) can be estimated using prediction models derived from large multi-omic datasets. These approaches have led to the discovery of many gene-trait associations, but whether models derived from predominantly European ancestry (EA) reference panels can map novel associations in ancestrally diverse populations remains unclear. We applied PrediXcan to impute GReX in 51,520 ancestrally diverse Population Architecture using Genomics and Epidemiology (PAGE) participants (35% African American, 45% Hispanic/Latino, 10% Asian, and 7% Hawaiian) across 25 key cardiometabolic traits and relevant tissues to identify 102 novel associations. We then compared associations in PAGE to those in a random subset of 50,000 White British participants from UK Biobank (UKBB50k) for height and body mass index (BMI). We identified 517 associations across 47 tissues in PAGE but not UKBB50k, demonstrating the importance of diverse samples in identifying trait-associated GReX. We observed that variants used in PrediXcan models were either more or less differentiated across continental-level populations than matched-control variants depending on the specific population reflecting sampling bias. Additionally, variants from identified genes specific to either PAGE or UKBB50k analyses were more ancestrally differentiated than those in genes detected in both analyses, underlining the value of population-specific discoveries. This suggests that while EA-derived transcriptome imputation models can identify new associations in non-EA populations, models derived from closely matched reference panels may yield further insights. Our findings call for more diversity in reference datasets of tissue-specific gene expression.


Subject(s)
Cardiovascular Diseases , Genome-Wide Association Study , Genetic Predisposition to Disease , Humans , Life Style , Polymorphism, Single Nucleotide , Transcriptome
4.
Nature ; 570(7762): 514-518, 2019 06.
Article in English | MEDLINE | ID: mdl-31217584

ABSTRACT

Genome-wide association studies (GWAS) have laid the foundation for investigations into the biology of complex traits, drug development and clinical guidelines. However, the majority of discovery efforts are based on data from populations of European ancestry1-3. In light of the differential genetic architecture that is known to exist between populations, bias in representation can exacerbate existing disease and healthcare disparities. Critical variants may be missed if they have a low frequency or are completely absent in European populations, especially as the field shifts its attention towards rare variants, which are more likely to be population-specific4-10. Additionally, effect sizes and their derived risk prediction scores derived in one population may not accurately extrapolate to other populations11,12. Here we demonstrate the value of diverse, multi-ethnic participants in large-scale genomic studies. The Population Architecture using Genomics and Epidemiology (PAGE) study conducted a GWAS of 26 clinical and behavioural phenotypes in 49,839 non-European individuals. Using strategies tailored for analysis of multi-ethnic and admixed populations, we describe a framework for analysing diverse populations, identify 27 novel loci and 38 secondary signals at known loci, as well as replicate 1,444 GWAS catalogue associations across these traits. Our data show evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications. In the United States-where minority populations have a disproportionately higher burden of chronic conditions13-the lack of representation of diverse populations in genetic research will result in inequitable access to precision medicine for those with the highest burden of disease. We strongly advocate for continued, large genome-wide efforts in diverse populations to maximize genetic discovery and reduce health disparities.


Subject(s)
Asian People/genetics , Black People/genetics , Genome-Wide Association Study/methods , Hispanic or Latino/genetics , Minority Groups , Multifactorial Inheritance/genetics , Women's Health , Body Height/genetics , Cohort Studies , Female , Genetics, Medical/methods , Health Equity/trends , Health Status Disparities , Humans , Male , United States
5.
Am J Hum Genet ; 108(10): 1836-1851, 2021 10 07.
Article in English | MEDLINE | ID: mdl-34582791

ABSTRACT

Many common and rare variants associated with hematologic traits have been discovered through imputation on large-scale reference panels. However, the majority of genome-wide association studies (GWASs) have been conducted in Europeans, and determining causal variants has proved challenging. We performed a GWAS of total leukocyte, neutrophil, lymphocyte, monocyte, eosinophil, and basophil counts generated from 109,563,748 variants in the autosomes and the X chromosome in the Trans-Omics for Precision Medicine (TOPMed) program, which included data from 61,802 individuals of diverse ancestry. We discovered and replicated 7 leukocyte trait associations, including (1) the association between a chromosome X, pseudo-autosomal region (PAR), noncoding variant located between cytokine receptor genes (CSF2RA and CLRF2) and lower eosinophil count; and (2) associations between single variants found predominantly among African Americans at the S1PR3 (9q22.1) and HBB (11p15.4) loci and monocyte and lymphocyte counts, respectively. We further provide evidence indicating that the newly discovered eosinophil-lowering chromosome X PAR variant might be associated with reduced susceptibility to common allergic diseases such as atopic dermatitis and asthma. Additionally, we found a burden of very rare FLT3 (13q12.2) variants associated with monocyte counts. Together, these results emphasize the utility of whole-genome sequencing in diverse samples in identifying associations missed by European-ancestry-driven GWASs.


Subject(s)
Asthma/epidemiology , Biomarkers/metabolism , Dermatitis, Atopic/epidemiology , Leukocytes/pathology , Polymorphism, Single Nucleotide , Pulmonary Disease, Chronic Obstructive/epidemiology , Quantitative Trait Loci , Asthma/genetics , Asthma/metabolism , Asthma/pathology , Dermatitis, Atopic/genetics , Dermatitis, Atopic/metabolism , Dermatitis, Atopic/pathology , Genetic Predisposition to Disease , Genome, Human , Genome-Wide Association Study , Humans , National Heart, Lung, and Blood Institute (U.S.) , Phenotype , Prognosis , Proteome/analysis , Proteome/metabolism , Pulmonary Disease, Chronic Obstructive/genetics , Pulmonary Disease, Chronic Obstructive/metabolism , Pulmonary Disease, Chronic Obstructive/pathology , United Kingdom/epidemiology , United States/epidemiology , Whole Genome Sequencing
6.
Bioinformatics ; 39(1)2023 01 01.
Article in English | MEDLINE | ID: mdl-36413071

ABSTRACT

SUMMARY: Genomic data are often processed in batches and analyzed together to save time. However, it is challenging to combine multiple large VCFs and properly handle imputation quality and missing variants due to the limitations of available tools. To address these concerns, we developed IMMerge, a Python-based tool that takes advantage of multiprocessing to reduce running time. For the first time in a publicly available tool, imputation quality scores are correctly combined with Fisher's z transformation. AVAILABILITY AND IMPLEMENTATION: IMMerge is an open-source project under MIT license. Source code and user manual are available at https://github.com/belowlab/IMMerge.


Subject(s)
Genome , Genomics , Software
7.
Hum Mol Genet ; 30(15): 1371-1383, 2021 07 09.
Article in English | MEDLINE | ID: mdl-33949650

ABSTRACT

Genome-wide association studies have been successful mapping loci for individual phenotypes, but few studies have comprehensively interrogated evidence of shared genetic effects across multiple phenotypes simultaneously. Statistical methods have been proposed for analyzing multiple phenotypes using summary statistics, which enables studies of shared genetic effects while avoiding challenges associated with individual-level data sharing. Adaptive tests have been developed to maintain power against multiple alternative hypotheses because the most powerful single-alternative test depends on the underlying structure of the associations between the multiple phenotypes and a single nucleotide polymorphism (SNP). Here we compare the performance of six such adaptive tests: two adaptive sum of powered scores (aSPU) tests, the unified score association test (metaUSAT), the adaptive test in a mixed-models framework (mixAda) and two principal-component-based adaptive tests (PCAQ and PCO). Our simulations highlight practical challenges that arise when multivariate distributions of phenotypes do not satisfy assumptions of multivariate normality. Previous reports in this context focus on low minor allele count (MAC) and omit the aSPU test, which relies less than other methods on asymptotic and distributional assumptions. When these assumptions are not satisfied, particularly when MAC is low and/or phenotype covariance matrices are singular or nearly singular, aSPU better preserves type I error, sometimes at the cost of decreased power. We illustrate this trade-off with multiple phenotype analyses of six quantitative electrocardiogram traits in the Population Architecture using Genomics and Epidemiology (PAGE) study.


Subject(s)
Genetic Association Studies/methods , Genome-Wide Association Study/methods , Phenotype , Alleles , Computer Simulation , Genotype , Humans , Models, Genetic , Polymorphism, Single Nucleotide/genetics
8.
Hum Mol Genet ; 30(22): 2190-2204, 2021 11 01.
Article in English | MEDLINE | ID: mdl-34165540

ABSTRACT

Central obesity is a leading health concern with a great burden carried by ethnic minority populations, especially Hispanics/Latinos. Genetic factors contribute to the obesity burden overall and to inter-population differences. We aimed to identify the loci associated with central adiposity measured as waist-to-hip ratio (WHR), waist circumference (WC) and hip circumference (HIP) adjusted for body mass index (adjBMI) by using the Hispanic Community Health Study/Study of Latinos (HCHS/SOL); determine if differences in associations differ by background group within HCHS/SOL and determine whether previously reported associations generalize to HCHS/SOL. Our analyses included 7472 women and 5200 men of mainland (Mexican, Central and South American) and Caribbean (Puerto Rican, Cuban and Dominican) background residing in the USA. We performed genome-wide association analyses stratified and combined across sexes using linear mixed-model regression. We identified 16 variants for waist-to-hip ratio adjusted for body mass index (WHRadjBMI), 22 for waist circumference adjusted for body mass index (WCadjBMI) and 28 for hip circumference adjusted for body mass index (HIPadjBMI), which reached suggestive significance (P < 1 × 10-6). Many loci exhibited differences in strength of associations by ethnic background and sex. We brought a total of 66 variants forward for validation in cohorts (N = 34 161) with participants of Hispanic/Latino, African and European descent. We confirmed four novel loci (P < 0.05 and consistent direction of effect, and P < 5 × 10-8 after meta-analysis), including two for WHRadjBMI (rs13301996, rs79478137); one for WCadjBMI (rs3168072) and one for HIPadjBMI (rs28692724). Also, we generalized previously reported associations to HCHS/SOL, (8 for WHRadjBMI, 10 for WCadjBMI and 12 for HIPadjBMI). Our study highlights the importance of large-scale genomic studies in ancestrally diverse Hispanic/Latino populations for identifying and characterizing central obesity susceptibility that may be ancestry-specific.


Subject(s)
Adiposity/genetics , Body Fat Distribution , Genome-Wide Association Study , Hispanic or Latino/genetics , Quantitative Trait, Heritable , Alleles , Humans , Polymorphism, Single Nucleotide
9.
Hum Genet ; 142(10): 1477-1489, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37658231

ABSTRACT

Inadequate representation of non-European ancestry populations in genome-wide association studies (GWAS) has limited opportunities to isolate functional variants. Fine-mapping in multi-ancestry populations should improve the efficiency of prioritizing variants for functional interrogation. To evaluate this hypothesis, we leveraged ancestry architecture to perform comparative GWAS and fine-mapping of obesity-related phenotypes in European ancestry populations from the UK Biobank (UKBB) and multi-ancestry samples from the Population Architecture for Genetic Epidemiology (PAGE) consortium with comparable sample sizes. In the investigated regions with genome-wide significant associations for obesity-related traits, fine-mapping in our ancestrally diverse sample led to 95% and 99% credible sets (CS) with fewer variants than in the European ancestry sample. Lead fine-mapped variants in PAGE regions had higher average coding scores, and higher average posterior probabilities for causality compared to UKBB. Importantly, 99% CS in PAGE loci contained strong expression quantitative trait loci (eQTLs) in adipose tissues or harbored more variants in tighter linkage disequilibrium (LD) with eQTLs. Leveraging ancestrally diverse populations with heterogeneous ancestry architectures, coupled with functional annotation, increased fine-mapping efficiency and performance, and reduced the set of candidate variants for consideration for future functional studies. Significant overlap in genetic causal variants across populations suggests generalizability of genetic mechanisms underpinning obesity-related traits across populations.


Subject(s)
Genome-Wide Association Study , Obesity , Humans , Molecular Epidemiology , Linkage Disequilibrium , Obesity/genetics , Quantitative Trait Loci/genetics
10.
Cardiovasc Diabetol ; 22(1): 231, 2023 08 31.
Article in English | MEDLINE | ID: mdl-37653519

ABSTRACT

BACKGROUND: Adipokines are hormones secreted from adipose tissue and are associated with cardiometabolic diseases (CMD). Functional differences between adipokines (leptin, adiponectin, and resistin) are known, but inconsistently reported associations with CMD and lack of studies in Hispanic populations are research gaps. We investigated the relationship between subclinical atherosclerosis and multiple adipokine measures. METHODS: Cross-sectional data from the Cameron County Hispanic Cohort (N = 624; mean age = 50; Female = 70.8%) were utilized to assess associations between adipokines [continuous measures of adiponectin, leptin, resistin, leptin-to-adiponectin ratio (LAR), and adiponectin-resistin index (ARI)] and early atherosclerosis [carotid-intima media thickness (cIMT)]. We adjusted for sex, age, body mass index (BMI), smoking status, cytokines, fasting blood glucose levels, blood pressure, lipid levels, and medication usage in the fully adjusted linear regression model. We conducted sexes-combined and sex-stratified analyses to account for sex-specificity and additionally tested whether stratification of participants by their metabolic status (metabolically elevated risk for CMD as defined by having two or more of the following conditions: hypertension, dyslipidemia, insulin resistance, and inflammation vs. not) influenced the relationship between adipokines and cIMT. RESULTS: In the fully adjusted analyses, adiponectin, leptin, and LAR displayed significant interaction by sex (p < 0.1). Male-specific associations were between cIMT and LAR [ß(SE) = 0.060 (0.016), p = 2.52 × 10-4], and female-specific associations were between cIMT and adiponectin [ß(SE) = 0.010 (0.005), p = 0.043] and ARI [ß(SE) = - 0.011 (0.005), p = 0.036]. When stratified by metabolic health status, the male-specific positive association between LAR and cIMT was more evident among the metabolically healthy group [ß(SE) = 0.127 (0.015), p = 4.70 × 10-10] (p for interaction by metabolic health < 0.1). However, the female-specific associations between adiponectin and cIMT and ARI and cIMT were observed only among the metabolically elevated risk group [ß(SE) = 0.014 (0.005), p = 0.012 for adiponectin; ß(SE) = - 0.015 (0.006), p = 0.013 for ARI; p for interaction by metabolic health < 0.1]. CONCLUSION: Associations between adipokines and cIMT were sex-specific, and metabolic health status influenced the relationships between adipokines and cIMT. These heterogeneities by sex and metabolic health affirm the complex relationships between adipokines and atherosclerosis.


Subject(s)
Adipokines , Atherosclerosis , Female , Male , Humans , Middle Aged , Leptin , Resistin , Adiponectin , Carotid Intima-Media Thickness , Cross-Sectional Studies , Hispanic or Latino
11.
Nature ; 542(7640): 186-190, 2017 02 09.
Article in English | MEDLINE | ID: mdl-28146470

ABSTRACT

Height is a highly heritable, classic polygenic trait with approximately 700 common associated variants identified through genome-wide association studies so far. Here, we report 83 height-associated coding variants with lower minor-allele frequencies (in the range of 0.1-4.8%) and effects of up to 2 centimetres per allele (such as those in IHH, STC2, AR and CRISPLD2), greater than ten times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (giving an increase of 1-2 centimetres per allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes that are mutated in monogenic growth disorders and highlight new biological candidates (such as ADAMTS3, IL11RA and NOX4) and pathways (such as proteoglycan and glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate-to-large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.


Subject(s)
Body Height/genetics , Gene Frequency/genetics , Genetic Variation/genetics , ADAMTS Proteins/genetics , Adult , Alleles , Cell Adhesion Molecules/genetics , Female , Genome, Human/genetics , Glycoproteins/genetics , Glycoproteins/metabolism , Glycosaminoglycans/biosynthesis , Hedgehog Proteins/genetics , Humans , Intercellular Signaling Peptides and Proteins/genetics , Intercellular Signaling Peptides and Proteins/metabolism , Interferon Regulatory Factors/genetics , Interleukin-11 Receptor alpha Subunit/genetics , Male , Multifactorial Inheritance/genetics , NADPH Oxidase 4 , NADPH Oxidases/genetics , Phenotype , Pregnancy-Associated Plasma Protein-A/metabolism , Procollagen N-Endopeptidase/genetics , Proteoglycans/biosynthesis , Proteolysis , Receptors, Androgen/genetics , Somatomedins/metabolism
12.
PLoS Genet ; 16(3): e1008684, 2020 03.
Article in English | MEDLINE | ID: mdl-32226016

ABSTRACT

Lipid levels are important markers for the development of cardio-metabolic diseases. Although hundreds of associated loci have been identified through genetic association studies, the contribution of genetic factors to variation in lipids is not fully understood, particularly in U.S. minority groups. We performed genome-wide association analyses for four lipid traits in over 45,000 ancestrally diverse participants from the Population Architecture using Genomics and Epidemiology (PAGE) Study, followed by a meta-analysis with several European ancestry studies. We identified nine novel lipid loci, five of which showed evidence of replication in independent studies. Furthermore, we discovered one novel gene in a PrediXcan analysis, minority-specific independent signals at eight previously reported loci, and potential functional variants at two known loci through fine-mapping. Systematic examination of known lipid loci revealed smaller effect estimates in African American and Hispanic ancestry populations than those in Europeans, and better performance of polygenic risk scores based on minority-specific effect estimates. Our findings provide new insight into the genetic architecture of lipid traits and highlight the importance of conducting genetic studies in diverse populations in the era of precision medicine.


Subject(s)
Lipids/blood , Lipids/genetics , Racial Groups/genetics , Databases, Genetic , Female , Genome-Wide Association Study/methods , Genotype , Humans , Lipids/analysis , Male , Metagenomics/methods , Minority Groups , Multifactorial Inheritance/genetics , Phenotype , Polymorphism, Single Nucleotide/genetics , United States/epidemiology
13.
Diabetologia ; 65(3): 477-489, 2022 03.
Article in English | MEDLINE | ID: mdl-34951656

ABSTRACT

AIMS/HYPOTHESIS: Type 2 diabetes is a growing global public health challenge. Investigating quantitative traits, including fasting glucose, fasting insulin and HbA1c, that serve as early markers of type 2 diabetes progression may lead to a deeper understanding of the genetic aetiology of type 2 diabetes development. Previous genome-wide association studies (GWAS) have identified over 500 loci associated with type 2 diabetes, glycaemic traits and insulin-related traits. However, most of these findings were based only on populations of European ancestry. To address this research gap, we examined the genetic basis of fasting glucose, fasting insulin and HbA1c in participants of the diverse Population Architecture using Genomics and Epidemiology (PAGE) Study. METHODS: We conducted a GWAS of fasting glucose (n = 52,267), fasting insulin (n = 48,395) and HbA1c (n = 23,357) in participants without diabetes from the diverse PAGE Study (23% self-reported African American, 46% Hispanic/Latino, 40% European, 4% Asian, 3% Native Hawaiian, 0.8% Native American), performing transethnic and population-specific GWAS meta-analyses, followed by fine-mapping to identify and characterise novel loci and independent secondary signals in known loci. RESULTS: Four novel associations were identified (p < 5 × 10-9), including three loci associated with fasting insulin, and a novel, low-frequency African American-specific locus associated with fasting glucose. Additionally, seven secondary signals were identified, including novel independent secondary signals for fasting glucose at the known GCK locus and for fasting insulin at the known PPP1R3B locus in transethnic meta-analysis. CONCLUSIONS/INTERPRETATION: Our findings provide new insights into the genetic architecture of glycaemic traits and highlight the continued importance of conducting genetic studies in diverse populations. DATA AVAILABILITY: Full summary statistics from each of the population-specific and transethnic results are available at NHGRI-EBI GWAS catalog ( https://www.ebi.ac.uk/gwas/downloads/summary-statistics ).


Subject(s)
Diabetes Mellitus, Type 2 , Genome-Wide Association Study , Blood Glucose/genetics , Diabetes Mellitus, Type 2/genetics , Genome-Wide Association Study/methods , Genomics , Humans , Polymorphism, Single Nucleotide/genetics
14.
Am J Hum Genet ; 105(4): 706-718, 2019 10 03.
Article in English | MEDLINE | ID: mdl-31564435

ABSTRACT

Hemoglobin A1c (HbA1c) is widely used to diagnose diabetes and assess glycemic control in individuals with diabetes. However, nonglycemic determinants, including genetic variation, may influence how accurately HbA1c reflects underlying glycemia. Analyzing the NHLBI Trans-Omics for Precision Medicine (TOPMed) sequence data in 10,338 individuals from five studies and four ancestries (6,158 Europeans, 3,123 African-Americans, 650 Hispanics, and 407 East Asians), we confirmed five regions associated with HbA1c (GCK in Europeans and African-Americans, HK1 in Europeans and Hispanics, FN3K and/or FN3KRP in Europeans, and G6PD in African-Americans and Hispanics) and we identified an African-ancestry-specific low-frequency variant (rs1039215 in HBG2 and HBE1, minor allele frequency (MAF) = 0.03). The most associated G6PD variant (rs1050828-T, p.Val98Met, MAF = 12% in African-Americans, MAF = 2% in Hispanics) lowered HbA1c (-0.88% in hemizygous males, -0.34% in heterozygous females) and explained 23% of HbA1c variance in African-Americans and 4% in Hispanics. Additionally, we identified a rare distinct G6PD coding variant (rs76723693, p.Leu353Pro, MAF = 0.5%; -0.98% in hemizygous males, -0.46% in heterozygous females) and detected significant association with HbA1c when aggregating rare missense variants in G6PD. We observed similar magnitude and direction of effects for rs1039215 (HBG2) and rs76723693 (G6PD) in the two largest TOPMed African American cohorts, and we replicated the rs76723693 association in the UK Biobank African-ancestry participants. These variants in G6PD and HBG2 were monomorphic in the European and Asian samples. African or Hispanic ancestry individuals carrying G6PD variants may be underdiagnosed for diabetes when screened with HbA1c. Thus, assessment of these variants should be considered for incorporation into precision medicine approaches for diabetes diagnosis.


Subject(s)
Diabetes Mellitus/diagnosis , Diabetes Mellitus/genetics , Genetic Variation , Glycated Hemoglobin/genetics , Population Groups/genetics , Precision Medicine , Cohort Studies , Female , Humans , Male , Polymorphism, Single Nucleotide
15.
Am J Hum Genet ; 105(1): 15-28, 2019 07 03.
Article in English | MEDLINE | ID: mdl-31178129

ABSTRACT

Circulating levels of adiponectin, an adipocyte-secreted protein associated with cardiovascular and metabolic risk, are highly heritable. To gain insights into the biology that regulates adiponectin levels, we performed an exome array meta-analysis of 265,780 genetic variants in 67,739 individuals of European, Hispanic, African American, and East Asian ancestry. We identified 20 loci associated with adiponectin, including 11 that had been reported previously (p < 2 × 10-7). Comparison of exome array variants to regional linkage disequilibrium (LD) patterns and prior genome-wide association study (GWAS) results detected candidate variants (r2 > .60) spanning as much as 900 kb. To identify potential genes and mechanisms through which the previously unreported association signals act to affect adiponectin levels, we assessed cross-trait associations, expression quantitative trait loci in subcutaneous adipose, and biological pathways of nearby genes. Eight of the nine loci were also associated (p < 1 × 10-4) with at least one obesity or lipid trait. Candidate genes include PRKAR2A, PTH1R, and HDAC9, which have been suggested to play roles in adipocyte differentiation or bone marrow adipose tissue. Taken together, these findings provide further insights into the processes that influence circulating adiponectin levels.


Subject(s)
Adiponectin/genetics , Adipose Tissue/pathology , Exome/genetics , Genetic Predisposition to Disease , Lipids/analysis , Obesity/etiology , Polymorphism, Single Nucleotide , Adipose Tissue/metabolism , Adolescent , Adult , Black or African American/genetics , Aged , Aged, 80 and over , Female , Hispanic or Latino/genetics , Humans , Male , Middle Aged , Obesity/pathology , Phenotype , Quantitative Trait Loci , White People/genetics , Young Adult
16.
Circ Res ; 126(12): 1816-1840, 2020 06 05.
Article in English | MEDLINE | ID: mdl-32496918

ABSTRACT

Genome-wide association studies have revolutionized our understanding of the genetic underpinnings of cardiometabolic disease. Yet, the inadequate representation of individuals of diverse ancestral backgrounds in these studies may undercut their ultimate potential for both public health and precision medicine. The goal of this review is to describe the imperativeness of studying the populations who are most affected by cardiometabolic disease, to the aim of better understanding the genetic underpinnings of the disease. We support this premise by describing the current variation in the global burden of cardiometabolic disease and emphasize the importance of building a globally and ancestrally representative genetics evidence base for the identification of population-specific variants, fine-mapping, and polygenic risk score estimation. We discuss the important ethical, legal, and social implications of increasing ancestral diversity in genetic studies of cardiometabolic disease and the challenges that arise from the (1) lack of diversity in current reference populations and available analytic samples and the (2) unequal generation of health-associated genomic data and their prediction accuracies. Despite these challenges, we conclude that additional, unprecedented opportunities lie ahead for public health genomics and the realization of precision medicine, provided that the gap in diversity can be systematically addressed. Achieving this goal will require concerted efforts by social, academic, professional and regulatory stakeholders and communities, and these efforts must be based on principles of equity and social justice.


Subject(s)
Genome-Wide Association Study/methods , Metabolic Syndrome/genetics , Gene Frequency , Genome-Wide Association Study/standards , Humans , Metabolic Syndrome/epidemiology , Polymorphism, Genetic
17.
Nature ; 536(7614): 41-47, 2016 08 04.
Article in English | MEDLINE | ID: mdl-27398621

ABSTRACT

The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.


Subject(s)
Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Alleles , DNA Mutational Analysis , Europe/ethnology , Exome , Genome-Wide Association Study , Genotyping Techniques , Humans , Sample Size
18.
BMC Genomics ; 22(1): 432, 2021 Jun 09.
Article in English | MEDLINE | ID: mdl-34107879

ABSTRACT

BACKGROUND: Circulating white blood cell and platelet traits are clinically linked to various disease outcomes and differ across individuals and ancestry groups. Genetic factors play an important role in determining these traits and many loci have been identified. However, most of these findings were identified in populations of European ancestry (EA), with African Americans (AA), Hispanics/Latinos (HL), and other races/ethnicities being severely underrepresented. RESULTS: We performed ancestry-combined and ancestry-specific genome-wide association studies (GWAS) for white blood cell and platelet traits in the ancestrally diverse Population Architecture using Genomics and Epidemiology (PAGE) Study, including 16,201 AA, 21,347 HL, and 27,236 EA participants. We identified six novel findings at suggestive significance (P < 5E-8), which need confirmation, and independent signals at six previously established regions at genome-wide significance (P < 2E-9). We confirmed multiple previously reported genome-wide significant variants in the single variant association analysis and multiple genes using PrediXcan. Evaluation of loci reported from a Euro-centric GWAS indicated attenuation of effect estimates in AA and HL compared to EA populations. CONCLUSIONS: Our results highlighted the potential to identify ancestry-specific and ancestry-agnostic variants in participants with diverse backgrounds and advocate for continued efforts in improving inclusion of racially/ethnically diverse populations in genetic association studies for complex traits.


Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Genetic Predisposition to Disease , Genomics , Humans , Leukocytes , Phenotype
19.
Hum Mol Genet ; 28(7): 1212-1224, 2019 04 01.
Article in English | MEDLINE | ID: mdl-30624610

ABSTRACT

Interpretation of genetic association results is difficult because signals often lack biological context. To generate hypotheses of the functional genetic etiology of complex cardiometabolic traits, we estimated the genetically determined component of gene expression from common variants using PrediXcan (1) and determined genes with differential predicted expression by trait. PrediXcan imputes tissue-specific expression levels from genetic variation using variant-level effect on gene expression in transcriptome data. To explore the value of imputed genetically regulated gene expression (GReX) models across different ancestral populations, we evaluated imputed expression levels for predictive accuracy genome-wide in RNA sequence data in samples drawn from European-ancestry and African-ancestry populations and identified substantial predictive power using European-derived models in a non-European target population. We then tested the association of GReX on 15 cardiometabolic traits including blood lipid levels, body mass index, height, blood pressure, fasting glucose and insulin, RR interval, fibrinogen level, factor VII level and white blood cell and platelet counts in 15 755 individuals across three ancestry groups, resulting in 20 novel gene-phenotype associations reaching experiment-wide significance across ancestries. In addition, we identified 18 significant novel gene-phenotype associations in our ancestry-specific analyses. Top associations were assessed for additional support via query of S-PrediXcan (2) results derived from publicly available genome-wide association studies summary data. Collectively, these findings illustrate the utility of transcriptome-based imputation models for discovery of cardiometabolic effect genes in a diverse dataset.


Subject(s)
Forecasting/methods , Metabolome/genetics , Metabolome/physiology , Adult , Aged , Blood Pressure , Body Mass Index , Chromosome Mapping/methods , Ethnicity/genetics , Female , Genetic Association Studies/methods , Genome-Wide Association Study/methods , Humans , Male , Middle Aged , Multifactorial Inheritance/genetics , Phenotype , Polymorphism, Single Nucleotide/genetics , Transcriptome/genetics , White People/genetics
20.
BMC Genomics ; 21(1): 228, 2020 Mar 14.
Article in English | MEDLINE | ID: mdl-32171239

ABSTRACT

BACKGROUND: Quantitative red blood cell (RBC) traits are highly polygenic clinically relevant traits, with approximately 500 reported GWAS loci. The majority of RBC trait GWAS have been performed in European- or East Asian-ancestry populations, despite evidence that rare or ancestry-specific variation contributes substantially to RBC trait heritability. Recently developed combined-phenotype methods which leverage genetic trait correlation to improve statistical power have not yet been applied to these traits. Here we leveraged correlation of seven quantitative RBC traits in performing a combined-phenotype analysis in a multi-ethnic study population. RESULTS: We used the adaptive sum of powered scores (aSPU) test to assess combined-phenotype associations between ~ 21 million SNPs and seven RBC traits in a multi-ethnic population (maximum n = 67,885 participants; 24% African American, 30% Hispanic/Latino, and 43% European American; 76% female). Thirty-nine loci in our multi-ethnic population contained at least one significant association signal (p < 5E-9), with lead SNPs at nine loci significantly associated with three or more RBC traits. A majority of the lead SNPs were common (MAF > 5%) across all ancestral populations. Nineteen additional independent association signals were identified at seven known loci (HFE, KIT, HBS1L/MYB, CITED2/FILNC1, ABO, HBA1/2, and PLIN4/5). For example, the HBA1/2 locus contained 14 conditionally independent association signals, 11 of which were previously unreported and are specific to African and Amerindian ancestries. One variant in this region was common in all ancestries, but exhibited a narrower LD block in African Americans than European Americans or Hispanics/Latinos. GTEx eQTL analysis of all independent lead SNPs yielded 31 significant associations in relevant tissues, over half of which were not at the gene immediately proximal to the lead SNP. CONCLUSION: This work identified seven loci containing multiple independent association signals for RBC traits using a combined-phenotype approach, which may improve discovery in genetically correlated traits. Highly complex genetic architecture at the HBA1/2 locus was only revealed by the inclusion of African Americans and Hispanics/Latinos, underscoring the continued importance of expanding large GWAS to include ancestrally diverse populations.


Subject(s)
Black or African American/genetics , Erythrocytes/metabolism , Genome-Wide Association Study/methods , Hispanic or Latino/genetics , Quantitative Trait, Heritable , White People/genetics , Female , Genetics, Population , Humans , Male , Multifactorial Inheritance , Phenotype , Polymorphism, Single Nucleotide , Sequence Analysis, DNA , United States/ethnology
SELECTION OF CITATIONS
SEARCH DETAIL