Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 73
Filter
1.
Nature ; 586(7831): 749-756, 2020 10.
Article in English | MEDLINE | ID: mdl-33087929

ABSTRACT

The UK Biobank is a prospective study of 502,543 individuals, combining extensive phenotypic and genotypic data with streamlined access for researchers around the world1. Here we describe the release of exome-sequence data for the first 49,960 study participants, revealing approximately 4 million coding variants (of which around 98.6% have a frequency of less than 1%). The data include 198,269 autosomal predicted loss-of-function (LOF) variants, a more than 14-fold increase compared to the imputed sequence. Nearly all genes (more than 97%) had at least one carrier with a LOF variant, and most genes (more than 69%) had at least ten carriers with a LOF variant. We illustrate the power of characterizing LOF variants in this population through association analyses across 1,730 phenotypes. In addition to replicating established associations, we found novel LOF variants with large effects on disease traits, including PIEZO1 on varicose veins, COL6A1 on corneal resistance, MEPE on bone density, and IQGAP2 and GMPR on blood cell traits. We further demonstrate the value of exome sequencing by surveying the prevalence of pathogenic variants of clinical importance, and show that 2% of this population has a medically actionable variant. Furthermore, we characterize the penetrance of cancer in carriers of pathogenic BRCA1 and BRCA2 variants. Exome sequences from the first 49,960 participants highlight the promise of genome sequencing in large population-based studies and are now accessible to the scientific community.


Subject(s)
Databases, Genetic , Exome Sequencing , Exome/genetics , Loss of Function Mutation/genetics , Phenotype , Aged , Bone Density/genetics , Collagen Type VI/genetics , Demography , Female , Genes, BRCA1 , Genes, BRCA2 , Genotype , Humans , Ion Channels/genetics , Male , Middle Aged , Neoplasms/genetics , Penetrance , Peptide Fragments/genetics , United Kingdom , Varicose Veins/genetics , ras GTPase-Activating Proteins/genetics
2.
Nature ; 570(7762): 514-518, 2019 06.
Article in English | MEDLINE | ID: mdl-31217584

ABSTRACT

Genome-wide association studies (GWAS) have laid the foundation for investigations into the biology of complex traits, drug development and clinical guidelines. However, the majority of discovery efforts are based on data from populations of European ancestry1-3. In light of the differential genetic architecture that is known to exist between populations, bias in representation can exacerbate existing disease and healthcare disparities. Critical variants may be missed if they have a low frequency or are completely absent in European populations, especially as the field shifts its attention towards rare variants, which are more likely to be population-specific4-10. Additionally, effect sizes and their derived risk prediction scores derived in one population may not accurately extrapolate to other populations11,12. Here we demonstrate the value of diverse, multi-ethnic participants in large-scale genomic studies. The Population Architecture using Genomics and Epidemiology (PAGE) study conducted a GWAS of 26 clinical and behavioural phenotypes in 49,839 non-European individuals. Using strategies tailored for analysis of multi-ethnic and admixed populations, we describe a framework for analysing diverse populations, identify 27 novel loci and 38 secondary signals at known loci, as well as replicate 1,444 GWAS catalogue associations across these traits. Our data show evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications. In the United States-where minority populations have a disproportionately higher burden of chronic conditions13-the lack of representation of diverse populations in genetic research will result in inequitable access to precision medicine for those with the highest burden of disease. We strongly advocate for continued, large genome-wide efforts in diverse populations to maximize genetic discovery and reduce health disparities.


Subject(s)
Asian People/genetics , Black People/genetics , Genome-Wide Association Study/methods , Hispanic or Latino/genetics , Minority Groups , Multifactorial Inheritance/genetics , Women's Health , Body Height/genetics , Cohort Studies , Female , Genetics, Medical/methods , Health Equity/trends , Health Status Disparities , Humans , Male , United States
3.
Bioinformatics ; 38(14): 3621-3628, 2022 07 11.
Article in English | MEDLINE | ID: mdl-35640976

ABSTRACT

MOTIVATION: Medical images can provide rich information about diseases and their biology. However, investigating their association with genetic variation requires non-standard methods. We propose transferGWAS, a novel approach to perform genome-wide association studies directly on full medical images. First, we learn semantically meaningful representations of the images based on a transfer learning task, during which a deep neural network is trained on independent but similar data. Then, we perform genetic association tests with these representations. RESULTS: We validate the type I error rates and power of transferGWAS in simulation studies of synthetic images. Then we apply transferGWAS in a genome-wide association study of retinal fundus images from the UK Biobank. This first-of-a-kind GWAS of full imaging data yielded 60 genomic regions associated with retinal fundus images, of which 7 are novel candidate loci for eye-related traits and diseases. AVAILABILITY AND IMPLEMENTATION: Our method is implemented in Python and available at https://github.com/mkirchler/transferGWAS/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genome-Wide Association Study , Neural Networks, Computer , Genome-Wide Association Study/methods , Phenotype , Genome , Machine Learning
4.
Nature ; 542(7640): 186-190, 2017 02 09.
Article in English | MEDLINE | ID: mdl-28146470

ABSTRACT

Height is a highly heritable, classic polygenic trait with approximately 700 common associated variants identified through genome-wide association studies so far. Here, we report 83 height-associated coding variants with lower minor-allele frequencies (in the range of 0.1-4.8%) and effects of up to 2 centimetres per allele (such as those in IHH, STC2, AR and CRISPLD2), greater than ten times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (giving an increase of 1-2 centimetres per allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes that are mutated in monogenic growth disorders and highlight new biological candidates (such as ADAMTS3, IL11RA and NOX4) and pathways (such as proteoglycan and glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate-to-large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.


Subject(s)
Body Height/genetics , Gene Frequency/genetics , Genetic Variation/genetics , ADAMTS Proteins/genetics , Adult , Alleles , Cell Adhesion Molecules/genetics , Female , Genome, Human/genetics , Glycoproteins/genetics , Glycoproteins/metabolism , Glycosaminoglycans/biosynthesis , Hedgehog Proteins/genetics , Humans , Intercellular Signaling Peptides and Proteins/genetics , Intercellular Signaling Peptides and Proteins/metabolism , Interferon Regulatory Factors/genetics , Interleukin-11 Receptor alpha Subunit/genetics , Male , Multifactorial Inheritance/genetics , NADPH Oxidase 4 , NADPH Oxidases/genetics , Phenotype , Pregnancy-Associated Plasma Protein-A/metabolism , Procollagen N-Endopeptidase/genetics , Proteoglycans/biosynthesis , Proteolysis , Receptors, Androgen/genetics , Somatomedins/metabolism
5.
Am J Hum Genet ; 105(1): 15-28, 2019 07 03.
Article in English | MEDLINE | ID: mdl-31178129

ABSTRACT

Circulating levels of adiponectin, an adipocyte-secreted protein associated with cardiovascular and metabolic risk, are highly heritable. To gain insights into the biology that regulates adiponectin levels, we performed an exome array meta-analysis of 265,780 genetic variants in 67,739 individuals of European, Hispanic, African American, and East Asian ancestry. We identified 20 loci associated with adiponectin, including 11 that had been reported previously (p < 2 × 10-7). Comparison of exome array variants to regional linkage disequilibrium (LD) patterns and prior genome-wide association study (GWAS) results detected candidate variants (r2 > .60) spanning as much as 900 kb. To identify potential genes and mechanisms through which the previously unreported association signals act to affect adiponectin levels, we assessed cross-trait associations, expression quantitative trait loci in subcutaneous adipose, and biological pathways of nearby genes. Eight of the nine loci were also associated (p < 1 × 10-4) with at least one obesity or lipid trait. Candidate genes include PRKAR2A, PTH1R, and HDAC9, which have been suggested to play roles in adipocyte differentiation or bone marrow adipose tissue. Taken together, these findings provide further insights into the processes that influence circulating adiponectin levels.


Subject(s)
Adiponectin/genetics , Adipose Tissue/pathology , Exome/genetics , Genetic Predisposition to Disease , Lipids/analysis , Obesity/etiology , Polymorphism, Single Nucleotide , Adipose Tissue/metabolism , Adolescent , Adult , Black or African American/genetics , Aged , Aged, 80 and over , Female , Hispanic or Latino/genetics , Humans , Male , Middle Aged , Obesity/pathology , Phenotype , Quantitative Trait Loci , White People/genetics , Young Adult
6.
N Engl J Med ; 378(12): 1096-1106, 2018 03 22.
Article in English | MEDLINE | ID: mdl-29562163

ABSTRACT

BACKGROUND: Elucidation of the genetic factors underlying chronic liver disease may reveal new therapeutic targets. METHODS: We used exome sequence data and electronic health records from 46,544 participants in the DiscovEHR human genetics study to identify genetic variants associated with serum levels of alanine aminotransferase (ALT) and aspartate aminotransferase (AST). Variants that were replicated in three additional cohorts (12,527 persons) were evaluated for association with clinical diagnoses of chronic liver disease in DiscovEHR study participants and two independent cohorts (total of 37,173 persons) and with histopathological severity of liver disease in 2391 human liver samples. RESULTS: A splice variant (rs72613567:TA) in HSD17B13, encoding the hepatic lipid droplet protein hydroxysteroid 17-beta dehydrogenase 13, was associated with reduced levels of ALT (P=4.2×10-12) and AST (P=6.2×10-10). Among DiscovEHR study participants, this variant was associated with a reduced risk of alcoholic liver disease (by 42% [95% confidence interval {CI}, 20 to 58] among heterozygotes and by 53% [95% CI, 3 to 77] among homozygotes), nonalcoholic liver disease (by 17% [95% CI, 8 to 25] among heterozygotes and by 30% [95% CI, 13 to 43] among homozygotes), alcoholic cirrhosis (by 42% [95% CI, 14 to 61] among heterozygotes and by 73% [95% CI, 15 to 91] among homozygotes), and nonalcoholic cirrhosis (by 26% [95% CI, 7 to 40] among heterozygotes and by 49% [95% CI, 15 to 69] among homozygotes). Associations were confirmed in two independent cohorts. The rs72613567:TA variant was associated with a reduced risk of nonalcoholic steatohepatitis, but not steatosis, in human liver samples. The rs72613567:TA variant mitigated liver injury associated with the risk-increasing PNPLA3 p.I148M allele and resulted in an unstable and truncated protein with reduced enzymatic activity. CONCLUSIONS: A loss-of-function variant in HSD17B13 was associated with a reduced risk of chronic liver disease and of progression from steatosis to steatohepatitis. (Funded by Regeneron Pharmaceuticals and others.).


Subject(s)
17-Hydroxysteroid Dehydrogenases/genetics , Fatty Liver/genetics , Genetic Predisposition to Disease , Liver Diseases/genetics , Loss of Function Mutation , 17-Hydroxysteroid Dehydrogenases/metabolism , Alanine Transaminase/blood , Aspartate Aminotransferases/blood , Biomarkers/blood , Chronic Disease , Disease Progression , Female , Genetic Variation , Genotype , Humans , Linear Models , Liver/pathology , Liver Diseases/pathology , Male , Sequence Analysis, RNA , Exome Sequencing
7.
Nature ; 523(7561): 459-462, 2015 Jul 23.
Article in English | MEDLINE | ID: mdl-26131930

ABSTRACT

Homozygosity has long been associated with rare, often devastating, Mendelian disorders, and Darwin was one of the first to recognize that inbreeding reduces evolutionary fitness. However, the effect of the more distant parental relatedness that is common in modern human populations is less well understood. Genomic data now allow us to investigate the effects of homozygosity on traits of public health importance by observing contiguous homozygous segments (runs of homozygosity), which are inferred to be homozygous along their complete length. Given the low levels of genome-wide homozygosity prevalent in most human populations, information is required on very large numbers of people to provide sufficient power. Here we use runs of homozygosity to study 16 health-related quantitative traits in 354,224 individuals from 102 cohorts, and find statistically significant associations between summed runs of homozygosity and four complex traits: height, forced expiratory lung volume in one second, general cognitive ability and educational attainment (P < 1 × 10(-300), 2.1 × 10(-6), 2.5 × 10(-10) and 1.8 × 10(-10), respectively). In each case, increased homozygosity was associated with decreased trait value, equivalent to the offspring of first cousins being 1.2 cm shorter and having 10 months' less education. Similar effect sizes were found across four continental groups and populations with different degrees of genome-wide homozygosity, providing evidence that homozygosity, rather than confounding, directly contributes to phenotypic variance. Contrary to earlier reports in substantially smaller samples, no evidence was seen of an influence of genome-wide homozygosity on blood pressure and low density lipoprotein cholesterol, or ten other cardio-metabolic traits. Since directional dominance is predicted for traits under directional evolutionary selection, this study provides evidence that increased stature and cognitive function have been positively selected in human evolution, whereas many important risk factors for late-onset complex diseases may not have been.


Subject(s)
Body Height/genetics , Cognition , Homozygote , Biological Evolution , Blood Pressure/genetics , Cholesterol, LDL/genetics , Cohort Studies , Educational Status , Female , Forced Expiratory Volume/genetics , Genome, Human/genetics , Humans , Lung Volume Measurements , Male , Phenotype
8.
N Engl J Med ; 377(3): 211-221, 2017 07 20.
Article in English | MEDLINE | ID: mdl-28538136

ABSTRACT

BACKGROUND: Loss-of-function variants in the angiopoietin-like 3 gene (ANGPTL3) have been associated with decreased plasma levels of triglycerides, low-density lipoprotein (LDL) cholesterol, and high-density lipoprotein (HDL) cholesterol. It is not known whether such variants or therapeutic antagonism of ANGPTL3 are associated with a reduced risk of atherosclerotic cardiovascular disease. METHODS: We sequenced the exons of ANGPTL3 in 58,335 participants in the DiscovEHR human genetics study. We performed tests of association for loss-of-function variants in ANGPTL3 with lipid levels and with coronary artery disease in 13,102 case patients and 40,430 controls from the DiscovEHR study, with follow-up studies involving 23,317 case patients and 107,166 controls from four population studies. We also tested the effects of a human monoclonal antibody, evinacumab, against Angptl3 in dyslipidemic mice and against ANGPTL3 in healthy human volunteers with elevated levels of triglycerides or LDL cholesterol. RESULTS: In the DiscovEHR study, participants with heterozygous loss-of-function variants in ANGPTL3 had significantly lower serum levels of triglycerides, HDL cholesterol, and LDL cholesterol than participants without these variants. Loss-of-function variants were found in 0.33% of case patients with coronary artery disease and in 0.45% of controls (adjusted odds ratio, 0.59; 95% confidence interval, 0.41 to 0.85; P=0.004). These results were confirmed in the follow-up studies. In dyslipidemic mice, inhibition of Angptl3 with evinacumab resulted in a greater decrease in atherosclerotic lesion area and necrotic content than a control antibody. In humans, evinacumab caused a dose-dependent placebo-adjusted reduction in fasting triglyceride levels of up to 76% and LDL cholesterol levels of up to 23%. CONCLUSIONS: Genetic and therapeutic antagonism of ANGPTL3 in humans and of Angptl3 in mice was associated with decreased levels of all three major lipid fractions and decreased odds of atherosclerotic cardiovascular disease. (Funded by Regeneron Pharmaceuticals and others; ClinicalTrials.gov number, NCT01749878 .).


Subject(s)
Angiopoietins/antagonists & inhibitors , Antibodies, Monoclonal/administration & dosage , Atherosclerosis/drug therapy , Coronary Artery Disease/genetics , Dyslipidemias/drug therapy , Lipids/blood , Mutation , Aged , Angiopoietin-Like Protein 3 , Angiopoietin-like Proteins , Angiopoietins/genetics , Animals , Antibodies, Monoclonal/adverse effects , Antibodies, Monoclonal/pharmacology , Atherosclerosis/metabolism , Cardiovascular Diseases/prevention & control , Coronary Artery Disease/metabolism , Disease Models, Animal , Dose-Response Relationship, Drug , Double-Blind Method , Dyslipidemias/blood , Female , Humans , Lipid Metabolism/drug effects , Male , Mice , Mice, Inbred Strains , Middle Aged
9.
PLoS Genet ; 13(4): e1006760, 2017 04.
Article in English | MEDLINE | ID: mdl-28453575

ABSTRACT

Prior GWAS have identified loci associated with red blood cell (RBC) traits in populations of European, African, and Asian ancestry. These studies have not included individuals with an Amerindian ancestral background, such as Hispanics/Latinos, nor evaluated the full spectrum of genomic variation beyond single nucleotide variants. Using a custom genotyping array enriched for Amerindian ancestral content and 1000 Genomes imputation, we performed GWAS in 12,502 participants of Hispanic Community Health Study and Study of Latinos (HCHS/SOL) for hematocrit, hemoglobin, RBC count, RBC distribution width (RDW), and RBC indices. Approximately 60% of previously reported RBC trait loci generalized to HCHS/SOL Hispanics/Latinos, including African ancestral alpha- and beta-globin gene variants. In addition to the known 3.8kb alpha-globin copy number variant, we identified an Amerindian ancestral association in an alpha-globin regulatory region on chromosome 16p13.3 for mean corpuscular volume and mean corpuscular hemoglobin. We also discovered and replicated three genome-wide significant variants in previously unreported loci for RDW (SLC12A2 rs17764730, PSMB5 rs941718), and hematocrit (PROX1 rs3754140). Among the proxy variants at the SLC12A2 locus we identified rs3812049, located in a bi-directional promoter between SLC12A2 (which encodes a red cell membrane ion-transport protein) and an upstream anti-sense long-noncoding RNA, LINC01184, as the likely causal variant. We further demonstrate that disruption of the regulatory element harboring rs3812049 affects transcription of SLC12A2 and LINC01184 in human erythroid progenitor cells. Together, these results reinforce the importance of genetic study of diverse ancestral populations, in particular Hispanics/Latinos.


Subject(s)
Homeodomain Proteins/genetics , Proteasome Endopeptidase Complex/genetics , RNA, Long Noncoding/genetics , Solute Carrier Family 12, Member 2/genetics , Tumor Suppressor Proteins/genetics , alpha-Globins/genetics , Erythrocyte Count , Erythrocytes , Female , Genome-Wide Association Study , Hemoglobins/genetics , Hispanic or Latino/genetics , Humans , Male , Polymorphism, Single Nucleotide , beta-Globins/genetics
10.
Hum Mol Genet ; 26(6): 1193-1204, 2017 03 15.
Article in English | MEDLINE | ID: mdl-28158719

ABSTRACT

Circulating white blood cell (WBC) counts (neutrophils, monocytes, lymphocytes, eosinophils, basophils) differ by ethnicity. The genetic factors underlying basal WBC traits in Hispanics/Latinos are unknown. We performed a genome-wide association study of total WBC and differential counts in a large, ethnically diverse US population sample of Hispanics/Latinos ascertained by the Hispanic Community Health Study and Study of Latinos (HCHS/SOL). We demonstrate that several previously known WBC-associated genetic loci (e.g. the African Duffy antigen receptor for chemokines null variant for neutrophil count) are generalizable to WBC traits in Hispanics/Latinos. We identified and replicated common and rare germ-line variants at FLT3 (a gene often somatically mutated in leukemia) associated with monocyte count. The common FLT3 variant rs76428106 has a large allele frequency differential between African and non-African populations. We also identified several novel genetic loci involving or regulating hematopoietic transcription factors (CEBPE-SLC7A7, CEBPA and CRBN-TRNT1) associated with basophil count. The minor allele of the CEBPE variant associated with lower basophil count has been previously associated with Amerindian ancestry and higher risk of acute lymphoblastic leukemia in Hispanics. Together, these data suggest that germline genetic variation affecting transcriptional and signaling pathways that underlie WBC development and lineage specification can contribute to inter-individual as well as ethnic differences in peripheral blood cell counts (normal hematopoiesis) in addition to susceptibility to leukemia (malignant hematopoiesis).


Subject(s)
CCAAT-Enhancer-Binding Proteins/genetics , Genome-Wide Association Study , Leukocyte Count , fms-Like Tyrosine Kinase 3/genetics , Black or African American/genetics , Basophils/cytology , Female , Gene Frequency , Hispanic or Latino/genetics , Humans , Lymphocytes/cytology , Male , Monocytes/cytology , Neutrophils/cytology , United States/epidemiology , White People/genetics
11.
Am J Hum Genet ; 98(2): 229-42, 2016 Feb 04.
Article in English | MEDLINE | ID: mdl-26805783

ABSTRACT

Platelets play an essential role in hemostasis and thrombosis. We performed a genome-wide association study of platelet count in 12,491 participants of the Hispanic Community Health Study/Study of Latinos by using a mixed-model method that accounts for admixture and family relationships. We discovered and replicated associations with five genes (ACTN1, ETV7, GABBR1-MOG, MEF2C, and ZBTB9-BAK1). Our strongest association was with Amerindian-specific variant rs117672662 (p value = 1.16 × 10(-28)) in ACTN1, a gene implicated in congenital macrothrombocytopenia. rs117672662 exhibited allelic differences in transcriptional activity and protein binding in hematopoietic cells. Our results underscore the value of diverse populations to extend insights into the allelic architecture of complex traits.


Subject(s)
Genetic Association Studies/methods , Genetic Loci , Hispanic or Latino/genetics , Platelet Count , Actinin/genetics , Adolescent , Adult , Aged , Alleles , Gene Frequency , Genotype , Genotyping Techniques , Humans , MEF2 Transcription Factors/genetics , Membrane Proteins/genetics , Middle Aged , Phenotype , Polymorphism, Single Nucleotide , Receptors, GABA-B/genetics , Young Adult
12.
Am J Hum Genet ; 99(1): 40-55, 2016 Jul 07.
Article in English | MEDLINE | ID: mdl-27346686

ABSTRACT

Platelet production, maintenance, and clearance are tightly controlled processes indicative of platelets' important roles in hemostasis and thrombosis. Platelets are common targets for primary and secondary prevention of several conditions. They are monitored clinically by complete blood counts, specifically with measurements of platelet count (PLT) and mean platelet volume (MPV). Identifying genetic effects on PLT and MPV can provide mechanistic insights into platelet biology and their role in disease. Therefore, we formed the Blood Cell Consortium (BCX) to perform a large-scale meta-analysis of Exomechip association results for PLT and MPV in 157,293 and 57,617 individuals, respectively. Using the low-frequency/rare coding variant-enriched Exomechip genotyping array, we sought to identify genetic variants associated with PLT and MPV. In addition to confirming 47 known PLT and 20 known MPV associations, we identified 32 PLT and 18 MPV associations not previously observed in the literature across the allele frequency spectrum, including rare large effect (FCER1A), low-frequency (IQGAP2, MAP1A, LY75), and common (ZMIZ2, SMG6, PEAR1, ARFGAP3/PACSIN2) variants. Several variants associated with PLT/MPV (PEAR1, MRVI1, PTGES3) were also associated with platelet reactivity. In concurrent BCX analyses, there was overlap of platelet-associated variants with red (MAP1A, TMPRSS6, ZMIZ2) and white (PEAR1, ZMIZ2, LY75) blood cell traits, suggesting common regulatory pathways with shared genetic architecture among these hematopoietic lineages. Our large-scale Exomechip analyses identified previously undocumented associations with platelet traits and further indicate that several complex quantitative hematological, lipid, and cardiovascular traits share genetic factors.


Subject(s)
Blood Platelets/metabolism , Exome/genetics , Genetic Variation/genetics , Female , Genome-Wide Association Study , Humans , Male , Mean Platelet Volume , Platelet Count
13.
Am J Hum Genet ; 99(1): 8-21, 2016 Jul 07.
Article in English | MEDLINE | ID: mdl-27346685

ABSTRACT

Red blood cell (RBC) traits are important heritable clinical biomarkers and modifiers of disease severity. To identify coding genetic variants associated with these traits, we conducted meta-analyses of seven RBC phenotypes in 130,273 multi-ethnic individuals from studies genotyped on an exome array. After conditional analyses and replication in 27,480 independent individuals, we identified 16 new RBC variants. We found low-frequency missense variants in MAP1A (rs55707100, minor allele frequency [MAF] = 3.3%, p = 2 × 10(-10) for hemoglobin [HGB]) and HNF4A (rs1800961, MAF = 2.4%, p < 3 × 10(-8) for hematocrit [HCT] and HGB). In African Americans, we identified a nonsense variant in CD36 associated with higher RBC distribution width (rs3211938, MAF = 8.7%, p = 7 × 10(-11)) and showed that it is associated with lower CD36 expression and strong allelic imbalance in ex vivo differentiated human erythroblasts. We also identified a rare missense variant in ALAS2 (rs201062903, MAF = 0.2%) associated with lower mean corpuscular volume and mean corpuscular hemoglobin (p < 8 × 10(-9)). Mendelian mutations in ALAS2 are a cause of sideroblastic anemia and erythropoietic protoporphyria. Gene-based testing highlighted three rare missense variants in PKLR, a gene mutated in Mendelian non-spherocytic hemolytic anemia, associated with HGB and HCT (SKAT p < 8 × 10(-7)). These rare, low-frequency, and common RBC variants showed pleiotropy, being also associated with platelet, white blood cell, and lipid traits. Our association results and functional annotation suggest the involvement of new genes in human erythropoiesis. We also confirm that rare and low-frequency variants play a role in the architecture of complex human traits, although their phenotypic effect is generally smaller than originally anticipated.


Subject(s)
Erythrocytes/cytology , Erythropoiesis/genetics , Exome/genetics , Genetic Pleiotropy , Genetic Variation/genetics , Genotype , Black or African American/genetics , Allelic Imbalance , Erythrocyte Indices , Erythrocytes/metabolism , Gene Frequency , Hematocrit , Hemoglobins/genetics , Humans , Quantitative Trait Loci/genetics
14.
Am J Hum Genet ; 99(1): 22-39, 2016 Jul 07.
Article in English | MEDLINE | ID: mdl-27346689

ABSTRACT

White blood cells play diverse roles in innate and adaptive immunity. Genetic association analyses of phenotypic variation in circulating white blood cell (WBC) counts from large samples of otherwise healthy individuals can provide insights into genes and biologic pathways involved in production, differentiation, or clearance of particular WBC lineages (myeloid, lymphoid) and also potentially inform the genetic basis of autoimmune, allergic, and blood diseases. We performed an exome array-based meta-analysis of total WBC and subtype counts (neutrophils, monocytes, lymphocytes, basophils, and eosinophils) in a multi-ancestry discovery and replication sample of âˆ¼157,622 individuals from 25 studies. We identified 16 common variants (8 of which were coding variants) associated with one or more WBC traits, the majority of which are pleiotropically associated with autoimmune diseases. Based on functional annotation, these loci included genes encoding surface markers of myeloid, lymphoid, or hematopoietic stem cell differentiation (CD69, CD33, CD87), transcription factors regulating lineage specification during hematopoiesis (ASXL1, IRF8, IKZF1, JMJD1C, ETS2-PSMG1), and molecules involved in neutrophil clearance/apoptosis (C10orf54, LTA), adhesion (TNXB), or centrosome and microtubule structure/function (KIF9, TUBD1). Together with recent reports of somatic ASXL1 mutations among individuals with idiopathic cytopenias or clonal hematopoiesis of undetermined significance, the identification of a common regulatory 3' UTR variant of ASXL1 suggests that both germline and somatic ASXL1 mutations contribute to lower blood counts in otherwise asymptomatic individuals. These association results shed light on genetic mechanisms that regulate circulating WBC counts and suggest a prominent shared genetic architecture with inflammatory and autoimmune diseases.


Subject(s)
Exome/genetics , Genetic Loci/genetics , Genetic Pleiotropy , Genome-Wide Association Study , Immune System Diseases/genetics , Leukocytes/cytology , Blood Cell Count , Humans , Quality Control
15.
Hum Mol Genet ; 25(21): 4611-4623, 2016 11 01.
Article in English | MEDLINE | ID: mdl-28158590

ABSTRACT

Cigarette smoking is a leading modifiable cause of death worldwide. We hypothesized that cigarette smoking induces extensive transcriptomic changes that lead to target-organ damage and smoking-related diseases. We performed a meta-analysis of transcriptome-wide gene expression using whole blood-derived RNA from 10,233 participants of European ancestry in six cohorts (including 1421 current and 3955 former smokers) to identify associations between smoking and altered gene expression levels. At a false discovery rate (FDR) <0.1, we identified 1270 differentially expressed genes in current vs. never smokers, and 39 genes in former vs. never smokers. Expression levels of 12 genes remained elevated up to 30 years after smoking cessation, suggesting that the molecular consequence of smoking may persist for decades. Gene ontology analysis revealed enrichment of smoking-related genes for activation of platelets and lymphocytes, immune response, and apoptosis. Many of the top smoking-related differentially expressed genes, including LRRN3 and GPR15, have DNA methylation loci in promoter regions that were recently reported to be hypomethylated among smokers. By linking differential gene expression with smoking-related disease phenotypes, we demonstrated that stroke and pulmonary function show enrichment for smoking-related gene expression signatures. Mediation analysis revealed the expression of several genes (e.g. ALAS2) to be putative mediators of the associations between smoking and inflammatory biomarkers (IL6 and C-reactive protein levels). Our transcriptomic study provides potential insights into the effects of cigarette smoking on gene expression in whole blood and their relations to smoking-related diseases. The results of such analyses may highlight attractive targets for treating or preventing smoking-related health effects.


Subject(s)
Cigarette Smoking/genetics , Gene Expression/drug effects , Adult , Aged , Cigarette Smoking/blood , Cohort Studies , CpG Islands , DNA Methylation , Female , Gene Expression Profiling , Gene Expression Regulation/genetics , Humans , Leukocytes/drug effects , Male , Middle Aged , Smoking/genetics , Transcriptome/drug effects , White People/genetics
16.
Hum Mol Genet ; 25(10): 2082-2092, 2016 05 15.
Article in English | MEDLINE | ID: mdl-26908616

ABSTRACT

Although the role of complete gene inactivation by two loss-of-function mutations inherited in trans is well-established in recessive Mendelian diseases, we have not yet explored how such gene knockouts (KOs) could influence complex human phenotypes. Here, we developed a statistical framework to test the association between gene KOs and quantitative human traits. Our method is flexible, publicly available, and compatible with common genotype format files (e.g. PLINK and vcf). We characterized gene KOs in 4498 participants from the NHLBI Exome Sequence Project (ESP) sequenced at high coverage (>100×), 1976 French Canadians from the Montreal Heart Institute Biobank sequenced at low coverage (5.7×), and >100 000 participants from the Genetic Investigation of ANthropometric Traits (GIANT) Consortium genotyped on an exome array. We tested associations between gene KOs and three anthropometric traits: body mass index (BMI), height and BMI-adjusted waist-to-hip ratio (WHR). Despite our large sample size and multiple datasets available, we could not detect robust associations between specific gene KOs and quantitative anthropometric traits. Our results highlight several limitations and challenges for future gene KO studies in humans, in particular when there is no prior knowledge on the phenotypes that might be affected by the tested gene KOs. They also suggest that gene KOs identified with current DNA sequencing methodologies probably do not strongly influence normal variation in BMI, height, and WHR in the general human population.


Subject(s)
Body Height/genetics , Body Mass Index , Quantitative Trait Loci/genetics , Waist-Hip Ratio , Anthropometry , Canada , Exome/genetics , Female , Gene Knockout Techniques , Genotype , Humans , Male , Mutation , Phenotype , Polymorphism, Single Nucleotide
17.
Hum Genet ; 137(10): 847-862, 2018 Oct.
Article in English | MEDLINE | ID: mdl-30317457

ABSTRACT

Primary open angle glaucoma (POAG) is a complex disease with a major genetic contribution. Its prevalence varies greatly among ethnic groups, and is up to five times more frequent in black African populations compared to Europeans. So far, worldwide efforts to elucidate the genetic complexity of POAG in African populations has been limited. We conducted a genome-wide association study in 1113 POAG cases and 1826 controls from Tanzanian, South African and African American study samples. Apart from confirming evidence of association at TXNRD2 (rs16984299; OR[T] 1.20; P = 0.003), we found that a genetic risk score combining the effects of the 15 previously reported POAG loci was significantly associated with POAG in our samples (OR 1.56; 95% CI 1.26-1.93; P = 4.79 × 10-5). By genome-wide association testing we identified a novel candidate locus, rs141186647, harboring EXOC4 (OR[A] 0.48; P = 3.75 × 10-8), a gene transcribing a component of the exocyst complex involved in vesicle transport. The low frequency and high degree of genetic heterogeneity at this region hampered validation of this finding in predominantly West-African replication sets. Our results suggest that established genetic risk factors play a role in African POAG, however, they do not explain the higher disease load. The high heterogeneity within Africans remains a challenge to identify the genetic commonalities for POAG in this ethnicity, and demands studies of extremely large size.


Subject(s)
Black People/genetics , Genetic Loci , Genome-Wide Association Study , Glaucoma, Open-Angle/genetics , Thioredoxin Reductase 2/genetics , Vesicular Transport Proteins/genetics , Aged , Aged, 80 and over , Female , Humans , Male , Middle Aged
18.
Am J Hematol ; 2018 Jun 15.
Article in English | MEDLINE | ID: mdl-29905378

ABSTRACT

Red blood cell (RBC) traits provide insight into a wide range of physiological states and exhibit moderate to high heritability, making them excellent candidates for genetic studies to inform underlying biologic mechanisms. Previous RBC trait genome-wide association studies were performed primarily in European- or Asian-ancestry populations, missing opportunities to inform understanding of RBC genetic architecture in diverse populations and reduce intervals surrounding putative functional SNPs through fine-mapping. Here, we report the first fine-mapping of six correlated (Pearson's r range: |0.04 - 0.92|) RBC traits in up to 19,036 African Americans and 19,562 Hispanic/Latinos participants of the Population Architecture using Genomics and Epidemiology (PAGE) consortium. Trans-ethnic meta-analysis of race/ethnic- and study-specific estimates for approximately 11,000 SNPs flanking 13 previously identified association signals as well as 150,000 additional array-wide SNPs was performed using inverse-variance meta-analysis after adjusting for study and clinical covariates. Approximately half of previously reported index SNP-RBC trait associations generalized to the trans-ethnic study population (p<1.7x10-4 ); previously unreported independent association signals within the ABO region reinforce the potential for multiple functional variants affecting the same locus. Trans-ethnic fine-mapping did not reveal additional signals at the HFE locus independent of the known functional variants. Finally, we identified a potential novel association in the Hispanic/Latino study population at the HECTD4/RPL6 locus for RBC count (p=1.9x10-7 ). The identification of a previously unknown association, generalization of a large proportion of known association signals, and refinement of known association signals all exemplify the benefits of genetic studies in diverse populations. This article is protected by copyright. All rights reserved.

19.
Nicotine Tob Res ; 20(4): 448-457, 2018 03 06.
Article in English | MEDLINE | ID: mdl-28520984

ABSTRACT

Introduction: Genetic variants associated with nicotine dependence have previously been identified, primarily in European-ancestry populations. No genome-wide association studies (GWAS) have been reported for smoking behaviors in Hispanics/Latinos in the United States and Latin America, who are of mixed ancestry with European, African, and American Indigenous components. Methods: We examined genetic associations with smoking behaviors in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) (N = 12 741 with smoking data, 5119 ever-smokers), using ~2.3 million genotyped variants imputed to the 1000 Genomes Project phase 3. Mixed logistic regression models accounted for population structure, sampling, relatedness, sex, and age. Results: The known region of CHRNA5, which encodes the α5 cholinergic nicotinic receptor subunit, was associated with heavy smoking at genome-wide significance (p ≤ 5 × 10-8) in a comparison of 1929 ever-smokers reporting cigarettes per day (CPD) > 10 versus 3156 reporting CPD ≤ 10. The functional variant rs16969968 in CHRNA5 had a p value of 2.20 × 10-7 and odds ratio (OR) of 1.32 for the minor allele (A); its minor allele frequency was 0.22 overall and similar across Hispanic/Latino background groups (Central American = 0.17; South American = 0.19; Mexican = 0.18; Puerto Rican = 0.22; Cuban = 0.29; Dominican = 0.19). CHRNA4 on chromosome 20 attained p < 10-4, supporting prior findings in non-Hispanics. For nondaily smoking, which is prevalent in Hispanic/Latino smokers, compared to daily smoking, loci on chromosomes 2 and 4 achieved genome-wide significance; replication attempts were limited by small Hispanic/Latino sample sizes. Conclusions: Associations of nicotinic receptor gene variants with smoking, first reported in non-Hispanic European-ancestry populations, generalized to Hispanics/Latinos despite different patterns of smoking behavior. Implications: We conducted the first large-scale genome-wide association study (GWAS) of smoking behavior in a US Hispanic/Latino cohort, and the first GWAS of daily/nondaily smoking in any population. Results show that the region of the nicotinic receptor subunit gene CHRNA5, which in non-Hispanic European-ancestry smokers has been associated with heavy smoking as well as cessation and treatment efficacy, is also significantly associated with heavy smoking in this Hispanic/Latino cohort. The results are an important addition to understanding the impact of genetic variants in understudied Hispanic/Latino smokers.


Subject(s)
Genome-Wide Association Study/methods , Hispanic or Latino/genetics , Nerve Tissue Proteins/genetics , Public Health/methods , Receptors, Nicotinic/genetics , Smoking/epidemiology , Smoking/genetics , Adult , Female , Gene Frequency , Genotype , Humans , Male , Middle Aged , United States/epidemiology
20.
PLoS Genet ; 11(5): e1005223, 2015 May.
Article in English | MEDLINE | ID: mdl-25955312

ABSTRACT

The functional consequences of trait associated SNPs are often investigated using expression quantitative trait locus (eQTL) mapping. While trait-associated variants may operate in a cell-type specific manner, eQTL datasets for such cell-types may not always be available. We performed a genome-environment interaction (GxE) meta-analysis on data from 5,683 samples to infer the cell type specificity of whole blood cis-eQTLs. We demonstrate that this method is able to predict neutrophil and lymphocyte specific cis-eQTLs and replicate these predictions in independent cell-type specific datasets. Finally, we show that SNPs associated with Crohn's disease preferentially affect gene expression within neutrophils, including the archetypal NOD2 locus.


Subject(s)
Lymphocytes/cytology , Neutrophils/cytology , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Cell Line , Crohn Disease/genetics , Gene Expression Regulation , Genome-Wide Association Study/methods , Humans , Lymphocytes/metabolism , Neutrophils/metabolism , Nod2 Signaling Adaptor Protein/genetics , Nod2 Signaling Adaptor Protein/metabolism , Phenotype , Principal Component Analysis , Reproducibility of Results
SELECTION OF CITATIONS
SEARCH DETAIL