Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 68
Filter
Add more filters

Publication year range
1.
Cell ; 179(3): 589-603, 2019 10 17.
Article in English | MEDLINE | ID: mdl-31607513

ABSTRACT

Genome-wide association studies (GWASs) have focused primarily on populations of European descent, but it is essential that diverse populations become better represented. Increasing diversity among study participants will advance our understanding of genetic architecture in all populations and ensure that genetic research is broadly applicable. To facilitate and promote research in multi-ancestry and admixed cohorts, we outline key methodological considerations and highlight opportunities, challenges, solutions, and areas in need of development. Despite the perception that analyzing genetic data from diverse populations is difficult, it is scientifically and ethically imperative, and there is an expanding analytical toolbox to do it well.


Subject(s)
Genome-Wide Association Study/methods , Genotyping Techniques/methods , Human Genetics/methods , Data Accuracy , Genetic Variation , Genetics, Population/methods , Genetics, Population/standards , Genome-Wide Association Study/standards , Genotyping Techniques/standards , Human Genetics/standards , Humans , Pedigree
2.
Cell ; 171(6): 1340-1353.e14, 2017 Nov 30.
Article in English | MEDLINE | ID: mdl-29195075

ABSTRACT

Approximately 15 genes have been directly associated with skin pigmentation variation in humans, leading to its characterization as a relatively simple trait. However, by assembling a global survey of quantitative skin pigmentation phenotypes, we demonstrate that pigmentation is more complex than previously assumed, with genetic architecture varying by latitude. We investigate polygenicity in the KhoeSan populations indigenous to southern Africa who have considerably lighter skin than equatorial Africans. We demonstrate that skin pigmentation is highly heritable, but known pigmentation loci explain only a small fraction of the variance. Rather, baseline skin pigmentation is a complex, polygenic trait in the KhoeSan. Despite this, we identify canonical and non-canonical skin pigmentation loci, including near SLC24A5, TYRP1, SMARCA2/VLDLR, and SNX13, using a genome-wide association approach complemented by targeted resequencing. By considering diverse, under-studied African populations, we show how the architecture of skin pigmentation can vary across humans subject to different local evolutionary pressures.


Subject(s)
Skin Pigmentation , Africa , Black People/genetics , Humans , Polymorphism, Single Nucleotide
3.
Cell ; 162(5): 1051-65, 2015 Aug 27.
Article in English | MEDLINE | ID: mdl-26300125

ABSTRACT

Deciphering the impact of genetic variants on gene regulation is fundamental to understanding human disease. Although gene regulation often involves long-range interactions, it is unknown to what extent non-coding genetic variants influence distal molecular phenotypes. Here, we integrate chromatin profiling for three histone marks in lymphoblastoid cell lines (LCLs) from 75 sequenced individuals with LCL-specific Hi-C and ChIA-PET-based chromatin contact maps to uncover one of the largest collections of local and distal histone quantitative trait loci (hQTLs). Distal QTLs are enriched within topologically associated domains and exhibit largely concordant variation of chromatin state coordinated by proximal and distal non-coding genetic variants. Histone QTLs are enriched for common variants associated with autoimmune diseases and enable identification of putative target genes of disease-associated variants from genome-wide association studies. These analyses provide insights into how genetic variation can affect human disease phenotypes by coordinated changes in chromatin at interacting regulatory elements.


Subject(s)
Chromatin/metabolism , Chromosomes, Human/metabolism , Human Genome Project , Cell Line , Chromosomes, Human/chemistry , Cohort Studies , Female , Gene Regulatory Networks , Genome-Wide Association Study , Histones/metabolism , Humans , Lymphocytes/metabolism , Male , Quantitative Trait Loci , Regulatory Elements, Transcriptional
4.
Nature ; 625(7993): 92-100, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38057664

ABSTRACT

The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders1-4, but attempts to assess constraint for non-protein-coding regions have proved more difficult. Here we aggregate, process and release a dataset of 76,156 human genomes from the Genome Aggregation Database (gnomAD)-the largest public open-access human genome allele frequency reference dataset-and use it to build a genomic constraint map for the whole genome (genomic non-coding constraint of haploinsufficient variation (Gnocchi)). We present a refined mutational model that incorporates local sequence context and regional genomic features to detect depletions of variation. As expected, the average constraint for protein-coding sequences is stronger than that for non-coding regions. Within the non-coding genome, constrained regions are enriched for known regulatory elements and variants that are implicated in complex human diseases and traits, facilitating the triangulation of biological annotation, disease association and natural selection to non-coding DNA analysis. More constrained regulatory elements tend to regulate more constrained protein-coding genes, which in turn suggests that non-coding constraint can aid the identification of constrained genes that are as yet unrecognized by current gene constraint metrics. We demonstrate that this genome-wide constraint map improves the identification and interpretation of functional human genetic variation.


Subject(s)
Genome, Human , Genomics , Models, Genetic , Mutation , Humans , Access to Information , Databases, Genetic , Datasets as Topic , Gene Frequency , Genome, Human/genetics , Mutation/genetics , Selection, Genetic
5.
Am J Hum Genet ; 111(5): 809-824, 2024 05 02.
Article in English | MEDLINE | ID: mdl-38642557

ABSTRACT

Advancements in genomic technologies have shown remarkable promise for improving health trajectories. The Human Genome Project has catalyzed the integration of genomic tools into clinical practice, such as disease risk assessment, prenatal testing and reproductive genomics, cancer diagnostics and prognostication, and therapeutic decision making. Despite the promise of genomic technologies, their full potential remains untapped without including individuals of diverse ancestries and integrating social determinants of health (SDOHs). The NHGRI launched the 2020 Strategic Vision with ten bold predictions by 2030, including "individuals from ancestrally diverse backgrounds will benefit equitably from advances in human genomics." Meeting this goal requires a holistic approach that brings together genomic advancements with careful consideration to healthcare access as well as SDOHs to ensure that translation of genetics research is inclusive, affordable, and accessible and ultimately narrows rather than widens health disparities. With this prediction in mind, this review delves into the two paramount applications of genetic testing-reproductive genomics and precision oncology. When discussing these applications of genomic advancements, we evaluate current accessibility limitations, highlight challenges in achieving representativeness, and propose paths forward to realize the ultimate goal of their equitable applications.


Subject(s)
Genomics , Precision Medicine , Humans , Genomics/methods , Precision Medicine/methods , Genome, Human , Genetic Testing , Neoplasms/genetics , Health Services Accessibility
6.
N Engl J Med ; 390(22): 2083-2097, 2024 Jun 13.
Article in English | MEDLINE | ID: mdl-38767252

ABSTRACT

BACKGROUND: Adjustment for race is discouraged in lung-function testing, but the implications of adopting race-neutral equations have not been comprehensively quantified. METHODS: We obtained longitudinal data from 369,077 participants in the National Health and Nutrition Examination Survey, U.K. Biobank, the Multi-Ethnic Study of Atherosclerosis, and the Organ Procurement and Transplantation Network. Using these data, we compared the race-based 2012 Global Lung Function Initiative (GLI-2012) equations with race-neutral equations introduced in 2022 (GLI-Global). Evaluated outcomes included national projections of clinical, occupational, and financial reclassifications; individual lung-allocation scores for transplantation priority; and concordance statistics (C statistics) for clinical prediction tasks. RESULTS: Among the 249 million persons in the United States between 6 and 79 years of age who are able to produce high-quality spirometric results, the use of GLI-Global equations may reclassify ventilatory impairment for 12.5 million persons, medical impairment ratings for 8.16 million, occupational eligibility for 2.28 million, grading of chronic obstructive pulmonary disease for 2.05 million, and military disability compensation for 413,000. These potential changes differed according to race; for example, classifications of nonobstructive ventilatory impairment may change dramatically, increasing 141% (95% confidence interval [CI], 113 to 169) among Black persons and decreasing 69% (95% CI, 63 to 74) among White persons. Annual disability payments may increase by more than $1 billion among Black veterans and decrease by $0.5 billion among White veterans. GLI-2012 and GLI-Global equations had similar discriminative accuracy with regard to respiratory symptoms, health care utilization, new-onset disease, death from any cause, death related to respiratory disease, and death among persons on a transplant waiting list, with differences in C statistics ranging from -0.008 to 0.011. CONCLUSIONS: The use of race-based and race-neutral equations generated similarly accurate predictions of respiratory outcomes but assigned different disease classifications, occupational eligibility, and disability compensation for millions of persons, with effects diverging according to race. (Funded by the National Heart Lung and Blood Institute and the National Institute of Environmental Health Sciences.).


Subject(s)
Respiratory Function Tests , Respiratory Insufficiency , Adolescent , Adult , Aged , Child , Female , Humans , Male , Middle Aged , Young Adult , Lung Diseases/diagnosis , Lung Diseases/economics , Lung Diseases/ethnology , Lung Diseases/therapy , Lung Transplantation/statistics & numerical data , Nutrition Surveys/statistics & numerical data , Pulmonary Disease, Chronic Obstructive/diagnosis , Pulmonary Disease, Chronic Obstructive/economics , Pulmonary Disease, Chronic Obstructive/ethnology , Pulmonary Disease, Chronic Obstructive/therapy , Racial Groups , Respiratory Function Tests/classification , Respiratory Function Tests/economics , Respiratory Function Tests/standards , Spirometry , United States/epidemiology , Respiratory Insufficiency/diagnosis , Respiratory Insufficiency/economics , Respiratory Insufficiency/ethnology , Respiratory Insufficiency/therapy , Black or African American/statistics & numerical data , White/statistics & numerical data , Disability Evaluation , Veterans Disability Claims/classification , Veterans Disability Claims/economics , Veterans Disability Claims/statistics & numerical data , Disabled Persons/classification , Disabled Persons/statistics & numerical data , Occupational Diseases/diagnosis , Occupational Diseases/economics , Occupational Diseases/ethnology , Financing, Government/economics , Financing, Government/statistics & numerical data
7.
Genome Res ; 34(5): 796-809, 2024 06 25.
Article in English | MEDLINE | ID: mdl-38749656

ABSTRACT

Underrepresented populations are often excluded from genomic studies owing in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high-quality set of 4094 whole genomes from 80 populations in the HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also show substantial added value from this data set compared with the prior versions of the component resources, typically combined via liftOver and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared with previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality-control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.


Subject(s)
Databases, Genetic , Genome, Human , Humans , Human Genome Project , High-Throughput Nucleotide Sequencing/methods , Genetic Variation , Genomics/methods
8.
Am J Hum Genet ; 109(9): 1667-1679, 2022 09 01.
Article in English | MEDLINE | ID: mdl-36055213

ABSTRACT

African populations are the most diverse in the world yet are sorely underrepresented in medical genetics research. Here, we examine the structure of African populations using genetic and comprehensive multi-generational ethnolinguistic data from the Neuropsychiatric Genetics of African Populations-Psychosis study (NeuroGAP-Psychosis) consisting of 900 individuals from Ethiopia, Kenya, South Africa, and Uganda. We find that self-reported language classifications meaningfully tag underlying genetic variation that would be missed with consideration of geography alone, highlighting the importance of culture in shaping genetic diversity. Leveraging our uniquely rich multi-generational ethnolinguistic metadata, we track language transmission through the pedigree, observing the disappearance of several languages in our cohort as well as notable shifts in frequency over three generations. We find suggestive evidence for the rate of language transmission in matrilineal groups having been higher than that for patrilineal ones. We highlight both the diversity of variation within Africa as well as how within-Africa variation can be informative for broader variant interpretation; many variants that are rare elsewhere are common in parts of Africa. The work presented here improves the understanding of the spectrum of genetic variation in African populations and highlights the enormous and complex genetic and ethnolinguistic diversity across Africa.


Subject(s)
Genetic Variation , Genetics, Population , Africa, Southern , Black People/genetics , Genetic Structures , Genetic Variation/genetics , Humans
10.
Am J Hum Genet ; 108(4): 656-668, 2021 04 01.
Article in English | MEDLINE | ID: mdl-33770507

ABSTRACT

Genetic studies in underrepresented populations identify disproportionate numbers of novel associations. However, most genetic studies use genotyping arrays and sequenced reference panels that best capture variation most common in European ancestry populations. To compare data generation strategies best suited for underrepresented populations, we sequenced the whole genomes of 91 individuals to high coverage as part of the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study with participants from Ethiopia, Kenya, South Africa, and Uganda. We used a downsampling approach to evaluate the quality of two cost-effective data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole-genome sequencing data. We show that low-coverage sequencing at a depth of ≥4× captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1×) performed comparably to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation; 4× sequencing detects 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, effectively identify novel variation particularly in underrepresented populations, and present opportunities to enhance variant discovery at a cost similar to traditional approaches.


Subject(s)
DNA Mutational Analysis/economics , DNA Mutational Analysis/standards , Genetic Variation/genetics , Genetics, Population/economics , Africa , DNA Mutational Analysis/methods , Genetics, Population/methods , Genome, Human/genetics , Genome-Wide Association Study , Health Equity , Humans , Microbiota , Whole Genome Sequencing/economics , Whole Genome Sequencing/standards
11.
N Engl J Med ; 385(1): 78-86, 2021 07 01.
Article in English | MEDLINE | ID: mdl-34192436

ABSTRACT

Companies have recently begun to sell a new service to patients considering in vitro fertilization: embryo selection based on polygenic scores (ESPS). These scores represent individualized predictions of health and other outcomes derived from genomewide association studies in adults to partially predict these outcomes. This article includes a discussion of many factors that lower the predictive power of polygenic scores in the context of embryo selection and quantifies these effects for a variety of clinical and nonclinical traits. Also discussed are potential unintended consequences of ESPS (including selecting for adverse traits, altering population demographics, exacerbating inequalities in society, and devaluing certain traits). Recommendations for the responsible communication about ESPS by practitioners are provided, and a call for a society-wide conversation about this technology is made. (Funded by the National Institute on Aging and others.).


Subject(s)
Embryo, Mammalian , Fertilization in Vitro , Genetic Testing , Genetic Variation , Multifactorial Inheritance/genetics , Phenotype , Preimplantation Diagnosis , Educational Status , Gene-Environment Interaction , Genome-Wide Association Study , Humans , Predictive Value of Tests
12.
Am J Hum Genet ; 106(3): 371-388, 2020 03 05.
Article in English | MEDLINE | ID: mdl-32142644

ABSTRACT

The population of the United States is shaped by centuries of migration, isolation, growth, and admixture between ancestors of global origins. Here, we assemble a comprehensive view of recent population history by studying the ancestry and population structure of more than 32,000 individuals in the US using genetic, ancestral birth origin, and geographic data from the National Geographic Genographic Project. We identify migration routes and barriers that reflect historical demographic events. We also uncover the spatial patterns of relatedness in subpopulations through the combination of haplotype clustering, ancestral birth origin analysis, and local ancestry inference. Examples of these patterns include substantial substructure and heterogeneity in Hispanics/Latinos, isolation-by-distance in African Americans, elevated levels of relatedness and homozygosity in Asian immigrants, and fine-scale structure in European descents. Taken together, our results provide detailed insights into the genetic structure and demographic history of the diverse US population.


Subject(s)
Emigration and Immigration , Genetics, Population , Haplotypes , Cluster Analysis , Demography , Humans , United States
14.
Am J Hum Genet ; 104(6): 1169-1181, 2019 06 06.
Article in English | MEDLINE | ID: mdl-31155286

ABSTRACT

Polygenic scores (PSs) are becoming a useful tool to identify individuals with high genetic risk for complex diseases, and several projects are currently testing their utility for translational applications. It is also tempting to use PSs to assess whether genetic variation can explain a part of the geographic distribution of a phenotype. However, it is not well known how the population genetic properties of the training and target samples affect the geographic distribution of PSs. Here, we evaluate geographic differences, and related biases, of PSs in Finland in a geographically well-defined sample of 2,376 individuals from the National FINRISK study. First, we detect geographic differences in PSs for coronary artery disease (CAD), rheumatoid arthritis, schizophrenia, waist-hip ratio (WHR), body-mass index (BMI), and height, but not for Crohn disease or ulcerative colitis. Second, we use height as a model trait to thoroughly assess the possible population genetic biases in PSs and apply similar approaches to the other phenotypes. Most importantly, we detect suspiciously large accumulations of geographic differences for CAD, WHR, BMI, and height, suggesting bias arising from the population's genetic structure rather than from a direct genotype-phenotype association. This work demonstrates how sensitive the geographic patterns of current PSs are for small biases even within relatively homogeneous populations and provides simple tools to identify such biases. A thorough understanding of the effects of population genetic structure on PSs is essential for translational applications of PSs.


Subject(s)
Genetic Markers , Genetics, Population , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide , Quantitative Trait, Heritable , Adult , Aged , Arthritis, Rheumatoid/epidemiology , Arthritis, Rheumatoid/genetics , Body Mass Index , Colitis, Ulcerative/epidemiology , Colitis, Ulcerative/genetics , Coronary Artery Disease/epidemiology , Coronary Artery Disease/genetics , Crohn Disease/epidemiology , Crohn Disease/genetics , Female , Finland/epidemiology , Genetic Association Studies , Geography , Humans , Male , Middle Aged , Risk Factors , Schizophrenia/epidemiology , Schizophrenia/genetics , Waist-Hip Ratio
15.
Mol Biol Evol ; 37(4): 994-1006, 2020 04 01.
Article in English | MEDLINE | ID: mdl-31848607

ABSTRACT

Native American genetic variation remains underrepresented in most catalogs of human genome sequencing data. Previous genotyping efforts have revealed that Mexico's Indigenous population is highly differentiated and substructured, thus potentially harboring higher proportions of private genetic variants of functional and biomedical relevance. Here we have targeted the coding fraction of the genome and characterized its full site frequency spectrum by sequencing 76 exomes from five Indigenous populations across Mexico. Using diffusion approximations, we modeled the demographic history of Indigenous populations from Mexico with northern and southern ethnic groups splitting 7.2 KYA and subsequently diverging locally 6.5 and 5.7 KYA, respectively. Selection scans for positive selection revealed BCL2L13 and KBTBD8 genes as potential candidates for adaptive evolution in Rarámuris and Triquis, respectively. BCL2L13 is highly expressed in skeletal muscle and could be related to physical endurance, a well-known phenotype of the northern Mexico Rarámuri. The KBTBD8 gene has been associated with idiopathic short stature and we found it to be highly differentiated in Triqui, a southern Indigenous group from Oaxaca whose height is extremely low compared to other Native populations.


Subject(s)
Adaptation, Biological/genetics , American Indian or Alaska Native/genetics , Evolution, Molecular , Genetic Variation , Exome , Humans , Mexico , Phylogeography
16.
Am J Hum Genet ; 102(5): 760-775, 2018 05 03.
Article in English | MEDLINE | ID: mdl-29706349

ABSTRACT

Finland provides unique opportunities to investigate population and medical genomics because of its adoption of unified national electronic health records, detailed historical and birth records, and serial population bottlenecks. We assembled a comprehensive view of recent population history (≤100 generations), the timespan during which most rare-disease-causing alleles arose, by comparing pairwise haplotype sharing from 43,254 Finns to that of 16,060 Swedes, Estonians, Russians, and Hungarians from geographically and linguistically adjacent countries with different population histories. We find much more extensive sharing in Finns, with at least one ≥ 5 cM tract on average between pairs of unrelated individuals. By coupling haplotype sharing with fine-scale birth records from more than 25,000 individuals, we find that although haplotype sharing broadly decays with geographical distance, there are pockets of excess haplotype sharing; individuals from northeast Finland typically share several-fold more of their genome in identity-by-descent segments than individuals from southwest regions. We estimate recent effective population-size changes through time across regions of Finland, and we find that there was more continuous gene flow as Finns migrated from southwest to northeast between the early- and late-settlement regions than was dichotomously described previously. Lastly, we show that haplotype sharing is locally enriched by an order of magnitude among pairs of individuals sharing rare alleles and especially among pairs sharing rare disease-causing variants. Our work provides a general framework for using haplotype sharing to reconstruct an integrative view of recent population history and gain insight into the evolutionary origins of rare variants contributing to disease.


Subject(s)
Disease/genetics , Genetics, Population , Haplotypes/genetics , Finland , Gene Flow , Genetic Variation , Geography , Human Migration , Humans , Parturition , Population Density , Time Factors
17.
Am J Hum Genet ; 102(6): 1204-1211, 2018 06 07.
Article in English | MEDLINE | ID: mdl-29861106

ABSTRACT

There is a limited understanding about the impact of rare protein-truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein-truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, and ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization, and reduced age at enrollment. Gene sets implicated from GWASs did not show a significant protein-truncating variants burden beyond what was captured by established Mendelian genes. In conclusion, we provide a thorough investigation of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.


Subject(s)
Mutation/genetics , Open Reading Frames/genetics , Databases, Genetic , Ethnicity/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Phenotype , Proteins/genetics
18.
Proc Natl Acad Sci U S A ; 115(52): 13324-13329, 2018 12 26.
Article in English | MEDLINE | ID: mdl-30530665

ABSTRACT

Skin pigmentation is under strong directional selection in northern European and Asian populations. The indigenous KhoeSan populations of far southern Africa have lighter skin than other sub-Saharan African populations, potentially reflecting local adaptation to a region of Africa with reduced UV radiation. Here, we demonstrate that a canonical Eurasian skin pigmentation gene, SLC24A5, was introduced to southern Africa via recent migration and experienced strong adaptive evolution in the KhoeSan. To reconstruct the evolution of skin pigmentation, we collected phenotypes from over 400 ≠Khomani San and Nama individuals and high-throughput sequenced candidate pigmentation genes. The derived causal allele in SLC24A5, p.Ala111Thr, significantly lightens basal skin pigmentation in the KhoeSan and explains 8 to 15% of phenotypic variance in these populations. The frequency of this allele (33 to 53%) is far greater than expected from colonial period European gene flow; however, the most common derived haplotype is identical among European, eastern African, and KhoeSan individuals. Using four-population demographic simulations with selection, we show that the allele was introduced into the KhoeSan only 2,000 y ago via a back-to-Africa migration and then experienced a selective sweep (s = 0.04 to 0.05 in ≠Khomani and Nama). The SLC24A5 locus is both a rare example of intense, ongoing adaptation in very recent human history, as well as an adaptive gene flow at a pigmentation locus in humans.


Subject(s)
Antiporters/genetics , Skin Pigmentation/genetics , Adult , Africa, Southern , Alleles , Antiporters/metabolism , Asian People/genetics , Black People/genetics , Demography/methods , Evolution, Molecular , Female , Gene Flow , Genetic Variation/genetics , Genetics, Population/methods , Genotype , Haplotypes , Humans , Male , Phenotype , Phylogeny , Polymorphism, Single Nucleotide/genetics , White People/genetics
19.
Am J Hum Genet ; 100(4): 635-649, 2017 Apr 06.
Article in English | MEDLINE | ID: mdl-28366442

ABSTRACT

The vast majority of genome-wide association studies (GWASs) are performed in Europeans, and their transferability to other populations is dependent on many factors (e.g., linkage disequilibrium, allele frequencies, genetic architecture). As medical genomics studies become increasingly large and diverse, gaining insights into population history and consequently the transferability of disease risk measurement is critical. Here, we disentangle recent population history in the widely used 1000 Genomes Project reference panel, with an emphasis on populations underrepresented in medical studies. To examine the transferability of single-ancestry GWASs, we used published summary statistics to calculate polygenic risk scores for eight well-studied phenotypes. We identify directional inconsistencies in all scores; for example, height is predicted to decrease with genetic distance from Europeans, despite robust anthropological evidence that West Africans are as tall as Europeans on average. To gain deeper quantitative insights into GWAS transferability, we developed a complex trait coalescent-based simulation framework considering effects of polygenicity, causal allele frequency divergence, and heritability. As expected, correlations between true and inferred risk are typically highest in the population from which summary statistics were derived. We demonstrate that scores inferred from European GWASs are biased by genetic drift in other populations even when choosing the same causal variants and that biases in any direction are possible and unpredictable. This work cautions that summarizing findings from large-scale GWASs may have limited portability to other populations using standard approaches and highlights the need for generalized risk prediction methods and the inclusion of more diverse individuals in medical genomics.


Subject(s)
Genetic Predisposition to Disease , Racial Groups/genetics , Americas , Genetics, Medical , Genetics, Population , Haplotypes , Human Genome Project , Humans , Multifactorial Inheritance
20.
Am J Phys Anthropol ; 168 Suppl 67: 4-26, 2019 01.
Article in English | MEDLINE | ID: mdl-30408154

ABSTRACT

Like many highly variable human traits, more than a dozen genes are known to contribute to the full range of skin color. However, the historical bias in favor of genetic studies in European and European-derived populations has blinded us to the magnitude of pigmentation's complexity. As deliberate efforts are being made to better characterize diverse global populations and new sequencing technologies, better measurement tools, functional assessments, predictive modeling, and ancient DNA analyses become more widely accessible, we are beginning to appreciate how limited our understanding of the genetic bases of human skin color have been. Novel variants in genes not previously linked to pigmentation have been identified and evidence is mounting that there are hundreds more variants yet to be found. Even for genes that have been exhaustively characterized in European populations like MC1R, OCA2, and SLC24A5, research in previously understudied groups is leading to a new appreciation of the degree to which genetic diversity, epistatic interactions, pleiotropy, admixture, global and local adaptation, and cultural practices operate in population-specific ways to shape the genetic architecture of skin color. Furthermore, we are coming to terms with how factors like tanning response and barrier function may also have influenced selection on skin throughout human history. By examining how our knowledge of pigmentation genetics has shifted in the last decade, we can better appreciate how far we have come in understanding human diversity and the still long road ahead for understanding many complex human traits.


Subject(s)
Biological Evolution , Skin Physiological Phenomena , Skin Pigmentation , Anthropology, Physical , Antiporters/genetics , Genetics, Population , Guanine Nucleotide Exchange Factors/genetics , Humans , Membrane Proteins/genetics , Membrane Transport Proteins/genetics , Receptor, Melanocortin, Type 1/genetics , Skin/metabolism , Skin Pigmentation/genetics , Skin Pigmentation/physiology , Ubiquitin-Protein Ligases
SELECTION OF CITATIONS
SEARCH DETAIL