ABSTRACT
We conduct high coverage (>30×) whole-genome sequencing of 180 individuals from 12 indigenous African populations. We identify millions of unreported variants, many predicted to be functionally important. We observe that the ancestors of southern African San and central African rainforest hunter-gatherers (RHG) diverged from other populations >200 kya and maintained a large effective population size. We observe evidence for ancient population structure in Africa and for multiple introgression events from "ghost" populations with highly diverged genetic lineages. Although currently geographically isolated, we observe evidence for gene flow between eastern and southern Khoesan-speaking hunter-gatherer populations lasting until â¼12 kya. We identify signatures of local adaptation for traits related to skin color, immune response, height, and metabolic processes. We identify a positively selected variant in the lightly pigmented San that influences pigmentation in vitro by regulating the enhancer activity and gene expression of PDPK1.
Subject(s)
Acclimatization , Skin Pigmentation , Humans , Whole Genome Sequencing , Population Density , Africa , 3-Phosphoinositide-Dependent Protein KinasesABSTRACT
Oakleaf butterflies in the genus Kallima have a polymorphic wing phenotype, enabling these insects to masquerade as dead leaves. This iconic example of protective resemblance provides an interesting evolutionary paradigm that can be employed to study biodiversity. We integrated multi-omic data analyses and functional validation to infer the evolutionary history of Kallima species and investigate the genetic basis of their variable leaf wing patterns. We find that Kallima butterflies diversified in the eastern Himalayas and dispersed to East and Southeast Asia. Moreover, we find that leaf wing polymorphism is controlled by the wing patterning gene cortex, which has been maintained in Kallima by long-term balancing selection. Our results provide macroevolutionary and microevolutionary insights into a model species originating from a mountain ecosystem.
Subject(s)
Butterflies , Animals , Biodiversity , Biological Evolution , Butterflies/genetics , Ecosystem , Phenotype , Wings, AnimalABSTRACT
During human evolution, the knee adapted to the biomechanical demands of bipedalism by altering chondrocyte developmental programs. This adaptive process was likely not without deleterious consequences to health. Today, osteoarthritis occurs in 250 million people, with risk variants enriched in non-coding sequences near chondrocyte genes, loci that likely became optimized during knee evolution. We explore this relationship by epigenetically profiling joint chondrocytes, revealing ancient selection and recent constraint and drift on knee regulatory elements, which also overlap osteoarthritis variants that contribute to disease heritability by tending to modify constrained functional sequence. We propose a model whereby genetic violations to regulatory constraint, tolerated during knee development, lead to adult pathology. In support, we discover a causal enhancer variant (rs6060369) present in billions of people at a risk locus (GDF5-UQCC1), showing how it impacts mouse knee-shape and osteoarthritis. Overall, our methods link an evolutionarily novel aspect of human anatomy to its pathogenesis.
Subject(s)
Chondrocytes/physiology , Knee Joint/physiology , Osteoarthritis/genetics , Animals , Biological Evolution , Chondrocytes/metabolism , Evolution, Molecular , Genetic Predisposition to Disease/genetics , Growth Differentiation Factor 5/genetics , Growth Differentiation Factor 5/metabolism , HEK293 Cells , Humans , Knee/physiology , Mice , NIH 3T3 Cells , Regulatory Sequences, Nucleic Acid/genetics , Risk FactorsABSTRACT
Underrepresentation of Asian genomes has hindered population and medical genetics research on Asians, leading to population disparities in precision medicine. By whole-genome sequencing of 4,810 Singapore Chinese, Malays, and Indians, we found 98.3 million SNPs and small insertions or deletions, over half of which are novel. Population structure analysis demonstrated great representation of Asian genetic diversity by three ethnicities in Singapore and revealed a Malay-related novel ancestry component. Furthermore, demographic inference suggested that Malays split from Chinese â¼24,800 years ago and experienced significant admixture with East Asians â¼1,700 years ago, coinciding with the Austronesian expansion. Additionally, we identified 20 candidate loci for natural selection, 14 of which harbored robust associations with complex traits and diseases. Finally, we show that our data can substantially improve genotype imputation in diverse Asian and Oceanian populations. These results highlight the value of our data as a resource to empower human genetics discovery across broad geographic regions.
Subject(s)
Genetics, Population , Genome, Human/genetics , Selection, Genetic , Whole Genome Sequencing , Asian People/genetics , Female , Genotype , Humans , Malaysia/epidemiology , Male , Polymorphism, Single Nucleotide/genetics , Singapore/epidemiologyABSTRACT
FOXP2, initially identified for its role in human speech, contains two nonsynonymous substitutions derived in the human lineage. Evidence for a recent selective sweep in Homo sapiens, however, is at odds with the presence of these substitutions in archaic hominins. Here, we comprehensively reanalyze FOXP2 in hundreds of globally distributed genomes to test for recent selection. We do not find evidence of recent positive or balancing selection at FOXP2. Instead, the original signal appears to have been due to sample composition. Our tests do identify an intronic region that is enriched for highly conserved sites that are polymorphic among humans, compatible with a loss of function in humans. This region is lowly expressed in relevant tissue types that were tested via RNA-seq in human prefrontal cortex and RT-PCR in immortalized human brain cells. Our results represent a substantial revision to the adaptive history of FOXP2, a gene regarded as vital to human evolution.
Subject(s)
Forkhead Transcription Factors/genetics , Brain/cytology , Brain/metabolism , Cell Line , Databases, Genetic , Exons , Female , Genome, Human , Haplotypes , Humans , Introns , Male , Markov Chains , Polymorphism, Single Nucleotide , Prefrontal Cortex/metabolismABSTRACT
Understanding the physiology and genetics of human hypoxia tolerance has important medical implications, but this phenomenon has thus far only been investigated in high-altitude human populations. Another system, yet to be explored, is humans who engage in breath-hold diving. The indigenous Bajau people ("Sea Nomads") of Southeast Asia live a subsistence lifestyle based on breath-hold diving and are renowned for their extraordinary breath-holding abilities. However, it is unknown whether this has a genetic basis. Using a comparative genomic study, we show that natural selection on genetic variants in the PDE10A gene have increased spleen size in the Bajau, providing them with a larger reservoir of oxygenated red blood cells. We also find evidence of strong selection specific to the Bajau on BDKRB2, a gene affecting the human diving reflex. Thus, the Bajau, and possibly other diving populations, provide a new opportunity to study human adaptation to hypoxia tolerance. VIDEO ABSTRACT.
Subject(s)
Adaptation, Physiological , Breath Holding , Diving , Organ Size , Phosphoric Diester Hydrolases/genetics , Adolescent , Adult , Aged , Aged, 80 and over , Alleles , Asian People , Erythrocytes/cytology , Ethnicity , Female , Genetic Variation , Genomics , Humans , Hypoxia , Indonesia/ethnology , Lung , Male , Middle Aged , Oxygen/physiology , Phenotype , Polymorphism, Single Nucleotide , Selection, Genetic , Spleen/physiology , White People , Young AdultABSTRACT
We assembled genome-wide data from 16 prehistoric Africans. We show that the anciently divergent lineage that comprises the primary ancestry of the southern African San had a wider distribution in the past, contributing approximately two-thirds of the ancestry of Malawi hunter-gatherers â¼8,100-2,500 years ago and approximately one-third of the ancestry of Tanzanian hunter-gatherers â¼1,400 years ago. We document how the spread of farmers from western Africa involved complete replacement of local hunter-gatherers in some regions, and we track the spread of herders by showing that the population of a â¼3,100-year-old pastoralist from Tanzania contributed ancestry to people from northeastern to southern Africa, including a â¼1,200-year-old southern African pastoralist. The deepest diversifications of African lineages were complex, involving either repeated gene flow among geographically disparate groups or a lineage more deeply diverging than that of the San contributing more to some western African populations than to others. We finally leverage ancient genomes to document episodes of natural selection in southern African populations. PAPERCLIP.
Subject(s)
Black People/genetics , Genome, Human , Africa , Bone and Bones/chemistry , DNA, Ancient/analysis , Female , Fossils , Genetics, Medical , Genetics, Population , Genome-Wide Association Study , Humans , Life Style , MaleABSTRACT
Humans differ in the outcome that follows exposure to life-threatening pathogens, yet the extent of population differences in immune responses and their genetic and evolutionary determinants remain undefined. Here, we characterized, using RNA sequencing, the transcriptional response of primary monocytes from Africans and Europeans to bacterial and viral stimuli-ligands activating Toll-like receptor pathways (TLR1/2, TLR4, and TLR7/8) and influenza virus-and mapped expression quantitative trait loci (eQTLs). We identify numerous cis-eQTLs that contribute to the marked differences in immune responses detected within and between populations and a strong trans-eQTL hotspot at TLR1 that decreases expression of pro-inflammatory genes in Europeans only. We find that immune-responsive regulatory variants are enriched in population-specific signals of natural selection and show that admixture with Neandertals introduced regulatory variants into European genomes, affecting preferentially responses to viral challenges. Together, our study uncovers evolutionarily important determinants of differences in host immune responsiveness between human populations.
Subject(s)
Adaptation, Physiological/genetics , Adaptation, Physiological/immunology , Adaptive Immunity , Neanderthals/genetics , Neanderthals/immunology , Adaptive Immunity/genetics , Alleles , Animals , Bacterial Infections/genetics , Bacterial Infections/immunology , Base Sequence , Biological Evolution , Black People/genetics , Gene Expression Regulation , Genetic Variation , Humans , Immune System , Quantitative Trait Loci , RNA/genetics , Selection, Genetic , Sequence Analysis, RNA , Toll-Like Receptors/genetics , Transcription, Genetic , Virus Diseases/genetics , Virus Diseases/immunology , White People/geneticsABSTRACT
Individuals from different populations vary considerably in their susceptibility to immune-related diseases. To understand how genetic variation and natural selection contribute to these differences, we tested for the effects of African versus European ancestry on the transcriptional response of primary macrophages to live bacterial pathogens. A total of 9.3% of macrophage-expressed genes show ancestry-associated differences in the gene regulatory response to infection, and African ancestry specifically predicts a stronger inflammatory response and reduced intracellular bacterial growth. A large proportion of these differences are under genetic control: for 804 genes, more than 75% of ancestry effects on the immune response can be explained by a single cis- or trans-acting expression quantitative trait locus (eQTL). Finally, we show that genetic effects on the immune response are strongly enriched for recent, population-specific signatures of adaptation. Together, our results demonstrate how historical selective events continue to shape human phenotypic diversity today, including for traits that are key to controlling infection.
ABSTRACT
Since the identification of sickle cell trait as a heritable form of resistance to malaria, candidate gene studies, linkage analysis paired with sequencing, and genome-wide association (GWA) studies have revealed many examples of genetic resistance and susceptibility to infectious diseases. GWA studies enabled the identification of many common variants associated with small shifts in susceptibility to infectious diseases. This is exemplified by multiple loci associated with leprosy, malaria, HIV, tuberculosis, and coronavirus disease 2019 (COVID-19), which illuminate genetic architecture and implicate pathways underlying pathophysiology. Despite these successes, most of the heritability of infectious diseases remains to be explained. As the field advances, current limitations may be overcome by applying methodological innovations such as cellular GWA studies and phenome-wide association (PheWA) studies as well as by improving methodological rigor with more precise case definitions, deeper phenotyping, increased cohort diversity, and functional validation of candidate loci in the laboratory or human challenge studies.
Subject(s)
COVID-19 , Communicable Diseases , Humans , Genome-Wide Association Study , COVID-19/genetics , Communicable Diseases/genetics , Human GeneticsABSTRACT
Humans exhibit considerable variability in their immune responses to the same immune challenges. Such variation is widespread and affects individual and population-level susceptibility to infectious diseases and immune disorders. Although the factors influencing immune response diversity are partially understood, what mechanisms lead to the wide range of immune traits in healthy individuals remain largely unexplained. Here, we discuss the role that natural selection has played in driving phenotypic differences in immune responses across populations and present-day susceptibility to immune-related disorders. Further, we touch on future directions in the field of immunogenomics, highlighting the value of expanding this work to human populations globally, the utility of modeling the immune response as a dynamic process, and the importance of considering the potential polygenic nature of natural selection. Identifying loci acted upon by evolution may further pinpoint variants critically involved in disease etiology, and designing studies to capture these effects will enrich our understanding of the genetic contributions to immunity and immune dysregulation.
Subject(s)
Selection, Genetic , Humans , Animals , Genetic Predisposition to Disease , Immunity/genetics , Genetic Variation , Genetics, Population , Phenotype , Disease Susceptibility/immunologyABSTRACT
Lifespan varies significantly among mammals, with more than 100-fold difference between the shortest and longest living species. This natural difference may uncover the evolutionary forces and molecular features that define longevity. To understand the relationship between gene expression variation and longevity, we conducted a comparative transcriptomics analysis of liver, kidney, and brain tissues of 103 mammalian species. We found that few genes exhibit common expression patterns with longevity in the three organs analyzed. However, pathways related to translation fidelity, such as nonsense-mediated decay and eukaryotic translation elongation, correlated with longevity across mammals. Analyses of selection pressure found that selection intensity related to the direction of longevity-correlated genes is inconsistent across organs. Furthermore, expression of methionine restriction-related genes correlated with longevity and was under strong selection in long-lived mammals, suggesting that a common strategy is utilized by natural selection and artificial intervention to control lifespan. Our results indicate that lifespan regulation via gene expression is driven through polygenic and indirect natural selection.
Subject(s)
Longevity , Mammals , Animals , Mammals/classification , Mammals/genetics , Mammals/growth & development , Mammals/metabolism , Longevity/genetics , Gene Expression Profiling , Gene Expression , Liver/metabolism , Brain/metabolism , Kidney/metabolism , Humans , Male , FemaleABSTRACT
The gene content in a metagenomic pool defines the function potential of a microbial community. Natural selection, operating on the level of genomes or genes, shapes the evolution of community functions by enriching some genes while depriving the others. Despite the importance of microbiomes in the environment and health, a general metric to evaluate the community-wide fitness of microbial genes remains lacking. In this work, we adapt the classic neutral model of species and use it to predict how the abundances of different genes will be shaped by selection, regardless of at which level the selection acts. We establish a simple metric that quantitatively infers the average survival capability of each gene in a microbiome. We then experimentally validate the predictions using synthetic communities of barcoded Escherichia coli strains undergoing neutral assembly and competition. We further show that this approach can be applied to publicly available metagenomic datasets to gain insights into the environment-function interplay of natural microbiomes.
Subject(s)
Microbiota , Microbiota/genetics , Metagenome/genetics , Selection, Genetic , Genes, MicrobialABSTRACT
A central goal in evolutionary biology is to understand how different evolutionary processes cause trait change in wild populations. However, quantifying evolutionary change in the wild requires linking trait change to shifts in allele frequencies at causal loci. Nevertheless, datasets that allow for such tests are extremely rare and existing theoretical approaches poorly account for the evolutionary dynamics that likely occur in ecological settings. Using a decade-long integrative phenome-to-genome time-series dataset on wild threespine stickleback (Gasterosteus aculeatus), we identified how different modes of selection (directional, episodic, and balancing) drive microevolutionary change in correlated traits over time. Most strikingly, we show that feeding traits changed by as much 25% across 10 generations which was driven by changes in the genetic architecture (i.e., in both genomic breeding values and allele frequencies at genetic loci for feeding traits). Importantly, allele frequencies at genetic loci related to feeding traits changed at a rate greater than expected under drift, suggesting that the observed change was a result of directional selection. Allele frequency dynamics of loci related to swimming traits appeared to be under fluctuating selection evident in periodic population crashes in this system. Our results show that microevolutionary change in a wild population is characterized by different modes of selection acting simultaneously on different traits, which likely has important consequences for the evolution of correlated traits. Our study provides one of the most thorough descriptions to date of how microevolutionary processes result in trait change in a natural population.
Subject(s)
Biological Evolution , Gene Frequency , Selection, Genetic , Smegmamorpha , Animals , Smegmamorpha/genetics , Smegmamorpha/physiology , PhenotypeABSTRACT
In addition to the canonical right-handed double helix, other DNA structures, termed 'non-B DNA', can form in the genomes across the tree of life. Non-B DNA regulates multiple cellular processes, including replication and transcription, yet its presence is associated with elevated mutagenicity and genome instability. These discordant cellular roles fuel the enormous potential of non-B DNA to drive genomic and phenotypic evolution. Here we discuss recent studies establishing non-B DNA structures as novel functional elements subject to natural selection, affecting evolution of transposable elements (TEs), and specifying centromeres. By highlighting the contributions of non-B DNA to repeated evolution and adaptation to changing environments, we conclude that evolutionary analyses should include a perspective of not only DNA sequence, but also its structure.
Subject(s)
DNA Transposable Elements , Genomics , Humans , DNA Transposable Elements/genetics , Base Sequence , Genomic Instability/genetics , Evolution, MolecularABSTRACT
A growing body of work has addressed human adaptations to diverse environments using genomic data, but few studies have connected putatively selected alleles to phenotypes, much less among underrepresented populations such as Amerindians. Studies of natural selection and genotype-phenotype relationships in underrepresented populations hold potential to uncover previously undescribed loci underlying evolutionarily and biomedically relevant traits. Here, we worked with the Tsimane and the Moseten, two Amerindian populations inhabiting the Bolivian lowlands. We focused most intensively on the Tsimane, because long-term anthropological work with this group has shown that they have a high burden of both macro and microparasites, as well as minimal cardiometabolic disease or dementia. We therefore generated genome-wide genotype data for Tsimane individuals to study natural selection, and paired this with blood mRNA-seq as well as cardiometabolic and immune biomarker data generated from a larger sample that included both populations. In the Tsimane, we identified 21 regions that are candidates for selective sweeps, as well as 5 immune traits that show evidence for polygenic selection (e.g., C-reactive protein levels and the response to coronaviruses). Genes overlapping candidate regions were strongly enriched for known involvement in immune-related traits, such as abundance of lymphocytes and eosinophils. Importantly, we were also able to draw on extensive phenotype information for the Tsimane and Moseten and link five regions (containing PSD4, MUC21 and MUC22, TOX2, ANXA6, and ABCA1) with biomarkers of immune and metabolic function. Together, our work highlights the utility of pairing evolutionary analyses with anthropological and biomedical data to gain insight into the genetic basis of health-related traits.
Subject(s)
Genetics, Population , Health Status , Humans , Biomarkers , Bolivia , Genomics , Genotype , Phenotype , Polymorphism, Single Nucleotide , Selection, Genetic , Genome, HumanABSTRACT
Species' phenotypic characteristics often remain unchanged over long stretches of geological time. Stabilizing selection-in which fitness is highest for intermediate phenotypes and lowest for the extremes-has been widely invoked as responsible for this pattern. At the community level, such stabilizing selection acting individually on co-occurring species is expected to produce a rugged fitness landscape on which different species occupy distinct fitness peaks. However, even with an explosion of microevolutionary field studies over the past four decades, evidence for persistent stabilizing selection driving long-term stasis is lacking. Nonetheless, biologists continue to invoke stabilizing selection as a major factor explaining macroevolutionary patterns. Here, by directly measuring natural selection in the wild, we identified a complex community-wide fitness surface in which four Anolis lizard species each occupy a distinct fitness peak close to their mean phenotype. The presence of local fitness optima within species, and fitness valleys between species, presents a barrier to adaptive evolutionary change and acts to maintain species differences through time. However, instead of continuously operating stabilizing selection, we found that species were maintained on these peaks by the combination of many independent periods among which selection fluctuated in form, strength, direction, or existence and in which stabilizing selection rarely occurred. Our results suggest that lack of substantial phenotypic evolutionary change through time may be the result of selection, but not persistent stabilizing selection as classically envisioned.
Subject(s)
Biological Evolution , Selection, Genetic , Phenotype , Environment , BiotaABSTRACT
Gene regulatory networks (GRNs) are the core engine of organismal development. If we would like to understand the origin and diversification of phenotypes, it is necessary to consider the structure of GRNs in order to reconstruct the links between genetic mutations and phenotypic change. Much of the progress in evolutionary developmental biology, however, has occurred without a nuanced consideration of the evolution of functional relationships between genes, especially in the context of their broader network interactions. Characterizing and comparing GRNs across traits and species in a more detailed way will allow us to determine how network position influences what genes drive adaptive evolution. In this perspective paper, we consider the architecture of developmental GRNs and how positive selection strength may vary across a GRN. We then propose several testable models for these patterns of selection and experimental approaches to test these models.
Subject(s)
Gene Regulatory Networks , Gene Regulatory Networks/genetics , MutationABSTRACT
Food security is threatened by climate change, with heat and drought being the main stresses affecting crop physiology and ecosystem services, such as plant-pollinator interactions. We hypothesize that tracking and ranking pollinators' preferences for flowers under environmental pressure could be used as a marker of plant quality for agricultural breeding to increase crop stress tolerance. Despite increasing relevance of flowers as the most stress sensitive organs, phenotyping platforms aim at identifying traits of resilience by assessing the plant physiological status through remote sensing-assisted vegetative indexes, but find strong bottlenecks in quantifying flower traits and in accurate genotype-to-phenotype prediction. However, as the transport of photoassimilates from leaves (sources) to flowers (sinks) is reduced in low-resilient plants, flowers are better indicators than leaves of plant well-being. Indeed, the chemical composition and amount of pollen and nectar that flowers produce, which ultimately serve as food resources for pollinators, change in response to environmental cues. Therefore, pollinators' preferences could be used as a measure of functional source-to-sink relationships for breeding decisions. To achieve this challenging goal, we propose to develop a pollinator-assisted phenotyping and selection platform for automated quantification of Genotype × Environment × Pollinator interactions through an insect geo-positioning system. Pollinator-assisted selection can be validated by metabolic, transcriptomic, and ionomic traits, and mapping of candidate genes, linking floral and leaf traits, pollinator preferences, plant resilience, and crop productivity. This radical new approach can change the current paradigm of plant phenotyping and find new paths for crop redomestication and breeding assisted by ecological decisions.
Subject(s)
Crops, Agricultural , Flowers , Phenotype , Plant Breeding , Pollination , Stress, Physiological , Pollination/physiology , Crops, Agricultural/genetics , Crops, Agricultural/physiology , Plant Breeding/methods , Flowers/physiology , Flowers/genetics , Animals , GenotypeABSTRACT
One gene does not equal one RNA or protein. The genomic revolution has revealed numerous different RNA and protein molecules that can be produced from one gene, such as circular RNAs generated by back-splicing, proteins with residues mismatching the genomic encoding because of RNA editing, and proteins extended in the C terminus via stop codon readthrough in translation. Are these diverse products results of exquisite gene regulations or imprecise biological processes? While there are cases where the gene product diversity appears beneficial, genome-scale patterns suggest that much of this diversity arises from nonadaptive, molecular errors. This finding has important implications for studying the functions of diverse gene products and for understanding the fundamental properties and evolution of cellular life.