1.
Genetics ; 2024 Jul 16.
Article in English | MEDLINE | ID: mdl-39013011

ABSTRACT

Our knowledge of human evolutionary history has been greatly advanced by paleogenomics. Since the 2020s, the study of ancient DNA has increasingly focused on reconstructing the recent past. However, the accuracy of paleogenomic methods in resolving questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation remains an open question. We evaluated the performance and behavior of two commonly used methods, qpAdm and the f3-statistic, on admixture inference under a diversity of demographic models and data conditions. We performed two complementary simulation approaches - firstly exploring a wide demographic parameter space under four simple demographic models of varying complexities and configurations using branch-length data from two chromosomes - and secondly, we analyzed a model of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudo-haploidization. We observe population differentiation is the primary factor driving qpAdm performance. Notably, whilst complex gene-flow histories influence which models are classified as plausible, they do not reduce overall performance. Under conditions reflective of the historical period, qpAdm most frequently identifies the true model as plausible amongst a small candidate set of closely related populations. To increase the utility for resolving fine-scaled hypotheses, we provide a heuristic for further distinguishing between candidate models that incorporates qpAdm model P-values and f3-statistics. Finally, we demonstrate a significant performance increase for qpAdm using whole-genome branch-length f2-statistics, highlighting the potential for improved demographic inference that could be achieved with future advancements in f-statistic estimations.

2.
bioRxiv ; 2024 Apr 18.
Article in English | MEDLINE | ID: mdl-38659893

ABSTRACT

The Yamnaya archaeological complex appeared around 3300BCE across the steppes north of the Black and Caspian Seas, and by 3000BCE reached its maximal extent from Hungary in the west to Kazakhstan in the east. To localize the ancestral and geographical origins of the Yamnaya among the diverse Eneolithic people that preceded them, we studied ancient DNA data from 428 individuals of which 299 are reported for the first time, demonstrating three previously unknown Eneolithic genetic clines. First, a "Caucasus-Lower Volga" (CLV) Cline suffused with Caucasus hunter-gatherer (CHG) ancestry extended between a Caucasus Neolithic southern end in Neolithic Armenia, and a steppe northern end in Berezhnovka in the Lower Volga. Bidirectional gene flow across the CLV cline created admixed intermediate populations in both the north Caucasus, such as the Maikop people, and on the steppe, such as those at the site of Remontnoye north of the Manych depression. CLV people also helped form two major riverine clines by admixing with distinct groups of European hunter-gatherers. A "Volga Cline" was formed as Lower Volga people mixed with upriver populations that had more Eastern hunter-gatherer (EHG) ancestry, creating genetically hyper-variable populations as at Khvalynsk in the Middle Volga. A "Dnipro Cline" was formed as CLV people bearing both Caucasus Neolithic and Lower Volga ancestry moved west and acquired Ukraine Neolithic hunter-gatherer (UNHG) ancestry to establish the population of the Serednii Stih culture from which the direct ancestors of the Yamnaya themselves were formed around 4000BCE. This population grew rapidly after 3750-3350BCE, precipitating the expansion of people of the Yamnaya culture who totally displaced previous groups on the Volga and further east, while admixing with more sedentary groups in the west. 
CLV cline people with Lower Volga ancestry contributed four fifths of the ancestry of the Yamnaya, but also, entering Anatolia from the east, contributed at least a tenth of the ancestry of Bronze Age Central Anatolians, where the Hittite language, related to the Indo-European languages spread by the Yamnaya, was spoken. We thus propose that the final unity of the speakers of the "Proto-Indo-Anatolian" ancestral language of both Anatolian and Indo-European languages can be traced to CLV cline people sometime between 4400-4000 BCE.

3.
bioRxiv ; 2023 Nov 15.
Article in English | MEDLINE | ID: mdl-38014190

ABSTRACT

Paleogenomics has expanded our knowledge of human evolutionary history. Since the 2020s, the study of ancient DNA has increased its focus on reconstructing the recent past. However, the accuracy of paleogenomic methods in answering questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation within the historical period remains an open question. We used two simulation approaches to evaluate the limitations and behavior of commonly used methods, qpAdm and the f3-statistic, on admixture inference. The first is based on branch-length data simulated from four simple demographic models of varying complexities and configurations. The second is an analysis of a model of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudo-haploidization. We show that under conditions resembling historical populations, qpAdm can identify a small candidate set of true sources and populations closely related to them. However, in typical ancient DNA conditions, qpAdm is unable to further distinguish between them, limiting its utility for resolving fine-scaled hypotheses. Notably, we find that complex gene-flow histories generally lead to improvements in the performance of qpAdm and observe no bias in the estimation of admixture weights. We offer a heuristic for admixture inference that incorporates admixture weight estimate and P-values of qpAdm models, and f3-statistics to enhance the power to distinguish between multiple plausible candidates. Finally, we highlight the future potential of qpAdm through whole-genome branch-length f2-statistics, demonstrating the improved demographic inference that could be achieved with advancements in f-statistic estimations.

4.
bioRxiv ; 2023 Oct 18.
Article in English | MEDLINE | ID: mdl-37904998

ABSTRACT

Although a broad range of methods exists for reconstructing population history from genome-wide single nucleotide polymorphism data, just a few methods gained popularity in archaeogenetics: principal component analysis (PCA); ADMIXTURE, an algorithm that models individuals as mixtures of multiple ancestral sources represented by actual or inferred populations; formal tests for admixture such as f3-statistics and D/f4-statistics; and qpAdm, a tool for fitting two-component and more complex admixture models to groups or individuals. Despite their popularity in archaeogenetics, which is explained by modest computational requirements and ability to analyze data of various types and qualities, protocols relying on qpAdm that screen numerous alternative models of varying complexity and find "fitting" models (often considering both estimated admixture proportions and p-values as a composite criterion of model fit) remain untested on complex simulated population histories in the form of admixture graphs of random topology. We analyzed genotype data extracted from such simulations and tested various types of high-throughput qpAdm protocols ("rotating" and "non-rotating", with or without temporal stratification of target groups and proxy ancestry sources, and with or without a "model competition" step). We caution that high-throughput qpAdm protocols may be inappropriate for exploratory analyses in poorly studied regions/periods since their false discovery rates varied between 12% and 68% depending on the details of the protocol and on the amount and quality of simulated data (i.e., >12% of fitting two-way admixture models imply gene flows that were not simulated). 
We demonstrate that for reducing false discovery rates of qpAdm protocols to nearly 0% it is advisable to use large SNP sets with low missing data rates, the rotating qpAdm protocol with a strictly enforced rule that target groups do not pre-date their proxy sources, and an unsupervised ADMIXTURE analysis as a way to verify feasible qpAdm models. Our study has a number of limitations: for instance, these recommendations depend on the assumption that the underlying genetic history is a complex admixture graph and not a stepping-stone model.

5.
PLoS Genet ; 19(9): e1010931, 2023 09.
Article in English | MEDLINE | ID: mdl-37676865

ABSTRACT

f-statistics have emerged as a first line of analysis for making inferences about demographic history from genome-wide data. Not only are they guaranteed to allow robust tests of the fits of proposed models of population history to data when analyzing full genome sequencing data, that is, all single nucleotide polymorphisms (SNPs) in the individuals being analyzed, but they are also guaranteed to allow robust tests of models for SNPs ascertained as polymorphic in a population that is an outgroup in a phylogenetic sense to all groups being analyzed. True "outgroup ascertainment" is in practice impossible in humans because our species has arisen from a substructured ancestral population that does not descend from a homogeneous ancestral population going back many hundreds of thousands of years into the past. However, initial studies suggested that non-outgroup-ascertainment schemes might produce robust enough results using f-statistics, and that motivated widespread fitting of models to data using non-outgroup-ascertained SNP panels such as the "Affymetrix Human Origins array" which has been genotyped on thousands of modern individuals from hundreds of populations, or the "1240k" in-solution enrichment reagent which has been the source of about 70% of published genome-wide data for ancient humans. In this study, we show that while analyses of population history using such panels work well for studies of relationships among non-African populations and one African outgroup, when co-modeling more than one sub-Saharan African and/or archaic human groups (Neanderthals and Denisovans), fitting of f-statistics to such SNP sets is expected to frequently lead to false rejection of true demographic histories, and failure to reject incorrect models. Analyzing panels of SNPs polymorphic in archaic humans, which has been suggested as a solution for the ascertainment problem, has limited statistical power and retains important biases.
However, by carrying out simulations of diverse demographic histories, we show that bias in inferences based on f-statistics can be minimized by ascertaining on variants common in a union of diverse African groups; such ascertainment retains high statistical power while allowing co-analysis of archaic and modern groups.


Subject(s)
African People , Demography , Phylogeny , Polymorphism, Single Nucleotide , Animals , Humans , Black People/genetics , Chromosome Mapping , Genotype , Neanderthals/genetics , Polymorphism, Single Nucleotide/genetics , African People/genetics , Demography/history , Biological Variation, Population/genetics , Models, Statistical , Bias
6.
Sci Rep ; 13(1): 8371, 2023 05 24.
Article in English | MEDLINE | ID: mdl-37225753

ABSTRACT

Thailand is a country where over 60 languages from five language families (Austroasiatic, Austronesian, Hmong-Mien, Kra-Dai, and Sino-Tibetan) are spoken. The Kra-Dai language family is the most prevalent, and Thai, the official language of the country, belongs to it. Previous genome-wide studies on Thailand populations revealed a complex population structure and put some hypotheses forward concerning the population history of the country. However, many published populations have not been co-analyzed, and some aspects of population history were not explored adequately. In this study, we employ new methods to re-analyze published genome-wide genetic data on Thailand populations, with a focus on 14 Kra-Dai-speaking groups. Our analyses reveal South Asian ancestry in Kra-Dai-speaking Lao Isan and Khonmueang, and in Austroasiatic-speaking Palaung, in contrast to a previous study in which the data were generated. We support the admixture scenario for the formation of Kra-Dai-speaking groups from Thailand who harbor both Austroasiatic-related ancestry and Kra-Dai-related ancestry from outside of Thailand. We also provide evidence of bidirectional admixture between Southern Thai and Nayu, an Austronesian-speaking group from Southern Thailand. Challenging some previously reported genetic analyses, we reveal a close genetic relationship between Nayu and Austronesian-speaking groups from Island Southeast Asia (ISEA).


Subject(s)
Asian People , Asian , Language , Humans , Asian/ethnology , Asian/genetics , Asian People/ethnology , Asian People/genetics , Thailand , Asia, Southeastern/ethnology , Genome-Wide Association Study
7.
Elife ; 12, 2023 06 29.
Article in English | MEDLINE | ID: mdl-37057893

ABSTRACT

Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with the amount of genetic drift and the proportions of mixture, is the only information needed to predict the patterns of allele frequency correlation among populations. The space of possible AGs relating populations is vast, and thus most published studies have identified fitting AGs through a manual process driven by prior hypotheses, leaving the majority of alternative models unexplored. Here, we develop a method for systematically searching the space of all AGs that can incorporate non-genetic information in the form of topology constraints. We implement this findGraphs tool within a software package, ADMIXTOOLS 2, which is a reimplementation of the ADMIXTOOLS software with new features and large performance gains. We apply this methodology to identify alternative models to AGs that played key roles in eight publications and find that in nearly all cases many alternative models fit nominally or significantly better than the published one. Our results suggest that strong claims about population history from AGs should only be made when all well-fitting and temporally plausible models share common topological features. Our re-evaluation of published data also provides insight into the population histories of humans, dogs, and horses, identifying features that are stable across the models we explored, as well as scenarios of populations relationships that differ in important ways from models that have been highlighted in the literature.


Subject(s)
Genetics, Population , Hominidae , Humans , Dogs , Animals , Horses , Gene Frequency , Software , Genetic Drift , Models, Genetic
8.
bioRxiv ; 2023 Jan 22.
Article in English | MEDLINE | ID: mdl-36711923

ABSTRACT

f-statistics have emerged as a first line of analysis for making inferences about demographic history from genome-wide data. These statistics can provide strong evidence for either admixture or cladality, which can be robust to substantial rates of errors or missing data. f-statistics are guaranteed to be unbiased under "SNP ascertainment" (analyzing non-randomly chosen subsets of single nucleotide polymorphisms) only if it relies on a population that is an outgroup for all groups analyzed. However, ascertainment on a true outgroup that is not co-analyzed with other populations is often impractical and uncommon in the literature. In this study focused on practical rather than theoretical aspects of SNP ascertainment, we show that many non-outgroup ascertainment schemes lead to false rejection of true demographic histories, as well as to failure to reject incorrect models. But the bias introduced by common ascertainments such as the 1240K panel is mostly limited to situations when more than one sub-Saharan African and/or archaic human groups (Neanderthals and Denisovans) or non-human outgroups are co-modelled, for example, f4-statistics involving one non-African group, two African groups, and one archaic group. Analyzing panels of SNPs polymorphic in archaic humans, which has been suggested as a solution for the ascertainment problem, cannot fix all these problems since for some classes of f-statistics it is not a clean outgroup ascertainment, and in other cases it demonstrates relatively low power to reject incorrect demographic models since it provides a relatively small number of variants common in anatomically modern humans. And due to the paucity of high-coverage archaic genomes, archaic individuals used for ascertainment often act as sole representatives of the respective groups in an analysis, and we show that this approach is highly problematic.
By carrying out large numbers of simulations of diverse demographic histories, we find that bias in inferences based on f-statistics introduced by non-outgroup ascertainment can be minimized if the derived allele frequency spectrum in the population used for ascertainment approaches the spectrum that existed at the root of all groups being co-analyzed. Ascertaining on sites with variants common in a diverse group of African individuals provides a good approximation to such a set of SNPs, addressing the great majority of biases and also retaining high statistical power for studying population history. Such a "pan-African" ascertainment, although not completely problem-free, allows unbiased exploration of demographic models for the widest set of archaic and modern human populations, as compared to the other ascertainment schemes we explored.

9.
Commun Biol ; 6(1): 64, 2023 01 18.
Article in English | MEDLINE | ID: mdl-36653511

ABSTRACT

Polar oceans belong to the most productive and rapidly changing environments, yet our understanding of this fragile ecosystem remains limited. Here we present an analysis of a unique set of DNA metabarcoding samples from the western Weddell Sea sampled throughout the whole water column and across five water masses with different characteristics and different origin. We focus on factors affecting the distribution of planktonic pico-nano eukaryotes and observe an ecological succession of eukaryotic communities as the water masses move away from the surface and as oxygen becomes depleted with time. At the beginning of this succession, in the photic zone, algae, bacteriovores, and predators of small eukaryotes dominate the community, while another community develops as the water sinks deeper, mostly composed of parasitoids (syndinians), mesoplankton predators (radiolarians), and diplonemids. The strongly correlated distribution of syndinians and diplonemids along the depth and oxygen gradients suggests their close ecological link and moves us closer to understanding the biological role of the latter group in the ocean ecosystem.


Subject(s)
Ecosystem , Eukaryota , Water , Oceans and Seas , Oxygen
10.
Sci Rep ; 12(1): 22507, 2022 12 29.
Article in English | MEDLINE | ID: mdl-36581666

ABSTRACT

Indian cultural influence is remarkable in present-day Mainland Southeast Asia (MSEA), and it may have stimulated early state formation in the region. Various present-day populations in MSEA harbor a low level of South Asian ancestry, but previous studies failed to detect such ancestry in any ancient individual from MSEA. In this study, we discovered a substantial level of South Asian admixture (ca. 40-50%) in a Protohistoric individual from the Vat Komnou cemetery at the Angkor Borei site in Cambodia. The location and direct radiocarbon dating result on the human bone (95% confidence interval is 78-234 calCE) indicate that this individual lived during the early period of Funan, one of the earliest states in MSEA, which shows that the South Asian gene flow to Cambodia started about a millennium earlier than indicated by previous published results of genetic dating relying on present-day populations. Plausible proxies for the South Asian ancestry source in this individual are present-day populations in Southern India, and the individual shares more genetic drift with present-day Cambodians than with most present-day East and Southeast Asian populations.


Subject(s)
DNA, Ancient , Genetics, Population , Humans , Cambodia , South Asian People , Asian People
11.
PLoS Genet ; 18(2): e1010036, 2022 02.
Article in English | MEDLINE | ID: mdl-35176016

ABSTRACT

The great ethnolinguistic diversity found today in mainland Southeast Asia (MSEA) reflects multiple migration waves of people in the past. Maritime trading between MSEA and India was established at the latest 300 BCE, and the formation of early states in Southeast Asia during the first millennium CE was strongly influenced by Indian culture, a cultural influence that is still prominent today. Several ancient Indian-influenced states were located in present-day Thailand, and various populations in the country are likely to be descendants of people from those states. To systematically explore Indian genetic heritage in MSEA populations, we generated genome-wide SNP data (using the Affymetrix Human Origins array) for 119 present-day individuals belonging to 10 ethnic groups from Thailand and co-analyzed them with published data using PCA, ADMIXTURE, and methods relying on f-statistics and on autosomal haplotypes. We found low levels of South Asian admixture in various MSEA populations for whom there is evidence of historical connections with the ancient Indian-influenced states but failed to find this genetic component in present-day hunter-gatherer groups and relatively isolated groups from the highlands of Northern Thailand. The results suggest that migration of Indian populations to MSEA may have been responsible for the spread of Indian culture in the region. Our results also support close genetic affinity between Kra-Dai-speaking (also known as Tai-Kadai) and Austronesian-speaking populations, which fits a linguistic hypothesis suggesting cladality of the two language families.


Subject(s)
Asian People/genetics , Ethnicity/genetics , Asia, Southeastern/ethnology , Genetic Variation/genetics , Genetics, Population/methods , Haplotypes/genetics , Humans , India/ethnology , Language , Polymorphism, Single Nucleotide/genetics , Thailand/ethnology
12.
Environ Microbiol ; 22(9): 4014-4031, 2020 09.
Article in English | MEDLINE | ID: mdl-32779301

ABSTRACT

We analysed a widely used barcode, the V9 region of the 18S rRNA gene, to study the effect of environmental conditions on the distribution of two related heterotrophic protistan lineages in marine plankton, kinetoplastids and diplonemids. We relied on a major published dataset (Tara Oceans) where samples from the mesopelagic zone were available from just 32 of 123 locations, and both groups are most abundant in this zone. To close sampling gaps and obtain more information from the deeper ocean, we collected 57 new samples targeting especially the mesopelagic zone. We sampled in three geographic regions: the Arctic, two depth transects in the Adriatic Sea, and the anoxic Cariaco Basin. In agreement with previous studies, both protist groups are most abundant and diverse in the mesopelagic zone. In addition to that, we found that their abundance, richness, and community structure also depend on geography, oxygen concentration, salinity, temperature, and other environmental variables reflecting the abundance of algae and nutrients. Both groups studied here demonstrated similar patterns, although some differences were also observed. Kinetoplastids and diplonemids prefer tropical regions and nutrient-rich conditions and avoid high oxygen concentration, high salinity, and high density of algae.


Subject(s)
Euglenozoa/isolation & purification , Oceans and Seas , Plankton/isolation & purification , Seawater/microbiology , Biodiversity , Euglenozoa/classification , Euglenozoa/genetics , Geography , Plankton/classification , Plankton/genetics , RNA, Protozoan/genetics , RNA, Ribosomal, 18S/genetics , Seawater/chemistry , Species Specificity
13.
PLoS One ; 15(3): e0230537, 2020.
Article in English | MEDLINE | ID: mdl-32208452

ABSTRACT

During the blood feeding, sand fly females inject saliva containing immunomodulatory and anti-haemostatic molecules into their vertebrate hosts. The saliva composition is species-specific, likely due to an adaptation to particular haemostatic pathways of their preferred host. Research on sand fly saliva is limited to the representatives of two best-studied genera, Phlebotomus and Lutzomyia. Although the members of the genus Sergentomyia are highly abundant in many areas in the Old World, their role in human disease transmission remains uncertain. Most Sergentomyia spp. preferentially attack various species of reptiles, but feeding on warm-blooded vertebrates, including humans and domestic animals, has been repeatedly described, especially for Sergentomyia schwetzi, of which salivary gland transcriptome and proteome is analyzed in the current study. Illumina RNA sequencing and de novo assembly of the reads and their annotation revealed 17,293 sequences homologous to other arthropods' proteins. In the sialome, all proteins typical for sand fly saliva were identified: antigen 5-related, lufaxin, yellow-related, PpSP15-like, D7-related, ParSP25-like, and silk proteins, as well as less frequent salivary proteins included 71kDa-like, ParSP80-like, SP16-like, and ParSP17-like proteins. Salivary enzymes include apyrase, hyaluronidase, endonuclease, amylase, lipase A2, adenosine deaminase, pyrophosphatase, 5'nucleotidase, and ribonuclease. Proteomics analysis of salivary glands identified 631 proteins, 81 of which are likely secreted into the saliva. We also compared two S. schwetzi lineages derived from the same origin. These lineages were adapted for over 40 generations for blood feeding either on mice (S-M) or geckos (S-G), two vertebrate hosts with different haemostatic mechanisms. Altogether, 20 and 40 annotated salivary transcripts were up-regulated in the S-M and S-G lineage, respectively.
Proteomic comparison revealed ten salivary proteins more abundant in the lineage S-M, whereas 66 salivary proteins were enriched in the lineage S-G. No difference between lineages was found for apyrase activity; contrarily the hyaluronidase activity was significantly higher in the lineage feeding on mice.


Subject(s)
Insect Proteins/genetics , Psychodidae/genetics , Salivary Glands/metabolism , Transcriptome , Animals , Apyrase/analysis , Apyrase/genetics , Apyrase/metabolism , Hyaluronoglucosaminidase/analysis , Hyaluronoglucosaminidase/genetics , Hyaluronoglucosaminidase/metabolism , Insect Proteins/analysis , Insect Proteins/metabolism , Lizards , Mice , Phylogeny , Psychodidae/metabolism , Receptors, Odorant/analysis , Receptors, Odorant/genetics , Receptors, Odorant/metabolism
14.
BMC Biol ; 18(1): 23, 2020 03 02.
Article in English | MEDLINE | ID: mdl-32122335

ABSTRACT

BACKGROUND: The Euglenozoa are a protist group with an especially rich history of evolutionary diversity. They include diplonemids, representing arguably the most species-rich clade of marine planktonic eukaryotes; trypanosomatids, which are notorious parasites of medical and veterinary importance; and free-living euglenids. These different lifestyles, and particularly the transition from free-living to parasitic, likely require different metabolic capabilities. We carried out a comparative genomic analysis across euglenozoan diversity to see how changing repertoires of enzymes and structural features correspond to major changes in lifestyles. RESULTS: We find a gradual loss of genes encoding enzymes in the evolution of kinetoplastids, rather than a sudden decrease in metabolic capabilities corresponding to the origin of parasitism, while diplonemids and euglenids maintain more metabolic versatility. Distinctive characteristics of molecular machines such as kinetochores and the pre-replication complex that were previously considered specific to parasitic kinetoplastids were also identified in their free-living relatives. Therefore, we argue that they represent an ancestral rather than a derived state, as thought until the present. We also found evidence of ancient redundancy in systems such as NADPH-dependent thiol-redox. Only the genus Euglena possesses the combination of trypanothione-, glutathione-, and thioredoxin-based systems supposedly present in the euglenozoan common ancestor, while other representatives of the phylum have lost one or two of these systems. Lastly, we identified convergent losses of specific metabolic capabilities between free-living kinetoplastids and ciliates. Although this observation requires further examination, it suggests that certain eukaryotic lineages are predisposed to such convergent losses of key enzymes or whole pathways. 
CONCLUSIONS: The loss of metabolic capabilities might not be associated with the switch to parasitic lifestyle in kinetoplastids, and the presence of a highly divergent (or unconventional) kinetochore machinery might not be restricted to this protist group. The data derived from the transcriptomes of free-living early branching prokinetoplastids suggests that the pre-replication complex of Trypanosomatidae is a highly divergent version of the conventional machinery. Our findings shed light on trends in the evolution of metabolism in protists in general and open multiple avenues for future research.


Subject(s)
Biological Evolution , Euglenozoa/genetics , Genome, Protozoan , Euglenida/genetics , Euglenida/metabolism , Euglenozoa/metabolism , Evolution, Molecular , Kinetoplastida/genetics , Kinetoplastida/metabolism
15.
Nature ; 570(7760): 236-240, 2019 06.
Article in English | MEDLINE | ID: mdl-31168094

ABSTRACT

Much of the American Arctic was first settled 5,000 years ago, by groups of people known as Palaeo-Eskimos. They were subsequently joined and largely displaced around 1,000 years ago by ancestors of the present-day Inuit and Yup'ik [1-3]. The genetic relationship between Palaeo-Eskimos and Native American, Inuit, Yup'ik and Aleut populations remains uncertain [4-6]. Here we present genomic data for 48 ancient individuals from Chukotka, East Siberia, the Aleutian Islands, Alaska, and the Canadian Arctic. We co-analyse these data with data from present-day Alaskan Iñupiat and West Siberian populations and published genomes. Using methods based on rare-allele and haplotype sharing, as well as established techniques [4,7-9], we show that Palaeo-Eskimo-related ancestry is ubiquitous among people who speak Na-Dene and Eskimo-Aleut languages. We develop a comprehensive model for the Holocene peopling events of Chukotka and North America, and show that Na-Dene-speaking peoples, people of the Aleutian Islands, and Yup'ik and Inuit across the Arctic region all share ancestry from a single Palaeo-Eskimo-related Siberian source.


Subject(s)
Human Migration/history , Inuit/classification , Inuit/genetics , Phylogeny , Phylogeography , Africa , Alaska , Alleles , Arctic Regions , Asia, Southeastern , Canada , Europe , Genome, Human/genetics , Haplotypes , History, Ancient , Humans , Principal Component Analysis , Siberia/ethnology
16.
Nat Ecol Evol ; 3(6): 966-976, 2019 06.
Article in English | MEDLINE | ID: mdl-31036896

ABSTRACT

The indigenous populations of inner Eurasia, a huge geographic region covering the central Eurasian steppe and the northern Eurasian taiga and tundra, harbour tremendous diversity in their genes, cultures and languages. In this study, we report novel genome-wide data for 763 individuals from Armenia, Georgia, Kazakhstan, Moldova, Mongolia, Russia, Tajikistan, Ukraine and Uzbekistan. We furthermore report additional damage-reduced genome-wide data of two previously published individuals from the Eneolithic Botai culture in Kazakhstan (~5,400 BP). We find that present-day inner Eurasian populations are structured into three distinct admixture clines stretching between various western and eastern Eurasian ancestries, mirroring geography. The Botai and more recent ancient genomes from Siberia show a decrease in contributions from so-called 'ancient North Eurasian' ancestry over time, which is detectable only in the northern-most 'forest-tundra' cline. The intermediate 'steppe-forest' cline descends from the Late Bronze Age steppe ancestries, while the 'southern steppe' cline further to the south shows a strong West/South Asian influence. Ancient genomes suggest a northward spread of the southern steppe cline in Central Asia during the first millennium BC. Finally, the genetic structure of Caucasus populations highlights a role of the Caucasus Mountains as a barrier to gene flow and suggests a post-Neolithic gene flow into North Caucasus populations from the steppe.


Subject(s)
Asian People , Gene Flow , Geography , Humans , Russia
17.
Science ; 361(6397): 92-95, 2018 07 06.
Article in English | MEDLINE | ID: mdl-29773666

ABSTRACT

Southeast Asia is home to rich human genetic and linguistic diversity, but the details of past population movements in the region are not well known. Here, we report genome-wide ancient DNA data from 18 Southeast Asian individuals spanning from the Neolithic period through the Iron Age (4100 to 1700 years ago). Early farmers from Man Bac in Vietnam exhibit a mixture of East Asian (southern Chinese agriculturalist) and deeply diverged eastern Eurasian (hunter-gatherer) ancestry characteristic of Austroasiatic speakers, with similar ancestry as far south as Indonesia providing evidence for an expansive initial spread of Austroasiatic languages. By the Bronze Age, in a parallel pattern to Europe, sites in Vietnam and Myanmar show close connections to present-day majority groups, reflecting substantial additional influxes of migrants.


Subject(s)
Genome, Human , Human Migration/history , Language/history , Agriculture/history , Asia, Southeastern , Asian People/genetics , DNA, Ancient , Genetic Variation , History, Ancient , Humans , Radiometric Dating
18.
Sci Rep ; 8(1): 5239, 2018 03 27.
Article in English | MEDLINE | ID: mdl-29588502

ABSTRACT

Rheb is a conserved and widespread Ras-like GTPase involved in cell growth regulation mediated by the (m)TORC1 kinase complex and implicated in tumourigenesis in humans. Rheb function depends on its association with membranes via a prenylated C-terminus, a mechanism shared with many other eukaryotic GTPases. Strikingly, our analysis of a phylogenetically rich sample of Rheb sequences revealed that in multiple lineages this canonical and ancestral membrane attachment mode has been variously altered. The modifications include: (1) accretion to the N-terminus of two different phosphatidylinositol 3-phosphate-binding domains, PX in Cryptista (the fusion being the first proposed synapomorphy of this clade), and FYVE in Euglenozoa and the related undescribed flagellate SRT308; (2) acquisition of lipidic modifications of the N-terminal region, namely myristoylation and/or S-palmitoylation in seven different protist lineages; (3) acquisition of S-palmitoylation in the hypervariable C-terminal region of Rheb in apusomonads, convergently to some other Ras family proteins; (4) replacement of the C-terminal prenylation motif with four transmembrane segments in a novel Rheb paralog in the SAR clade; (5) loss of an evident C-terminal membrane attachment mechanism in Tremellomycetes and some Rheb paralogs of Euglenozoa. Rheb evolution is thus surprisingly dynamic and presents a spectacular example of molecular tinkering.


Subject(s)
Cell Membrane/metabolism , Phylogeny , Ras Homolog Enriched in Brain Protein/genetics , Ras Homolog Enriched in Brain Protein/metabolism , Animals , Carcinogenesis/genetics , Carcinogenesis/metabolism , Euglenozoa/genetics , Euglenozoa/metabolism , Euglenozoa Infections/parasitology , Evolution, Molecular , Humans , Ras Homolog Enriched in Brain Protein/chemistry
19.
Sci Rep ; 8(1): 1536, 2018 01 24.
Article in English | MEDLINE | ID: mdl-29367746

ABSTRACT

The Maniq and Mlabri are the only recorded nomadic hunter-gatherer groups in Thailand. Here, we sequenced complete mitochondrial (mt) DNA genomes and ~2.364 Mbp of non-recombining Y chromosome (NRY) to learn more about the origins of these two enigmatic populations. Both groups exhibited low genetic diversity compared to other Thai populations, and contrasting patterns of mtDNA and NRY diversity: there was greater mtDNA diversity in the Maniq than in the Mlabri, while the converse was true for the NRY. We found basal uniparental lineages in the Maniq, namely mtDNA haplogroups M21a, R21 and M17a, and NRY haplogroup K. Overall, the Maniq are genetically similar to other negrito groups in Southeast Asia. By contrast, the Mlabri haplogroups (B5a1b1 for mtDNA and O1b1a1a1b and O1b1a1a1b1a1 for the NRY) are common lineages in Southeast Asian non-negrito groups, and overall the Mlabri are genetically similar to their linguistic relatives (Htin and Khmu) and other groups from northeastern Thailand. In agreement with previous studies of the Mlabri, our results indicate that the Mlabri do not directly descend from the indigenous negritos. Instead, they likely have a recent origin (within the past 1,000 years) by an extreme founder event (involving just one maternal and two paternal lineages) from an agricultural group, most likely the Htin or a closely related group.


Subject(s)
Asian People/genetics , Chromosomes, Human, Y , DNA, Mitochondrial/genetics , Genetic Variation , Transients and Migrants , DNA, Mitochondrial/chemistry , Haplotypes , Humans , Sequence Analysis, DNA , Thailand
20.
Mol Genet Genomics ; 293(1): 107-117, 2018 Feb.
Article in English | MEDLINE | ID: mdl-28884289

ABSTRACT

The human Y-chromosome has proven to be a powerful tool for tracing the paternal history of human populations and genealogical ancestors. The human Y-chromosome haplogroup Q is the most frequent haplogroup in the Americas. Previous studies have traced the origin of haplogroup Q to the region around Central Asia and Southern Siberia. Although the diversity of haplogroup Q in the Americas has been studied in detail, investigations into the diffusion of haplogroup Q in Eurasia and Africa are still limited. In this study, we collected 39 samples from China and Russia, investigated 432 samples from previous studies of haplogroup Q, and analyzed the single nucleotide polymorphism (SNP) subclades Q1a1a1-M120, Q1a2a1-L54, Q1a1b-M25, Q1a2-M346, Q1a2a1a2-L804, Q1a2b2-F1161, Q1b1a-M378, and Q1b1a1-L245. Through NETWORK and BATWING analyses, we found that the subclades of haplogroup Q continued to disperse from Central Asia and Southern Siberia during the past 10,000 years. Apart from its migration through Beringia to the Americas, haplogroup Q also moved from Asia to the south and to the west during the Neolithic period, and subsequently to the whole of Eurasia and part of Africa.


Subject(s)
Chromosomes, Human, Y/genetics , Genetics, Population , Haplotypes/genetics , Human Migration , Asia , China , Humans , Microsatellite Repeats/genetics , Phylogeny , Polymorphism, Single Nucleotide , Siberia