ABSTRACT
f-statistics have emerged as a first line of analysis for making inferences about demographic history from genome-wide data. Not only are they guaranteed to allow robust tests of the fits of proposed models of population history to data when analyzing full genome sequencing data-that is, all single nucleotide polymorphisms (SNPs) in the individuals being analyzed-but they are also guaranteed to allow robust tests of models for SNPs ascertained as polymorphic in a population that is an outgroup in a phylogenetic sense to all groups being analyzed. True "outgroup ascertainment" is in practice impossible in humans because our species has arisen from a substructured ancestral population that does not descend from a homogeneous ancestral population going back many hundreds of thousands of years into the past. However, initial studies suggested that non-outgroup-ascertainment schemes might produce robust enough results using f-statistics, and that motivated widespread fitting of models to data using non-outgroup-ascertained SNP panels such as the "Affymetrix Human Origins array" which has been genotyped on thousands of modern individuals from hundreds of populations, or the "1240k" in-solution enrichment reagent which has been the source of about 70% of published genome-wide data for ancient humans. In this study, we show that while analyses of population history using such panels work well for studies of relationships among non-African populations and one African outgroup, when co-modeling more than one sub-Saharan African and/or archaic human groups (Neanderthals and Denisovans), fitting of f-statistics to such SNP sets is expected to frequently lead to false rejection of true demographic histories, and failure to reject incorrect models. Analyzing panels of SNPs polymorphic in archaic humans, which has been suggested as a solution for the ascertainment problem, has limited statistical power and retains important biases. However, by carrying out simulations of diverse demographic histories, we show that bias in inferences based on f-statistics can be minimized by ascertaining on variants common in a union of diverse African groups; such ascertainment retains high statistical power while allowing co-analysis of archaic and modern groups.
Subject(s)
African People , Demography , Phylogeny , Polymorphism, Single Nucleotide , Animals , Humans , Black People/genetics , Chromosome Mapping , Genotype , Neanderthals/genetics , Polymorphism, Single Nucleotide/genetics , African People/genetics , Demography/history , Biological Variation, Population/genetics , Models, Statistical , BiasABSTRACT
Much of the American Arctic was first settled 5,000 years ago, by groups of people known as Palaeo-Eskimos. They were subsequently joined and largely displaced around 1,000 years ago by ancestors of the present-day Inuit and Yup'ik1-3. The genetic relationship between Palaeo-Eskimos and Native American, Inuit, Yup'ik and Aleut populations remains uncertain4-6. Here we present genomic data for 48 ancient individuals from Chukotka, East Siberia, the Aleutian Islands, Alaska, and the Canadian Arctic. We co-analyse these data with data from present-day Alaskan Iñupiat and West Siberian populations and published genomes. Using methods based on rare-allele and haplotype sharing, as well as established techniques4,7-9, we show that Palaeo-Eskimo-related ancestry is ubiquitous among people who speak Na-Dene and Eskimo-Aleut languages. We develop a comprehensive model for the Holocene peopling events of Chukotka and North America, and show that Na-Dene-speaking peoples, people of the Aleutian Islands, and Yup'ik and Inuit across the Arctic region all share ancestry from a single Palaeo-Eskimo-related Siberian source.
Subject(s)
Human Migration/history , Inuit/classification , Inuit/genetics , Phylogeny , Phylogeography , Africa , Alaska , Alleles , Arctic Regions , Asia, Southeastern , Canada , Europe , Genome, Human/genetics , Haplotypes , History, Ancient , Humans , Principal Component Analysis , Siberia/ethnologyABSTRACT
The great ethnolinguistic diversity found today in mainland Southeast Asia (MSEA) reflects multiple migration waves of people in the past. Maritime trading between MSEA and India was established at the latest 300 BCE, and the formation of early states in Southeast Asia during the first millennium CE was strongly influenced by Indian culture, a cultural influence that is still prominent today. Several ancient Indian-influenced states were located in present-day Thailand, and various populations in the country are likely to be descendants of people from those states. To systematically explore Indian genetic heritage in MSEA populations, we generated genome-wide SNP data (using the Affymetrix Human Origins array) for 119 present-day individuals belonging to 10 ethnic groups from Thailand and co-analyzed them with published data using PCA, ADMIXTURE, and methods relying on f-statistics and on autosomal haplotypes. We found low levels of South Asian admixture in various MSEA populations for whom there is evidence of historical connections with the ancient Indian-influenced states but failed to find this genetic component in present-day hunter-gatherer groups and relatively isolated groups from the highlands of Northern Thailand. The results suggest that migration of Indian populations to MSEA may have been responsible for the spread of Indian culture in the region. Our results also support close genetic affinity between Kra-Dai-speaking (also known as Tai-Kadai) and Austronesian-speaking populations, which fits a linguistic hypothesis suggesting cladality of the two language families.
Subject(s)
Asian People/genetics , Ethnicity/genetics , Asia, Southeastern/ethnology , Genetic Variation/genetics , Genetics, Population/methods , Haplotypes/genetics , Humans , India/ethnology , Language , Polymorphism, Single Nucleotide/genetics , Thailand/ethnologyABSTRACT
BACKGROUND: The Euglenozoa are a protist group with an especially rich history of evolutionary diversity. They include diplonemids, representing arguably the most species-rich clade of marine planktonic eukaryotes; trypanosomatids, which are notorious parasites of medical and veterinary importance; and free-living euglenids. These different lifestyles, and particularly the transition from free-living to parasitic, likely require different metabolic capabilities. We carried out a comparative genomic analysis across euglenozoan diversity to see how changing repertoires of enzymes and structural features correspond to major changes in lifestyles. RESULTS: We find a gradual loss of genes encoding enzymes in the evolution of kinetoplastids, rather than a sudden decrease in metabolic capabilities corresponding to the origin of parasitism, while diplonemids and euglenids maintain more metabolic versatility. Distinctive characteristics of molecular machines such as kinetochores and the pre-replication complex that were previously considered specific to parasitic kinetoplastids were also identified in their free-living relatives. Therefore, we argue that they represent an ancestral rather than a derived state, as thought until the present. We also found evidence of ancient redundancy in systems such as NADPH-dependent thiol-redox. Only the genus Euglena possesses the combination of trypanothione-, glutathione-, and thioredoxin-based systems supposedly present in the euglenozoan common ancestor, while other representatives of the phylum have lost one or two of these systems. Lastly, we identified convergent losses of specific metabolic capabilities between free-living kinetoplastids and ciliates. Although this observation requires further examination, it suggests that certain eukaryotic lineages are predisposed to such convergent losses of key enzymes or whole pathways. CONCLUSIONS: The loss of metabolic capabilities might not be associated with the switch to parasitic lifestyle in kinetoplastids, and the presence of a highly divergent (or unconventional) kinetochore machinery might not be restricted to this protist group. The data derived from the transcriptomes of free-living early branching prokinetoplastids suggests that the pre-replication complex of Trypanosomatidae is a highly divergent version of the conventional machinery. Our findings shed light on trends in the evolution of metabolism in protists in general and open multiple avenues for future research.
Subject(s)
Biological Evolution , Euglenozoa/genetics , Genome, Protozoan , Euglenida/genetics , Euglenida/metabolism , Euglenozoa/metabolism , Evolution, Molecular , Kinetoplastida/genetics , Kinetoplastida/metabolismABSTRACT
We analysed a widely used barcode, the V9 region of the 18S rRNA gene, to study the effect of environmental conditions on the distribution of two related heterotrophic protistan lineages in marine plankton, kinetoplastids and diplonemids. We relied on a major published dataset (Tara Oceans) where samples from the mesopelagic zone were available from just 32 of 123 locations, and both groups are most abundant in this zone. To close sampling gaps and obtain more information from the deeper ocean, we collected 57 new samples targeting especially the mesopelagic zone. We sampled in three geographic regions: the Arctic, two depth transects in the Adriatic Sea, and the anoxic Cariaco Basin. In agreement with previous studies, both protist groups are most abundant and diverse in the mesopelagic zone. In addition to that, we found that their abundance, richness, and community structure also depend on geography, oxygen concentration, salinity, temperature, and other environmental variables reflecting the abundance of algae and nutrients. Both groups studied here demonstrated similar patterns, although some differences were also observed. Kinetoplastids and diplonemids prefer tropical regions and nutrient-rich conditions and avoid high oxygen concentration, high salinity, and high density of algae.
Subject(s)
Euglenozoa/isolation & purification , Oceans and Seas , Plankton/isolation & purification , Seawater/microbiology , Biodiversity , Euglenozoa/classification , Euglenozoa/genetics , Geography , Plankton/classification , Plankton/genetics , RNA, Protozoan/genetics , RNA, Ribosomal, 18S/genetics , Seawater/chemistry , Species SpecificityABSTRACT
RNA editing by targeted insertion and deletion of uridine is crucial to generate translatable mRNAs from the cryptogenes of the mitochondrial genome of kinetoplastids. This type of editing consists of a stepwise cascade of reactions generally proceeding from 3' to 5' on a transcript, resulting in a population of partially edited as well as pre-edited and completely edited molecules for each mitochondrial cryptogene of these protozoans. Often, the number of uridines inserted and deleted exceed the number of nucleotides that are genome-encoded. Thus, analysis of kinetoplastid mitochondrial transcriptomes has proven frustratingly complex. Here we present our analysis of Leptomonas pyrrhocoris mitochondrial cDNA deep sequencing reads using T-Aligner, our new tool which allows comprehensive characterization of RNA editing, not relying on targeted transcript amplification and on prior knowledge of final edited products. T-Aligner implements a pipeline of read mapping, visualization of all editing states and their coverage, and assembly of canonical and alternative translatable mRNAs. We also assess T-Aligner functionality on a more challenging deep sequencing read input from Trypanosoma cruzi. The analysis reveals that transcripts of cryptogenes of both species undergo very complex editing that includes the formation of alternative open reading frames and whole categories of truncated editing products.
Subject(s)
Mitochondria/genetics , RNA Editing , RNA, Mitochondrial/genetics , Trypanosomatina/genetics , Computational Biology/methods , Gene Expression Profiling/methods , Genome, Mitochondrial/genetics , Genome, Protozoan/genetics , Mitochondria/metabolism , RNA Isoforms/genetics , RNA Isoforms/metabolism , RNA Splicing , RNA, Mitochondrial/metabolism , RNA, Protozoan/genetics , RNA, Protozoan/metabolism , Trypanosoma brucei brucei/genetics , Trypanosoma brucei brucei/metabolism , Trypanosomatina/metabolismABSTRACT
Paratrypanosoma confusum is a monoxenous kinetoplastid flagellate that constitutes the most basal branch of the highly diverse parasitic trypanosomatids, which include human pathogens Trypanosoma and Leishmania This makes Paratrypanosoma uniquely informative for the evolution of obligatory parasitism from free-living lifestyle and the evolution of human parasitism in some trypanosomatid lineages. It has typical promastigote morphology but also forms surface-attached haptomonads and amastigotes. Haptomonads form by attachment to a surface via a large bulge at the base of the flagellum, which is then remodeled into a thin attachment pad associated with flagellum shortening. Promastigotes and haptomonads multiply by binary division, and the progeny of a haptomonad can either remain attached or grow a flagellum and resume swimming. Whole genome sequencing and transcriptome profiling, in combination with analysis of the cell ultrastructure, reveal how the cell surface and metabolism are adapted to parasitism and how characteristic cytoskeletal features are conserved. Our data demonstrate that surface attachment by the flagellum and the flagellar pocket, a Leishmania-like flagellum attachment zone, and a Trypanosoma cruzi-like cytostome are ancestral features, while evolution of extant trypanosomatids, including the human parasites, is associated with genome streamlining and diversification of membrane proteins.
Subject(s)
Flagella/genetics , Life Cycle Stages/genetics , Trypanosoma cruzi/genetics , Cytoskeleton/genetics , Gene Expression Profiling/methods , Genome, Protozoan/genetics , Humans , Leishmania/genetics , Phylogeny , Protozoan Proteins/geneticsABSTRACT
Kinetoplastid flagellates comprise basal mostly free-living bodonids and derived obligatory parasitic trypanosomatids, which belong to the best-studied protists. Due to their omnipresence in aquatic environments and soil, the bodonids are of ecological significance. Here, we present the first global survey of marine kinetoplastids and compare it with the strikingly different patterns of abundance and diversity in their sister clade, the diplonemids. Based on analysis of 18S rDNA V9 ribotypes obtained from 124 sites sampled during the Tara Oceans expedition, our results show generally low to moderate abundance and diversity of planktonic kinetoplastids. Although we have identified all major kinetoplastid lineages, 98% of kinetoplastid reads are represented by neobodonids, namely specimens of the Neobodo and Rhynchomonas genera, which make up 59% and 18% of all reads, respectively. Most kinetoplastids have small cell size (0.8-5 µm) and tend to be more abundant in the mesopelagic as compared to the euphotic zone. Some of the most abundant operational taxonomic units have distinct geographical distributions, and three novel putatively parasitic neobodonids were identified, along with their potential hosts.
Subject(s)
Kinetoplastida/classification , Kinetoplastida/genetics , Plankton/genetics , Biodiversity , DNA, Ribosomal/genetics , Oceans and Seas , Phylogeny , RNA, Ribosomal, 18S/geneticsABSTRACT
The human Y-chromosome has proven to be a powerful tool for tracing the paternal history of human populations and genealogical ancestors. The human Y-chromosome haplogroup Q is the most frequent haplogroup in the Americas. Previous studies have traced the origin of haplogroup Q to the region around Central Asia and Southern Siberia. Although the diversity of haplogroup Q in the Americas has been studied in detail, investigations on the diffusion of haplogroup Q in Eurasia and Africa are still limited. In this study, we collected 39 samples from China and Russia, investigated 432 samples from previous studies of haplogroup Q, and analyzed the single nucleotide polymorphism (SNP) subclades Q1a1a1-M120, Q1a2a1-L54, Q1a1b-M25, Q1a2-M346, Q1a2a1a2-L804, Q1a2b2-F1161, Q1b1a-M378, and Q1b1a1-L245. Through NETWORK and BATWING analyses, we found that the subclades of haplogroup Q continued to disperse from Central Asia and Southern Siberia during the past 10,000 years. Apart from its migration through the Beringia to the Americas, haplogroup Q also moved from Asia to the south and to the west during the Neolithic period, and subsequently to the whole of Eurasia and part of Africa.
Subject(s)
Chromosomes, Human, Y/genetics , Genetics, Population , Haplotypes/genetics , Human Migration , Asia , China , Humans , Microsatellite Repeats/genetics , Phylogeny , Polymorphism, Single Nucleotide , SiberiaABSTRACT
The Neolithic transition in west Eurasia occurred in two main steps: the gradual development of sedentism and plant cultivation in the Near East and the subsequent spread of Neolithic cultures into the Aegean and across Europe after 7000 cal BCE. Here, we use published ancient genomes to investigate gene flow events in west Eurasia during the Neolithic transition. We confirm that the Early Neolithic central Anatolians in the ninth millennium BCE were probably descendants of local hunter-gatherers, rather than immigrants from the Levant or Iran. We further study the emergence of post-7000 cal BCE north Aegean Neolithic communities. Although Aegean farmers have frequently been assumed to be colonists originating from either central Anatolia or from the Levant, our findings raise alternative possibilities: north Aegean Neolithic populations may have been the product of multiple westward migrations, including south Anatolian emigrants, or they may have been descendants of local Aegean Mesolithic groups who adopted farming. These scenarios are consistent with the diversity of material cultures among Aegean Neolithic communities and the inheritance of local forager know-how. The demographic and cultural dynamics behind the earliest spread of Neolithic culture in the Aegean could therefore be distinct from the subsequent Neolithization of mainland Europe.
Subject(s)
Agriculture/history , Gene Flow , Genome, Human , Human Migration/history , Archaeology , Farmers/history , Genomics , Greece , History, Ancient , Humans , TurkeyABSTRACT
The co-infection cases involving dixenous Leishmania spp. (mostly of the L. donovani complex) and presumably monoxenous trypanosomatids in immunocompromised mammalian hosts including humans are well documented. The main opportunistic parasite has been identified as Leptomonas seymouri of the sub-family Leishmaniinae. The molecular mechanisms allowing a parasite of insects to withstand elevated temperature and substantially different conditions of vertebrate tissues are not understood. Here we demonstrate that L. seymouri is well adapted for the environment of the warm-blooded host. We sequenced the genome and compared the whole transcriptome profiles of this species cultivated at low and high temperatures (mimicking the vector and the vertebrate host, respectively) and identified genes and pathways differentially expressed under these experimental conditions. Moreover, Leptomonas seymouri was found to persist for several days in two species of Phlebotomus spp. implicated in Leishmania donovani transmission. Despite of all these adaptations, L. seymouri remains a predominantly monoxenous species not capable of infecting vertebrate cells under normal conditions.
Subject(s)
Coinfection/microbiology , Euglenozoa Infections/genetics , Leishmaniasis, Visceral/parasitology , Trypanosomatina/genetics , Adaptation, Physiological/physiology , Animals , Disease Models, Animal , Gene Expression Profiling , Genes, Protozoan , Leishmania donovani , Life Cycle Stages , Polymerase Chain Reaction , Psychodidae/microbiology , Transcriptome , Trypanosomatina/growth & developmentABSTRACT
Members of the family Trypanosomatidae infect many organisms, including animals, plants and humans. Plant-infecting trypanosomes are grouped under the single genus Phytomonas, failing to reflect the wide biological and pathological diversity of these protists. While some Phytomonas spp. multiply in the latex of plants, or in fruit or seeds without apparent pathogenicity, others colonize the phloem sap and afflict plants of substantial economic value, including the coffee tree, coconut and oil palms. Plant trypanosomes have not been studied extensively at the genome level, a major gap in understanding and controlling pathogenesis. We describe the genome sequences of two plant trypanosomatids, one pathogenic isolate from a Guianan coconut and one non-symptomatic isolate from Euphorbia collected in France. Although these parasites have extremely distinct pathogenic impacts, very few genes are unique to either, with the vast majority of genes shared by both isolates. Significantly, both Phytomonas spp. genomes consist essentially of single copy genes for the bulk of their metabolic enzymes, whereas other trypanosomatids e.g. Leishmania and Trypanosoma possess multiple paralogous genes or families. Indeed, comparison with other trypanosomatid genomes revealed a highly streamlined genome, encoding for a minimized metabolic system while conserving the major pathways, and with retention of a full complement of endomembrane organelles, but with no evidence for functional complexity. Identification of the metabolic genes of Phytomonas provides opportunities for establishing in vitro culturing of these fastidious parasites and new tools for the control of agricultural plant disease.
Subject(s)
Kinetoplastida/genetics , Plant Diseases/genetics , Sequence Analysis, DNA , Trypanosomatina/genetics , Animals , Cocos/genetics , Cocos/parasitology , Coffee/genetics , Coffee/parasitology , France , Genome , Humans , Kinetoplastida/pathogenicity , Plant Diseases/parasitology , Seeds/parasitology , Trypanosomatina/pathogenicityABSTRACT
Four respiratory complexes and ATP-synthase represent central functional units in mitochondria. In some mitochondria and derived anaerobic organelles, a few or all of these respiratory complexes have been lost during evolution. We show that the respiratory chain of Chromera velia, a phototrophic relative of parasitic apicomplexans, lacks complexes I and III, making it a uniquely reduced aerobic mitochondrion. In Chromera, putative lactate:cytochrome c oxidoreductases are predicted to transfer electrons from lactate to cytochrome c, rendering complex III unnecessary. The mitochondrial genome of Chromera has the smallest known protein-coding capacity of all mitochondria, encoding just cox1 and cox3 on heterogeneous linear molecules. In contrast, another photosynthetic relative of apicomplexans, Vitrella brassicaformis, retains the same set of genes as apicomplexans and dinoflagellates (cox1, cox3, and cob).
Subject(s)
Evolution, Molecular , Genetic Variation , Mitochondria/genetics , Phylogeny , Alveolata/genetics , Alveolata/metabolism , Animals , Apicomplexa/genetics , Cytochromes c/metabolism , Electron Transport , Electron Transport Complex I/genetics , Electron Transport Complex I/metabolism , Genome, Mitochondrial , Lactic Acid/metabolism , Mitochondria/metabolism , Mitochondrial Proton-Translocating ATPases/genetics , Mitochondrial Proton-Translocating ATPases/metabolism , Parasites/genetics , Parasites/metabolism , Photosynthesis/geneticsABSTRACT
Comparison of the genomes of free-living Bodo saltans and those of parasitic trypanosomatids reveals that the transition from a free-living to a parasitic life style has resulted in the loss of approximately 50% of protein-coding genes. Despite this dramatic reduction in genome size, B. saltans and trypanosomatids still share a significant number of common metabolic traits: glycosomes; a unique set of the pyrimidine biosynthetic pathway genes; an ATP-PFK which is homologous to the bacterial PPi -PFKs rather than to the canonical eukaryotic ATP-PFKs; an alternative oxidase; three phosphoglycerate kinases and two GAPDH isoenzymes; a pyruvate kinase regulated by fructose-2,6-bisphosphate; trypanothione as a substitute for glutathione; synthesis of fatty acids via a unique set of elongase enzymes; and a mitochondrial acetate:succinate coenzyme A transferase. B. saltans has lost the capacity to synthesize ubiquinone. Among genes that are present in B. saltans and lost in all trypanosomatids are those involved in the degradation of mureine, tryptophan and lysine. Novel acquisitions of trypanosomatids are components of pentose sugar metabolism, pteridine reductase and bromodomain-factor proteins. In addition, only the subfamily Leishmaniinae has acquired a gene for catalase and the capacity to convert diaminopimelic acid to lysine.
Subject(s)
Kinetoplastida/genetics , Kinetoplastida/metabolism , Trypanosomatina/genetics , Trypanosomatina/metabolism , Amino Acids/metabolism , Bacteria/genetics , Bacteria/metabolism , Carbohydrate Metabolism , Coenzymes/metabolism , Dolichols/metabolism , Ergosterol/biosynthesis , Eukaryota/genetics , Eukaryota/metabolism , Folic Acid/metabolism , Genes, Protozoan/genetics , Gluconeogenesis , Glycolysis , Kinetoplastida/enzymology , Lipid Metabolism , Mevalonic Acid/metabolism , Microbodies/metabolism , Mitochondria/enzymology , Mitochondria/metabolism , Oxidoreductases/metabolism , Pentose Phosphate Pathway , Peroxisomes/metabolism , Phospholipids/metabolism , Polyamines/metabolism , Protein Prenylation , Protozoan Proteins/genetics , Purines/biosynthesis , Purines/metabolism , Pyrimidines/biosynthesis , Pyrimidines/metabolism , Reactive Oxygen Species , Trypanosomatina/enzymology , Ubiquinone/metabolism , Urea/metabolism , Vitamins/metabolismABSTRACT
The genetic structure of human populations is extraordinarily complex and of fundamental importance to studies of anthropology, evolution, and medicine. As increasingly many individuals are of mixed origin, there is an unmet need for tools that can infer multiple origins. Misclassification of such individuals can lead to incorrect and costly misinterpretations of genomic data, primarily in disease studies and drug trials. We present an advanced tool to infer ancestry that can identify the biogeographic origins of highly mixed individuals. reAdmix can incorporate individual's knowledge of ancestors (e.g. having some ancestors from Turkey or a Scottish grandmother). reAdmix is an online tool available at http://chcb.saban-chla.usc.edu/reAdmix/.
Subject(s)
Biological Evolution , Computational Biology , Ethnicity/genetics , Genetics, Medical/methods , Animals , Humans , SoftwareABSTRACT
Heme is an iron-coordinated porphyrin that is universally essential as a protein cofactor for fundamental cellular processes, such as electron transport in the respiratory chain, oxidative stress response, or redox reactions in various metabolic pathways. Parasitic kinetoplastid flagellates represent a rare example of organisms that depend on oxidative metabolism but are heme auxotrophs. Here, we show that heme is fully dispensable for the survival of Phytomonas serpens, a plant parasite. Seeking to understand the metabolism of this heme-free eukaryote, we searched for heme-containing proteins in its de novo sequenced genome and examined several cellular processes for which heme has so far been considered indispensable. We found that P. serpens lacks most of the known hemoproteins and does not require heme for electron transport in the respiratory chain, protection against oxidative stress, or desaturation of fatty acids. Although heme is still required for the synthesis of ergosterol, its precursor, lanosterol, is instead incorporated into the membranes of P. serpens grown in the absence of heme. In conclusion, P. serpens is a flagellate with unique metabolic adaptations that allow it to bypass all requirements for heme.
Subject(s)
Heme/chemistry , Kinetoplastida/metabolism , Trypanosomatina/metabolism , Crithidia fasciculata/metabolism , Electron Transport , Ergosterol/chemistry , Fatty Acids/chemistry , Lanosterol/chemistry , Models, Biological , Oxidation-Reduction , Oxidative Stress , Oxygen/chemistry , Phylogeny , Porphyrins/chemistry , Sterols/chemistryABSTRACT
The canonical photosynthetic plastid genomes consist of a single circular-mapping chromosome that encodes a highly conserved protein core, involved in photosynthesis and ATP generation. Here, we demonstrate that the plastid genome of the photosynthetic relative of apicomplexans, Chromera velia, departs from this view in several unique ways. Core photosynthesis proteins PsaA and AtpB have been broken into two fragments, which we show are independently transcribed, oligoU-tailed, translated, and assembled into functional photosystem I and ATP synthase complexes. Genome-wide transcription profiles support expression of many other highly modified proteins, including several that contain extensions amounting to hundreds of amino acids in length. Canonical gene clusters and operons have been fragmented and reshuffled into novel putative transcriptional units. Massive genomic coverage by paired-end reads, coupled with pulsed-field gel electrophoresis and polymerase chain reaction, consistently indicate that the C. velia plastid genome is linear-mapping, a unique state among all plastids. Abundant intragenomic duplication probably mediated by recombination can explain protein splits, extensions, and genome linearization and is perhaps the key driving force behind the many features that defy the conventional ways of plastid genome architecture and function.
Subject(s)
ATP Synthetase Complexes/genetics , Alveolata/genetics , Genome, Protozoan , Photosystem I Protein Complex/genetics , Protozoan Proteins/genetics , ATP Synthetase Complexes/chemistry , ATP Synthetase Complexes/metabolism , Adenosine Triphosphate/metabolism , Base Sequence , Chromosome Mapping , Evolution, Molecular , Gene Expression Profiling , Models, Molecular , Molecular Sequence Data , Multigene Family , Photosynthesis/genetics , Photosystem I Protein Complex/chemistry , Photosystem I Protein Complex/metabolism , Protein Structure, Tertiary , Protozoan Proteins/chemistry , Protozoan Proteins/metabolismABSTRACT
Our knowledge of human evolutionary history has been greatly advanced by paleogenomics. Since the 2020s, the study of ancient DNA has increasingly focused on reconstructing the recent past. However, the accuracy of paleogenomic methods in resolving questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation remains an open question. We evaluated the performance and behavior of two commonly used methods, qpAdm and the f3-statistic, on admixture inference under a diversity of demographic models and data conditions. We performed two complementary simulation approaches-firstly exploring a wide demographic parameter space under four simple demographic models of varying complexities and configurations using branch-length data from two chromosomes-and secondly, we analyzed a model of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudohaploidization. We observe that population differentiation is the primary factor driving qpAdm performance. Notably, while complex gene flow histories influence which models are classified as plausible, they do not reduce overall performance. Under conditions reflective of the historical period, qpAdm most frequently identifies the true model as plausible among a small candidate set of closely related populations. To increase the utility for resolving fine-scaled hypotheses, we provide a heuristic for further distinguishing between candidate models that incorporates qpAdm model P-values and f3-statistics. Finally, we demonstrate a significant performance increase for qpAdm using whole-genome branch-length f2-statistics, highlighting the potential for improved demographic inference that could be achieved with future advancements in f-statistic estimations.
Subject(s)
DNA, Ancient , Models, Genetic , DNA, Ancient/analysis , Humans , Genetics, Population/methods , Gene Flow , Polymorphism, Single Nucleotide , Genome, Human , Evolution, MolecularABSTRACT
During the Hungarian Conquest in the 10th century CE, the early medieval Magyars, a group of mounted warriors from Eastern Europe, settled in the Carpathian Basin. They likely introduced the Hungarian language to this new settlement area, during an event documented by both written sources and archaeological evidence. Previous archaeogenetic research identified the newcomers as migrants from the Eurasian steppe. However, genome-wide ancient DNA from putative source populations has not been available to test alternative theories of their precise source. We generated genome-wide ancient DNA data for 131 individuals from candidate archaeological contexts in the Circum-Uralic region in present-day Russia. Our results tightly link the Magyars to people of the Early Medieval Karayakupovo archaeological horizon on both the European and Asian sides of the southern Urals. Our analyes show that ancestors of the people of the Karayakupovo archaeological horizon were established in the Southern Urals by the Iron Age and that their descendants persisted locally in the Volga-Kama region until at least the 14th century.
ABSTRACT
The Yamnaya archaeological complex appeared around 3300BCE across the steppes north of the Black and Caspian Seas, and by 3000BCE reached its maximal extent from Hungary in the west to Kazakhstan in the east. To localize the ancestral and geographical origins of the Yamnaya among the diverse Eneolithic people that preceded them, we studied ancient DNA data from 428 individuals of which 299 are reported for the first time, demonstrating three previously unknown Eneolithic genetic clines. First, a "Caucasus-Lower Volga" (CLV) Cline suffused with Caucasus hunter-gatherer (CHG) ancestry extended between a Caucasus Neolithic southern end in Neolithic Armenia, and a steppe northern end in Berezhnovka in the Lower Volga. Bidirectional gene flow across the CLV cline created admixed intermediate populations in both the north Caucasus, such as the Maikop people, and on the steppe, such as those at the site of Remontnoye north of the Manych depression. CLV people also helped form two major riverine clines by admixing with distinct groups of European hunter-gatherers. A "Volga Cline" was formed as Lower Volga people mixed with upriver populations that had more Eastern hunter-gatherer (EHG) ancestry, creating genetically hyper-variable populations as at Khvalynsk in the Middle Volga. A "Dnipro Cline" was formed as CLV people bearing both Caucasus Neolithic and Lower Volga ancestry moved west and acquired Ukraine Neolithic hunter-gatherer (UNHG) ancestry to establish the population of the Serednii Stih culture from which the direct ancestors of the Yamnaya themselves were formed around 4000BCE. This population grew rapidly after 3750-3350BCE, precipitating the expansion of people of the Yamnaya culture who totally displaced previous groups on the Volga and further east, while admixing with more sedentary groups in the west. CLV cline people with Lower Volga ancestry contributed four fifths of the ancestry of the Yamnaya, but also, entering Anatolia from the east, contributed at least a tenth of the ancestry of Bronze Age Central Anatolians, where the Hittite language, related to the Indo-European languages spread by the Yamnaya, was spoken. We thus propose that the final unity of the speakers of the "Proto-Indo-Anatolian" ancestral language of both Anatolian and Indo-European languages can be traced to CLV cline people sometime between 4400-4000 BCE.