RESUMO
The bacteria Yersinia pestis is the etiological agent of plague and has caused human pandemics with millions of deaths in historic times. How and when it originated remains contentious. Here, we report the oldest direct evidence of Yersinia pestis identified by ancient DNA in human teeth from Asia and Europe dating from 2,800 to 5,000 years ago. By sequencing the genomes, we find that these ancient plague strains are basal to all known Yersinia pestis. We find the origins of the Yersinia pestis lineage to be at least two times older than previous estimates. We also identify a temporal sequence of genetic changes that lead to increased virulence and the emergence of the bubonic plague. Our results show that plague infection was endemic in the human populations of Eurasia at least 3,000 years before any historical recordings of pandemics.
Assuntos
Peste/microbiologia , Yersinia pestis/classificação , Yersinia pestis/isolamento & purificação , Animais , Ásia , DNA Bacteriano/genética , Europa (Continente) , História Antiga , História Medieval , Humanos , Peste/história , Peste/transmissão , Sifonápteros/microbiologia , Dente/microbiologia , Yersinia pestis/genéticaRESUMO
Understanding the dynamic evolution of Salmonella is vital for effective bacterial infection management. This study explores the role of the flexible genome, organised in regions of genomic plasticity (RGP), in shaping the pathogenicity of Salmonella lineages. Through comprehensive genomic analysis of 12,244 Salmonella spp. genomes covering 2 species, 6 subspecies, and 46 serovars, we uncover distinct integration patterns of pathogenicity-related gene clusters into RGP, challenging traditional views of gene distribution. These RGP exhibit distinct preferences for specific genomic spots, and the presence or absence of such spots across Salmonella lineages profoundly shapes strain pathogenicity. RGP preferences are guided by conserved flanking genes surrounding integration spots, implicating their involvement in regulatory networks and functional synergies with integrated gene clusters. Additionally, we emphasise the multifaceted contributions of plasmids and prophages to the pathogenicity of diverse Salmonella lineages. Overall, this study provides a comprehensive blueprint of the pathogenicity potential of Salmonella. This unique insight identifies genomic spots in nonpathogenic lineages that hold the potential for harbouring pathogenicity genes, providing a foundation for predicting future adaptations and developing targeted strategies against emerging human pathogenic strains.
Assuntos
Genoma Bacteriano , Salmonella , Salmonella/genética , Salmonella/patogenicidade , Genoma Bacteriano/genética , Virulência/genética , Humanos , Genômica/métodos , Família Multigênica , Filogenia , Plasmídeos/genética , Infecções por Salmonella/microbiologia , Prófagos/genética , Evolução MolecularRESUMO
Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity1-4. Sparse taxon sampling has previously been proposed to confound phylogenetic inference5, and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families-including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species.
Assuntos
Aves/classificação , Aves/genética , Genoma/genética , Genômica/métodos , Genômica/normas , Filogenia , Animais , Galinhas/genética , Conservação dos Recursos Naturais , Conjuntos de Dados como Assunto , Tentilhões/genética , Humanos , Seleção Genética/genética , Sintenia/genéticaRESUMO
The realization that ancient biomolecules are preserved in "fossil" samples has revolutionized archaeological science. Protein sequences survive longer than DNA, but their phylogenetic resolution is inferior; therefore, careful assessment of the research questions is required. Here, we show the potential of ancient proteins preserved in Pleistocene eggshell in addressing a longstanding controversy in human and animal evolution: the identity of the extinct bird that laid large eggs which were exploited by Australia's indigenous people. The eggs had been originally attributed to the iconic extinct flightless bird Genyornis newtoni (Dromornithidae, Galloanseres) and were subsequently dated to before 50 ± 5 ka by Miller et al. [Nat. Commun. 7, 10496 (2016)]. This was taken to represent the likely extinction date for this endemic megafaunal species and thus implied a role of humans in its demise. A contrasting hypothesis, according to which the eggs were laid by a large mound-builder megapode (Megapodiidae, Galliformes), would therefore acquit humans of their responsibility in the extinction of Genyornis. Ancient protein sequences were reconstructed and used to assess the evolutionary proximity of the undetermined eggshell to extant birds, rejecting the megapode hypothesis. Authentic ancient DNA could not be confirmed from these highly degraded samples, but morphometric data also support the attribution of the eggshell to Genyornis. When used in triangulation to address well-defined hypotheses, paleoproteomics is a powerful tool for reconstructing the evolutionary history in ancient samples. In addition to the clarification of phylogenetic placement, these data provide a more nuanced understanding of the modes of interactions between humans and their environment.
Assuntos
Aves , Casca de Ovo , Animais , Humanos , Filogenia , Aves/genética , DNA/genética , Evolução Biológica , Fósseis , DNA AntigoRESUMO
The black rhinoceros (Diceros bicornis L.) is a critically endangered species historically distributed across sub-Saharan Africa. Hunting and habitat disturbance have diminished both its numbers and distribution since the 19th century, but a poaching crisis in the late 20th century drove them to the brink of extinction. Genetic and genomic assessments can greatly increase our knowledge of the species and inform management strategies. However, when a species has been severely reduced, with the extirpation and artificial admixture of several populations, it is extremely challenging to obtain an accurate understanding of historic population structure and evolutionary history from extant samples. Therefore, we generated and analyzed whole genomes from 63 black rhinoceros museum specimens collected between 1775 and 1981. Results showed that the black rhinoceros could be genetically structured into six major historic populations (Central Africa, East Africa, Northwestern Africa, Northeastern Africa, Ruvuma, and Southern Africa) within which were nested four further subpopulations (Maasailand, southwestern, eastern rift, and northern rift), largely mirroring geography, with a punctuated north-south cline. However, we detected varying degrees of admixture among groups and found that several geographical barriers, most prominently the Zambezi River, drove population discontinuities. Genomic diversity was high in the middle of the range and decayed toward the periphery. This comprehensive historic portrait also allowed us to ascertain the ancestry of 20 resequenced genomes from extant populations. Lastly, using insights gained from this unique temporal data set, we suggest management strategies, some of which require urgent implementation, for the conservation of the remaining black rhinoceros diversity.
Assuntos
Evolução Biológica , Perissodáctilos , Animais , África Oriental , África Subsaariana , Perissodáctilos/genética , Espécies em Perigo de ExtinçãoRESUMO
Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark.
Assuntos
Variação Genética/genética , Genética Populacional/normas , Genoma Humano/genética , Genômica/normas , Análise de Sequência de DNA/normas , Adulto , Alelos , Criança , Cromossomos Humanos Y/genética , Dinamarca , Feminino , Haplótipos/genética , Humanos , Complexo Principal de Histocompatibilidade/genética , Masculino , Idade Materna , Taxa de Mutação , Idade Paterna , Mutação Puntual/genética , Padrões de ReferênciaRESUMO
Lions are one of the world's most iconic megafauna, yet little is known about their temporal and spatial demographic history and population differentiation. We analyzed a genomic dataset of 20 specimens: two ca. 30,000-y-old cave lions (Panthera leo spelaea), 12 historic lions (Panthera leo leo/Panthera leo melanochaita) that lived between the 15th and 20th centuries outside the current geographic distribution of lions, and 6 present-day lions from Africa and India. We found that cave and modern lions shared an ancestor ca. 500,000 y ago and that the 2 lineages likely did not hybridize following their divergence. Within modern lions, we found 2 main lineages that diverged ca. 70,000 y ago, with clear evidence of subsequent gene flow. Our data also reveal a nearly complete absence of genetic diversity within Indian lions, probably due to well-documented extremely low effective population sizes in the recent past. Our results contribute toward the understanding of the evolutionary history of lions and complement conservation efforts to protect the diversity of this vulnerable species.
Assuntos
Evolução Molecular , Leões/genética , Leões/fisiologia , África , Animais , Fluxo Gênico , Variação Genética , Genômica , Geografia , Índia , Leões/classificação , Masculino , Filogenia , Cromossomo XRESUMO
Salmonella infections across the globe are becoming more challenging to control due to the emergence of multidrug-resistant (MDR) strains. Lytic phages may be suitable alternatives for treating these multidrug-resistant Salmonella infections. Most Salmonella phages to date were collected from human-impacted environments. To further explore the Salmonella phage space, and to potentially identify phages with novel characteristics, we characterized Salmonella-specific phages isolated from the Penang National Park, a conserved rainforest. Four phages with a broad lytic spectrum (kills >5 Salmonella serovars) were further characterized; they have isometric heads and cone-shaped tails, and genomes of ~39,900 bp, encoding 49 CDSs. As the genomes share a <95% sequence similarity to known genomes, the phages were classified as a new species within the genus Kayfunavirus. Interestingly, the phages displayed obvious differences in their lytic spectrum and pH stability, despite having a high sequence similarity (~99% ANI). Subsequent analysis revealed that the phages differed in the nucleotide sequence in the tail spike proteins, tail tubular proteins, and portal proteins, suggesting that the SNPs were responsible for their differing phenotypes. Our findings highlight the diversity of novel Salmonella bacteriophages from rainforest regions, which can be explored as an antimicrobial agent against MDR-Salmonella strains.
Assuntos
Bacteriófagos , Infecções por Salmonella , Fagos de Salmonella , Humanos , Fagos de Salmonella/genética , Floresta Úmida , Salmonella/genética , Bacteriófagos/genética , Infecções por Salmonella/genética , Fenótipo , Genômica , Genoma ViralRESUMO
Large vertebrates are extremely sensitive to anthropogenic pressure, and their populations are declining fast. The white rhinoceros (Ceratotherium simum) is a paradigmatic case: this African megaherbivore has suffered a remarkable decline in the last 150 years due to human activities. Its subspecies, the northern (NWR) and the southern white rhinoceros (SWR), however, underwent opposite fates: the NWR vanished quickly, while the SWR recovered after the severe decline. Such demographic events are predicted to have an erosive effect at the genomic level, linked to the extirpation of diversity, and increased genetic drift and inbreeding. However, there is little empirical data available to directly reconstruct the subtleties of such processes in light of distinct demographic histories. Therefore, we generated a whole-genome, temporal data set consisting of 52 resequenced white rhinoceros genomes, representing both subspecies at two time windows: before and during/after the bottleneck. Our data reveal previously unknown population structure within both subspecies, as well as quantifiable genomic erosion. Genome-wide heterozygosity decreased significantly by 10% in the NWR and 36% in the SWR, and inbreeding coefficients rose significantly by 11% and 39%, respectively. Despite the remarkable loss of genomic diversity and recent inbreeding it suffered, the only surviving subspecies, the SWR, does not show a significant accumulation of genetic load compared to its historical counterpart. Our data provide empirical support for predictions about the genomic consequences of shrinking populations, and our findings have the potential to inform the conservation efforts of the remaining white rhinoceroses.
Assuntos
Efeitos Antropogênicos , Perissodáctilos , Animais , Genômica , Endogamia , Perissodáctilos/genéticaRESUMO
North America is currently home to a number of grey wolf (Canis lupus) and wolf-like canid populations, including the coyote (Canis latrans) and the taxonomically controversial red, Eastern timber and Great Lakes wolves. We explored their population structure and regional gene flow using a dataset of 40 full genome sequences that represent the extant diversity of North American wolves and wolf-like canid populations. This included 15 new genomes (13 North American grey wolves, 1 red wolf and 1 Eastern timber/Great Lakes wolf), ranging from 0.4 to 15x coverage. In addition to providing full genome support for the previously proposed coyote-wolf admixture origin for the taxonomically controversial red, Eastern timber and Great Lakes wolves, the discriminatory power offered by our dataset suggests all North American grey wolves, including the Mexican form, are monophyletic, and thus share a common ancestor to the exclusion of all other wolves. Furthermore, we identify three distinct populations in the high arctic, one being a previously unidentified "Polar wolf" population endemic to Ellesmere Island and Greenland. Genetic diversity analyses reveal particularly high inbreeding and low heterozygosity in these Polar wolves, consistent with long-term isolation from the other North American wolves.
Assuntos
Coiotes/genética , Genética Populacional , Genoma , Genômica , Lobos/genética , Animais , Genômica/métodos , Genótipo , América do Norte , FilogeniaRESUMO
The origins of the First Americans remain contentious. Although Native Americans seem to be genetically most closely related to east Asians, there is no consensus with regard to which specific Old World populations they are closest to. Here we sequence the draft genome of an approximately 24,000-year-old individual (MA-1), from Mal'ta in south-central Siberia, to an average depth of 1×. To our knowledge this is the oldest anatomically modern human genome reported to date. The MA-1 mitochondrial genome belongs to haplogroup U, which has also been found at high frequency among Upper Palaeolithic and Mesolithic European hunter-gatherers, and the Y chromosome of MA-1 is basal to modern-day western Eurasians and near the root of most Native American lineages. Similarly, we find autosomal evidence that MA-1 is basal to modern-day western Eurasians and genetically closely related to modern-day Native Americans, with no close affinity to east Asians. This suggests that populations related to contemporary western Eurasians had a more north-easterly distribution 24,000 years ago than commonly thought. Furthermore, we estimate that 14 to 38% of Native American ancestry may originate through gene flow from this ancient population. This is likely to have occurred after the divergence of Native American ancestors from east Asian ancestors, but before the diversification of Native American populations in the New World. Gene flow from the MA-1 lineage into Native American ancestors could explain why several crania from the First Americans have been reported as bearing morphological characteristics that do not resemble those of east Asians. Sequencing of another south-central Siberian, Afontova Gora-2 dating to approximately 17,000 years ago, revealed similar autosomal genetic signatures as MA-1, suggesting that the region was continuously occupied by humans throughout the Last Glacial Maximum. Our findings reveal that western Eurasian genetic signatures in modern-day Native Americans derive not only from post-Columbian admixture, as commonly thought, but also from a mixed ancestry of the First Americans.
Assuntos
Povo Asiático/genética , Genoma Humano/genética , Indígenas Norte-Americanos/etnologia , Indígenas Norte-Americanos/genética , Filogenia , População Branca/genética , Animais , Ásia/etnologia , Cromossomos Humanos Y/genética , DNA Mitocondrial/genética , Emigração e Imigração , Fluxo Gênico/genética , Genoma Mitocondrial/genética , Haplótipos/genética , Humanos , Indígenas Norte-Americanos/classificação , Masculino , Filogeografia , Sibéria/etnologia , EsqueletoRESUMO
Clovis, with its distinctive biface, blade and osseous technologies, is the oldest widespread archaeological complex defined in North America, dating from 11,100 to 10,700 (14)C years before present (bp) (13,000 to 12,600 calendar years bp). Nearly 50 years of archaeological research point to the Clovis complex as having developed south of the North American ice sheets from an ancestral technology. However, both the origins and the genetic legacy of the people who manufactured Clovis tools remain under debate. It is generally believed that these people ultimately derived from Asia and were directly related to contemporary Native Americans. An alternative, Solutrean, hypothesis posits that the Clovis predecessors emigrated from southwestern Europe during the Last Glacial Maximum. Here we report the genome sequence of a male infant (Anzick-1) recovered from the Anzick burial site in western Montana. The human bones date to 10,705 ± 35 (14)C years bp (approximately 12,707-12,556 calendar years bp) and were directly associated with Clovis tools. We sequenced the genome to an average depth of 14.4× and show that the gene flow from the Siberian Upper Palaeolithic Mal'ta population into Native American ancestors is also shared by the Anzick-1 individual and thus happened before 12,600 years bp. We also show that the Anzick-1 individual is more closely related to all indigenous American populations than to any other group. Our data are compatible with the hypothesis that Anzick-1 belonged to a population directly ancestral to many contemporary Native Americans. Finally, we find evidence of a deep divergence in Native American populations that predates the Anzick-1 individual.
Assuntos
Genoma Humano/genética , Indígenas Norte-Americanos/genética , Filogenia , Arqueologia , Ásia/etnologia , Osso e Ossos , Sepultamento , Cromossomos Humanos Y/genética , DNA Mitocondrial/genética , Emigração e Imigração/história , Europa (Continente)/etnologia , Fluxo Gênico/genética , Haplótipos/genética , História Antiga , Humanos , Lactente , Masculino , Modelos Genéticos , Dados de Sequência Molecular , Montana , Dinâmica Populacional , Datação RadiométricaRESUMO
The practices of preparing traditional foods in the Arctic are rapidly disappearing. Traditional foods of the Arctic represent a rarity among food studies in that they are meat-sourced and prepared in non-industrial settings. These foods, generally consumed without any heating step prior to consumption, harbor an insofar undescribed microbiome. The food-associated microbiomes have implications not only with respect to disease risk, but might also positively influence host health by transferring a yet unknown diversity of live microbes to the human gastrointestinal tract. Here we report the first study of the microbial composition of traditionally dried fish prepared according to Greenlandic traditions and their industrial counterparts. We show that dried capelin prepared according to traditional methods have microbiomes clearly different from industrially prepared capelin, which also have more homogenous microbiomes than traditionally prepared capelin. Interestingly, the locally preferred type of traditionally dried capelin, described to be tastier than other traditionally dried capelin, contains bacteria that potentially confer distinct taste. Finally, we show that dried cod have comparably more homogenous microbiomes when compared to capelin and that in general, the environment of drying is a major determinant of the microbial composition of these indigenous food products.
Assuntos
Dessecação , Peixes/microbiologia , Indústria Alimentícia/métodos , Alimentos em Conserva/microbiologia , Microbiota , Alimentos Marinhos/microbiologia , Animais , Bactérias/classificação , Groenlândia , Humanos , Inuíte , RNA Ribossômico 16S/genéticaRESUMO
BACKGROUND: Viruses and other infectious agents cause more than 15% of human cancer cases. High-throughput sequencing-based studies of virus-cancer associations have mainly focused on cancer transcriptome data. METHODS: In this study, we applied a diverse selection of presequencing enrichment methods targeting all major viral groups, to characterize the viruses present in 197 samples from 18 sample types of cancerous origin. Using high-throughput sequencing, we generated 710 datasets constituting 57 billion sequencing reads. RESULTS: Detailed in silico investigation of the viral content, including exclusion of viral artefacts, from de novo assembled contigs and individual sequencing reads yielded a map of the viruses detected. Our data reveal a virome dominated by papillomaviruses, anelloviruses, herpesviruses, and parvoviruses. More than half of the included samples contained 1 or more viruses; however, no link between specific viruses and cancer types were found. CONCLUSIONS: Our study sheds light on viral presence in cancers and provides highly relevant virome data for future reference.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Metagenoma/genética , Neoplasias/virologia , Anelloviridae/genética , Anelloviridae/isolamento & purificação , Biópsia , Conjuntos de Dados como Assunto , Feminino , Herpesviridae/genética , Herpesviridae/isolamento & purificação , Humanos , Masculino , Neoplasias/patologia , Papillomaviridae/genética , Papillomaviridae/isolamento & purificação , Parvovirus/genética , Parvovirus/isolamento & purificaçãoRESUMO
OBJECTIVE: To investigate whether a whole grain diet alters the gut microbiome and insulin sensitivity, as well as biomarkers of metabolic health and gut functionality. DESIGN: 60 Danish adults at risk of developing metabolic syndrome were included in a randomised cross-over trial with two 8-week dietary intervention periods comprising whole grain diet and refined grain diet, separated by a washout period of ≥6 weeks. The response to the interventions on the gut microbiome composition and insulin sensitivity as well on measures of glucose and lipid metabolism, gut functionality, inflammatory markers, anthropometry and urine metabolomics were assessed. RESULTS: 50 participants completed both periods with a whole grain intake of 179±50 g/day and 13±10 g/day in the whole grain and refined grain period, respectively. Compliance was confirmed by a difference in plasma alkylresorcinols (p<0.0001). Compared with refined grain, whole grain did not significantly alter glucose homeostasis and did not induce major changes in the faecal microbiome. Also, breath hydrogen levels, plasma short-chain fatty acids, intestinal integrity and intestinal transit time were not affected. The whole grain diet did, however, compared with the refined grain diet, decrease body weight (p<0.0001), serum inflammatory markers, interleukin (IL)-6 (p=0.009) and C-reactive protein (p=0.003). The reduction in body weight was consistent with a reduction in energy intake, and IL-6 reduction was associated with the amount of whole grain consumed, in particular with intake of rye. CONCLUSION: Compared with refined grain diet, whole grain diet did not alter insulin sensitivity and gut microbiome but reduced body weight and systemic low-grade inflammation. TRIAL REGISTRATION NUMBER: NCT01731366; Results.
Assuntos
Microbioma Gastrointestinal , Inflamação/sangue , Redução de Peso , Grãos Integrais , Adulto , Idoso , Glicemia/metabolismo , Estudos Cross-Over , Dinamarca , Dieta , Ingestão de Energia , Fezes/microbiologia , Feminino , Humanos , Inflamação/dietoterapia , Resistência à Insulina , Interleucina-6/sangue , Lipídeos/sangue , Masculino , Metabolômica , Pessoa de Meia-IdadeRESUMO
We are facing a global metabolic health crisis provoked by an obesity epidemic. Here we report the human gut microbial composition in a population sample of 123 non-obese and 169 obese Danish individuals. We find two groups of individuals that differ by the number of gut microbial genes and thus gut bacterial richness. They contain known and previously unknown bacterial species at different proportions; individuals with a low bacterial richness (23% of the population) are characterized by more marked overall adiposity, insulin resistance and dyslipidaemia and a more pronounced inflammatory phenotype when compared with high bacterial richness individuals. The obese individuals among the lower bacterial richness group also gain more weight over time. Only a few bacterial species are sufficient to distinguish between individuals with high and low bacterial richness, and even between lean and obese participants. Our classifications based on variation in the gut microbiome identify subsets of individuals in the general white adult population who may be at increased risk of progressing to adiposity-associated co-morbidities.
Assuntos
Bactérias/isolamento & purificação , Biomarcadores/metabolismo , Trato Gastrointestinal/microbiologia , Metagenoma , Adiposidade , Adulto , Bactérias/classificação , Bactérias/genética , Índice de Massa Corporal , Estudos de Casos e Controles , Dieta , Dislipidemias/microbiologia , Metabolismo Energético , Europa (Continente)/etnologia , Feminino , Genes Bacterianos , Humanos , Inflamação/microbiologia , Resistência à Insulina , Masculino , Metagenoma/genética , Obesidade/metabolismo , Obesidade/microbiologia , Sobrepeso/metabolismo , Sobrepeso/microbiologia , Filogenia , Magreza/microbiologia , Aumento de Peso , Redução de Peso , População BrancaRESUMO
The rich fossil record of equids has made them a model for evolutionary processes. Here we present a 1.12-times coverage draft genome from a horse bone recovered from permafrost dated to approximately 560-780 thousand years before present (kyr BP). Our data represent the oldest full genome sequence determined so far by almost an order of magnitude. For comparison, we sequenced the genome of a Late Pleistocene horse (43 kyr BP), and modern genomes of five domestic horse breeds (Equus ferus caballus), a Przewalski's horse (E. f. przewalskii) and a donkey (E. asinus). Our analyses suggest that the Equus lineage giving rise to all contemporary horses, zebras and donkeys originated 4.0-4.5 million years before present (Myr BP), twice the conventionally accepted time to the most recent common ancestor of the genus Equus. We also find that horse population size fluctuated multiple times over the past 2 Myr, particularly during periods of severe climatic changes. We estimate that the Przewalski's and domestic horse populations diverged 38-72 kyr BP, and find no evidence of recent admixture between the domestic horse breeds and the Przewalski's horse investigated. This supports the contention that Przewalski's horses represent the last surviving wild horse population. We find similar levels of genetic variation among Przewalski's and domestic populations, indicating that the former are genetically viable and worthy of conservation efforts. We also find evidence for continuous selection on the immune system and olfaction throughout horse evolution. Finally, we identify 29 genomic regions among horse breeds that deviate from neutrality and show low levels of genetic variation compared to the Przewalski's horse. Such regions could correspond to loci selected early during domestication.
Assuntos
Evolução Molecular , Genoma/genética , Cavalos/genética , Filogenia , Animais , Conservação dos Recursos Naturais , DNA/análise , DNA/genética , Espécies em Perigo de Extinção , Equidae/classificação , Equidae/genética , Fósseis , Variação Genética/genética , História Antiga , Cavalos/classificação , Proteínas/análise , Proteínas/química , Proteínas/genética , YukonRESUMO
Yakutia, Sakha Republic, in the Siberian Far East, represents one of the coldest places on Earth, with winter record temperatures dropping below -70 °C. Nevertheless, Yakutian horses survive all year round in the open air due to striking phenotypic adaptations, including compact body conformations, extremely hairy winter coats, and acute seasonal differences in metabolic activities. The evolutionary origins of Yakutian horses and the genetic basis of their adaptations remain, however, contentious. Here, we present the complete genomes of nine present-day Yakutian horses and two ancient specimens dating from the early 19th century and â¼5,200 y ago. By comparing these genomes with the genomes of two Late Pleistocene, 27 domesticated, and three wild Przewalski's horses, we find that contemporary Yakutian horses do not descend from the native horses that populated the region until the mid-Holocene, but were most likely introduced following the migration of the Yakut people a few centuries ago. Thus, they represent one of the fastest cases of adaptation to the extreme temperatures of the Arctic. We find cis-regulatory mutations to have contributed more than nonsynonymous changes to their adaptation, likely due to the comparatively limited standing variation within gene bodies at the time the population was founded. Genes involved in hair development, body size, and metabolic and hormone signaling pathways represent an essential part of the Yakutian horse adaptive genetic toolkit. Finally, we find evidence for convergent evolution with native human populations and woolly mammoths, suggesting that only a few evolutionary strategies are compatible with survival in extremely cold environments.
Assuntos
Adaptação Fisiológica/genética , Temperatura Baixa , Cavalos/fisiologia , Animais , Regiões Árticas , Evolução Molecular , Genoma , Cavalos/genética , SibériaRESUMO
BACKGROUND: Whole-genome sequencing (WGS) projects provide short read nucleotide sequences from nuclear and possibly organelle DNA depending on the source of origin. Mitochondrial DNA is present in animals and fungi, while plants contain DNA from both mitochondria and chloroplasts. Current techniques for separating organelle reads from nuclear reads in WGS data require full reference or partial seed sequences for assembling. RESULTS: Norgal (de Novo ORGAneLle extractor) avoids this requirement by identifying a high frequency subset of k-mers that are predominantly of mitochondrial origin and performing a de novo assembly on a subset of reads that contains these k-mers. The method was applied to WGS data from a panda, brown algae seaweed, butterfly and filamentous fungus. We were able to extract full circular mitochondrial genomes and obtained sequence identities to the reference sequences in the range from 98.5 to 99.5%. We also assembled the chloroplasts of grape vines and cucumbers using Norgal together with seed-based de novo assemblers. CONCLUSION: Norgal is a pipeline that can extract and assemble full or partial mitochondrial and chloroplast genomes from WGS short reads without prior knowledge. The program is available at: https://bitbucket.org/kosaidtu/norgal .