ABSTRACT
Dire wolves are considered to be one of the most common and widespread large carnivores in Pleistocene America [1], yet relatively little is known about their evolution or extinction. Here, to reconstruct the evolutionary history of dire wolves, we sequenced five genomes from sub-fossil remains dating from 13,000 to more than 50,000 years ago. Our results indicate that although they were similar morphologically to the extant grey wolf, dire wolves were a highly divergent lineage that split from living canids around 5.7 million years ago. In contrast to numerous examples of hybridization across Canidae [2,3], there is no evidence for gene flow between dire wolves and either North American grey wolves or coyotes. This suggests that dire wolves evolved in isolation from the Pleistocene ancestors of these species. Our results also support an early New World origin of dire wolves, while the ancestors of grey wolves, coyotes and dholes evolved in Eurasia and colonized North America only relatively recently.
Subjects
Biological Extinction, Phylogeny, Wolves/classification, Animals, Fossils, Gene Flow, Genome/genetics, Genomics, Geographic Mapping, North America, Paleontology, Phenotype, Wolves/genetics
ABSTRACT
A recent study of mammoth subfossil remains has demonstrated the potential of using relatively low-coverage high-throughput DNA sequencing to genetically sex specimens, revealing a strong male-biased sex ratio [P. Pecnerová et al., Curr. Biol. 27, 3505-3510.e3 (2017)]. Similar patterns were predicted for steppe bison, based on their analogous female herd-based structure. We genetically sexed subfossil remains of 186 Holarctic bison (Bison spp.), and also 91 brown bears (Ursus arctos), which are not female herd-based, and found that ~75% of both groups were male, very close to the ratio observed in mammoths (72%). This large deviation from a 1:1 ratio was unexpected, but we found no evidence for sex differences with respect to DNA preservation, sample age, material type, or overall spatial distribution. We further examined ratios of male and female specimens from 4 large museum mammal collections and found a strong male bias, observable in almost all mammalian orders. We suggest that, in mammals at least, 1) wider male geographic ranges can lead to considerably increased chances of detection in fossil studies, and 2) sexually dimorphic behavior or appearance can facilitate a considerable sex bias in fossil and modern collections, on a previously unacknowledged scale. This finding has major implications for a wide range of studies of fossil and museum material.
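To give a sense of how far a ~75% male share lies from a 1:1 ratio, the following minimal sketch computes an exact two-sided binomial p-value. The count of 140 males among 186 bison is a hypothetical figure chosen to match the reported ~75%, and the function name is illustrative; the abstract does not say which test the authors used.

```python
from math import comb


def binomial_two_sided_p(males: int, total: int) -> float:
    """Exact two-sided binomial p-value against a 1:1 sex ratio."""
    observed = comb(total, males)
    # Under p = 0.5 every outcome k has probability comb(total, k) / 2**total,
    # so the outcomes at least as unlikely as the observed one are exactly
    # those whose binomial coefficient is no larger than the observed one.
    extreme = sum(
        comb(total, k) for k in range(total + 1) if comb(total, k) <= observed
    )
    return extreme / 2**total


# Hypothetical count: 140 males of 186 bison is roughly the ~75% reported.
p_value = binomial_two_sided_p(140, 186)
print(f"two-sided p-value against 1:1: {p_value:.3g}")
```

Integer arithmetic is used until the final division, so the result is not affected by floating-point underflow even for large samples.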
Subjects
Ancient DNA/analysis, Fossils, Mammals/genetics, Genetic Models, Museums, Sexism/statistics & numerical data, Animals, Bison/genetics, Female, High-Throughput Nucleotide Sequencing, Male, Mammoths/genetics, Phylogeny, Ursidae/genetics
ABSTRACT
Tropical islands are renowned as natural laboratories for evolutionary study. Lineage radiations across tropical archipelagos are ideal systems for investigating how colonization, speciation, and extinction processes shape biodiversity patterns. The expansion of the island thrush across the Indo-Pacific represents one of the largest yet most perplexing island radiations of any songbird species. The island thrush exhibits a complex mosaic of pronounced plumage variation across its range and is arguably the world's most polytypic bird. It is a sedentary species largely restricted to mountain forests, yet it has colonized a vast island region spanning a quarter of the globe. We conducted a comprehensive sampling of island thrush populations and obtained genome-wide SNP data, which we used to reconstruct its phylogeny, population structure, gene flow, and demographic history. The island thrush evolved from migratory Palearctic ancestors and radiated explosively across the Indo-Pacific during the Pleistocene, with numerous instances of gene flow between populations. Its bewildering plumage variation masks a biogeographically intuitive stepping stone colonization path from the Philippines through the Greater Sundas, Wallacea, and New Guinea to Polynesia. The island thrush's success in colonizing Indo-Pacific mountains can be understood in light of its ancestral mobility and adaptation to cool climates; however, shifts in elevational range, degree of plumage variation and apparent dispersal rates in the eastern part of its range raise further intriguing questions about its biology.
ABSTRACT
Simulation is a key tool in population genetics for both methods development and empirical research, but producing simulations that recapitulate the main features of genomic datasets remains a major obstacle. Today, more realistic simulations are possible thanks to large increases in the quantity and quality of available genetic data, and the sophistication of inference and simulation software. However, implementing these simulations still requires substantial time and specialized knowledge. These challenges are especially pronounced for simulating genomes for species that are not well-studied, since it is not always clear what information is required to produce simulations with a level of realism sufficient to confidently answer a given question. The community-developed framework stdpopsim seeks to lower this barrier by facilitating the simulation of complex population genetic models using up-to-date information. The initial version of stdpopsim focused on establishing this framework using six well-characterized model species (Adrion et al., 2020). Here, we report on major improvements made in the new release of stdpopsim (version 0.2), which includes a significant expansion of the species catalog and substantial additions to simulation capabilities. Features added to improve the realism of the simulated genomes include non-crossover recombination and provision of species-specific genomic annotations. Through community-driven efforts, we expanded the number of species in the catalog more than threefold and broadened coverage across the tree of life. During the process of expanding the catalog, we have identified common sticking points and developed the best practices for setting up genome-scale simulations. We describe the input data required for generating a realistic simulation, suggest good practices for obtaining the relevant information from the literature, and discuss common pitfalls and major considerations. 
These improvements to stdpopsim aim to further promote the use of realistic whole-genome population genetic simulations, especially in non-model organisms, making them available, transparent, and accessible to everyone.
Subjects
Genome, Software, Computer Simulation, Population Genetics, Genomics
ABSTRACT
The role of natural selection in shaping biological diversity is an area of intense interest in modern biology. To date, studies of positive selection have primarily relied on genomic datasets from contemporary populations, which are susceptible to confounding factors associated with complex and often unknown aspects of population history. In particular, admixture between diverged populations can distort or hide prior selection events in modern genomes, though this process is not explicitly accounted for in most selection studies despite its apparent ubiquity in humans and other species. Through analyses of ancient and modern human genomes, we show that previously reported Holocene-era admixture has masked more than 50 historic hard sweeps in modern European genomes. Our results imply that this canonical mode of selection has probably been underappreciated in the evolutionary history of humans and suggest that our current understanding of the tempo and mode of selection in natural populations may be inaccurate.
Subjects
Hominidae, Genetic Selection, Animals, Humans, Biological Evolution, Human Genome, Genomics
ABSTRACT
Understanding the demographic history of populations is a key goal in population genetics, and with improving methods and data, ever more complex models are being proposed and tested. Demographic models of current interest typically consist of a set of discrete populations, their sizes and growth rates, and continuous and pulse migrations between those populations over a number of epochs, which can require dozens of parameters to fully describe. There is currently no standard format to define such models, significantly hampering progress in the field. In particular, the important task of translating the model descriptions in published work into input suitable for population genetic simulators is labor intensive and error prone. We propose the Demes data model and file format, built on widely used technologies, to alleviate these issues. Demes provides a well-defined and unambiguous model of populations and their properties that is straightforward to implement in software, and a text file format that is designed for simplicity and clarity. We provide thoroughly tested implementations of Demes parsers in multiple languages including Python and C, and showcase initial support in several simulators and inference methods. An introduction to the file format and a detailed specification are available at https://popsim-consortium.github.io/demes-spec-docs/.
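The kind of structure described above (discrete populations whose sizes change over a series of epochs) can be sketched in a few lines of Python. This is an illustration only: the class and field names are hypothetical and are not the Demes schema or its parser API, and size changes within an epoch are assumed to be exponential, with time measured in generations before the present.

```python
from dataclasses import dataclass, field


@dataclass
class Epoch:
    start_time: float  # older boundary, in generations ago
    end_time: float    # more recent boundary, in generations ago
    start_size: float  # population size at start_time
    end_size: float    # population size at end_time

    def __post_init__(self) -> None:
        if self.start_time <= self.end_time:
            raise ValueError("start_time must be older than end_time")

    def size_at(self, time: float) -> float:
        """Population size at `time`, assuming exponential change."""
        if not (self.end_time <= time <= self.start_time):
            raise ValueError("time is outside this epoch")
        elapsed = (self.start_time - time) / (self.start_time - self.end_time)
        return self.start_size * (self.end_size / self.start_size) ** elapsed


@dataclass
class Population:
    name: str
    epochs: list[Epoch] = field(default_factory=list)

    def size_at(self, time: float) -> float:
        """Look up the epoch covering `time` and return the size there."""
        for epoch in self.epochs:
            if epoch.end_time <= time <= epoch.start_time:
                return epoch.size_at(time)
        raise ValueError(f"{self.name} has no epoch covering time {time}")


# Hypothetical model: constant size, then 100-fold exponential growth.
pop = Population("A", [Epoch(1000, 100, 500, 500), Epoch(100, 0, 100, 10_000)])
print(pop.size_at(500), pop.size_at(50))
```

Even this toy model needs four numbers per epoch, which hints at why full models with migrations and pulses can run to dozens of parameters and benefit from a shared, unambiguous format.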
Subjects
Population Genetics, Software, Demography
ABSTRACT
Stochastic simulation is a key tool in population genetics, since the models involved are often analytically intractable and simulation is usually the only way of obtaining ground-truth data to evaluate inferences. Because of this, a large number of specialized simulation programs have been developed, each filling a particular niche, but with largely overlapping functionality and a substantial duplication of effort. Here, we introduce msprime version 1.0, which efficiently implements ancestry and mutation simulations based on the succinct tree sequence data structure and the tskit library. We summarize msprime's many features, and show that its performance is excellent, often many times faster and more memory efficient than specialized alternatives. These high-performance features have been thoroughly tested and validated, and built using a collaborative, open source development model, which reduces duplication of effort and promotes software quality via community engagement.
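As a rough illustration of what an ancestry simulation involves, the sketch below draws the time to the most recent common ancestor (TMRCA) of a sample under the basic Kingman coalescent, using only the Python standard library. It is a textbook toy without recombination, demography, or tree sequences, not msprime's algorithm or API; the function name is hypothetical, and time is in coalescent units in which each pair of lineages coalesces at rate 1.

```python
import random


def simulate_tmrca(sample_size: int, rng: random.Random) -> float:
    """Time to the most recent common ancestor, in coalescent units."""
    time = 0.0
    lineages = sample_size
    while lineages > 1:
        # With k lineages there are k * (k - 1) / 2 pairs, each coalescing
        # at rate 1, so the waiting time is exponential with that total rate.
        rate = lineages * (lineages - 1) / 2
        time += rng.expovariate(rate)
        lineages -= 1
    return time


rng = random.Random(42)
replicates = [simulate_tmrca(10, rng) for _ in range(20_000)]
mean_tmrca = sum(replicates) / len(replicates)
expected = 2 * (1 - 1 / 10)  # standard Kingman expectation for n = 10
print(f"simulated mean {mean_tmrca:.3f}, theoretical mean {expected:.3f}")
```

Comparing the simulated mean with the known expectation is the same kind of ground-truth check that simulation makes possible for models with no closed-form answer.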
Subjects
Algorithms, Genetic Models, Computer Simulation, Population Genetics, Mutation, Software
ABSTRACT
Studies in a variety of species have shown evidence for positively selected variants introduced into a population via introgression from another, distantly related population, a process known as adaptive introgression. However, there are few explicit frameworks for jointly modelling introgression and positive selection, in order to detect these variants using genomic sequence data. Here, we develop an approach based on convolutional neural networks (CNNs). CNNs do not require the specification of an analytical model of allele frequency dynamics and have outperformed alternative methods for classification and parameter estimation tasks in various areas of population genetics. Thus, they are potentially well suited to the identification of adaptive introgression. Using simulations, we trained CNNs on genotype matrices derived from genomes sampled from the donor population, the recipient population and a related non-introgressed population, in order to distinguish regions of the genome evolving under adaptive introgression from those evolving neutrally or experiencing selective sweeps. Our CNN architecture exhibits 95% accuracy on simulated data, even when the genomes are unphased, and accuracy decreases only moderately in the presence of heterosis. As a proof of concept, we applied our trained CNNs to human genomic datasets (both phased and unphased) to detect candidates for adaptive introgression that shaped our evolutionary history.
Subjects
Molecular Evolution, Computer Neural Networks, Gene Frequency, Genotype, Humans, Mutation
ABSTRACT
Background: The evolutionary relationships of Felidae during their Early-Middle Miocene radiation are contentious. Although the early common ancestors have been subsumed under the grade-group Pseudaelurus, this group is thought to be paraphyletic, including the early ancestors of both modern cats and extinct sabretooths. Methods: Here, we sequenced a draft nuclear genome of Smilodon populator, dated to 13,182 ± 90 cal BP, making this the oldest palaeogenome from South America to date, a region known to be problematic for ancient DNA preservation. We analysed this genome, together with genomes from other extinct and extant cats to investigate their phylogenetic relationships. Results: We confirm a deep divergence (~20.65 Ma) within sabre-toothed cats. Through the analysis of both simulated and empirical data, we show a lack of gene flow between Smilodon and contemporary Felidae. Conclusions: Given that some species traditionally assigned to Pseudaelurus originated in the Early Miocene ~20 Ma, this indicates that some species of Pseudaelurus may be younger than the lineages they purportedly gave rise to, further supporting the hypothesis that Pseudaelurus was paraphyletic.
ABSTRACT
The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here, we describe a new resource, stdpopsim, that attempts to rectify this situation. Stdpopsim is a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented Python library with a simple command-line interface. We share some examples demonstrating how stdpopsim can be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.
Subjects
Population Genetics, Genomic Library, Genetic Models, Animals, Arabidopsis/genetics, Dogs/genetics, Drosophila melanogaster/genetics, Escherichia coli/genetics, Population Genetics/methods, Population Genetics/organization & administration, Genome/genetics, Human Genome/genetics, Humans, Pongo abelii/genetics
ABSTRACT
The Tasmanian tiger or thylacine (Thylacinus cynocephalus) was the largest carnivorous Australian marsupial to survive into the modern era. Despite last sharing a common ancestor with the eutherian canids ~160 million years ago, their phenotypic resemblance is considered the most striking example of convergent evolution in mammals. The last known thylacine died in captivity in 1936 and many aspects of the evolutionary history of this unique marsupial apex predator remain unknown. Here we have sequenced the genome of a preserved thylacine pouch young specimen to clarify the phylogenetic position of the thylacine within the carnivorous marsupials, reconstruct its historical demography and examine the genetic basis of its convergence with canids. Retroposon insertion patterns placed the thylacine as the basal lineage in Dasyuromorphia and suggest incomplete lineage sorting in early dasyuromorphs. Demographic analysis indicated a long-term decline in genetic diversity starting well before the arrival of humans in Australia. In spite of their extraordinary phenotypic convergence, comparative genomic analyses demonstrated that amino acid homoplasies between the thylacine and canids are largely consistent with neutral evolution. Furthermore, the genes and pathways targeted by positive selection differ markedly between these species. Together, these findings support models of adaptive convergence driven primarily by cis-regulatory evolution.
Subjects
Molecular Evolution, Genome, Marsupials/genetics, Animals, Australia, Demography, Phylogeny, DNA Sequence Analysis
ABSTRACT
The two living species of bison (European and American) are among the few terrestrial megafauna to have survived the late Pleistocene extinctions. Despite the extensive bovid fossil record in Eurasia, the evolutionary history of the European bison (or wisent, Bison bonasus) before the Holocene (<11.7 thousand years ago (kya)) remains a mystery. We use complete ancient mitochondrial genomes and genome-wide nuclear DNA surveys to reveal that the wisent is the product of hybridization between the extinct steppe bison (Bison priscus) and ancestors of modern cattle (aurochs, Bos primigenius) before 120 kya, and contains up to 10% aurochs genomic ancestry. Although undetected within the fossil record, ancestors of the wisent have alternated ecological dominance with steppe bison in association with major environmental shifts since at least 55 kya. Early cave artists recorded distinct morphological forms consistent with these replacement events, around the Last Glacial Maximum (LGM, ~21-18 kya).