ABSTRACT
We investigate a 2,000-year genetic transect through Scandinavia spanning the Iron Age to the present, based on 48 new and 249 published ancient genomes and genotypes from 16,638 modern individuals. We find regional variation in the timing and magnitude of gene flow from three sources: the eastern Baltic, the British-Irish Isles, and southern Europe. British-Irish ancestry was widespread in Scandinavia from the Viking period, whereas eastern Baltic ancestry is more localized to Gotland and central Sweden. In some regions, a drop in current levels of external ancestry suggests that ancient immigrants contributed proportionately less to the modern Scandinavian gene pool than indicated by the ancestry of genomes from the Viking and Medieval periods. Finally, we show that a north-south genetic cline that characterizes modern Scandinavians is mainly due to the differential levels of Uralic ancestry and that this cline existed in the Viking Age and possibly earlier.
Subjects
Human Genome , Humans , Europe , Genetic Variation , Scandinavian and Nordic Countries , United Kingdom , White People/genetics , White People/history , Human Migration
ABSTRACT
Only five species of the once-diverse Rhinocerotidae remain, making the reconstruction of their evolutionary history a challenge to biologists since Darwin. We sequenced genomes from five rhinoceros species (three extinct and two living), which we compared to existing data from the remaining three living species and a range of outgroups. We identify an early divergence between extant African and Eurasian lineages, resolving a key debate regarding the phylogeny of extant rhinoceroses. This early Miocene (~16 million years ago [mya]) split post-dates the land bridge formation between the Afro-Arabian and Eurasian landmasses. Our analyses also show that while rhinoceros genomes in general exhibit low levels of genome-wide diversity, heterozygosity is lowest and inbreeding is highest in the modern species. These results suggest that while low genetic diversity is a long-term feature of the family, it has been particularly exacerbated recently, likely reflecting recent anthropogenic-driven population declines.
Subjects
Molecular Evolution , Genome , Perissodactyla/genetics , Animals , Demography , Gene Flow , Genetic Variation , Geography , Heterozygote , Homozygote , Host Specificity , Markov Chains , Mutation/genetics , Phylogeny , Species Specificity , Time Factors
ABSTRACT
The grey wolf (Canis lupus) was the first species to give rise to a domestic population, and they remained widespread throughout the last Ice Age when many other large mammal species went extinct. Little is known, however, about the history and possible extinction of past wolf populations or when and where the wolf progenitors of the present-day dog lineage (Canis familiaris) lived [1-8]. Here we analysed 72 ancient wolf genomes spanning the last 100,000 years from Europe, Siberia and North America. We found that wolf populations were highly connected throughout the Late Pleistocene, with levels of differentiation an order of magnitude lower than they are today. This population connectivity allowed us to detect natural selection across the time series, including rapid fixation of mutations in the gene IFT88 40,000-30,000 years ago. We show that dogs are overall more closely related to ancient wolves from eastern Eurasia than to those from western Eurasia, suggesting a domestication process in the east. However, we also found that dogs in the Near East and Africa derive up to half of their ancestry from a distinct population related to modern southwest Eurasian wolves, reflecting either an independent domestication process or admixture from local wolves. None of the analysed ancient wolf genomes is a direct match for either of these dog ancestries, meaning that the exact progenitor populations remain to be located.
Subjects
Dogs , Genome , Genomics , Phylogeny , Wolves , Africa , Animals , Ancient DNA/analysis , Dogs/genetics , Domestication , Europe , Genome/genetics , Ancient History , Middle East , Mutation , North America , Genetic Selection , Siberia , Tumor Suppressor Proteins/genetics , Wolves/classification , Wolves/genetics
ABSTRACT
Dire wolves are considered to be one of the most common and widespread large carnivores in Pleistocene America [1], yet relatively little is known about their evolution or extinction. Here, to reconstruct the evolutionary history of dire wolves, we sequenced five genomes from sub-fossil remains dating from 13,000 to more than 50,000 years ago. Our results indicate that although they were similar morphologically to the extant grey wolf, dire wolves were a highly divergent lineage that split from living canids around 5.7 million years ago. In contrast to numerous examples of hybridization across Canidae [2,3], there is no evidence for gene flow between dire wolves and either North American grey wolves or coyotes. This suggests that dire wolves evolved in isolation from the Pleistocene ancestors of these species. Our results also support an early New World origin of dire wolves, while the ancestors of grey wolves, coyotes and dholes evolved in Eurasia and colonized North America only relatively recently.
Subjects
Biological Extinction , Phylogeny , Wolves/classification , Animals , Fossils , Gene Flow , Genome/genetics , Genomics , Geographic Mapping , North America , Paleontology , Phenotype , Wolves/genetics
ABSTRACT
The island of St Helena played a crucial role in the suppression of the transatlantic slave trade. Strategically located in the middle of the South Atlantic, it served as a staging post for the Royal Navy and reception point for enslaved Africans who had been "liberated" from slave ships intercepted by the British. In total, St Helena received approximately 27,000 liberated Africans between 1840 and 1867. Written sources suggest that the majority of these individuals came from West Central Africa, but their precise origins are unknown. Here, we report the results of ancient DNA analyses that we conducted as part of a wider effort to commemorate St Helena's liberated Africans and to restore knowledge of their lives and experiences. We generated partial genomes (0.1-0.5×) for 20 individuals whose remains had been recovered during archaeological excavations on the island. We compared their genomes with genotype data for over 3,000 present-day individuals from 90 populations across sub-Saharan Africa and conclude that the individuals most likely originated from different source populations within the general area between northern Angola and Gabon. We also find that the majority (17/20) of the individuals were male, supporting a well-documented sex bias in the latter phase of the transatlantic slave trade. The study expands our understanding of St Helena's liberated African community and illustrates how ancient DNA analyses can be used to investigate the origins and identities of individuals whose lives were bound up in the story of slavery and its abolition.
Subjects
African People , Enslaved Persons , Humans , Female , Male , Ancient DNA , Black People/genetics , Genotype
ABSTRACT
The domestic pigeon's exceptional phenotypic diversity was key in developing Darwin's Theory of Evolution and establishing the concept of artificial selection. However, unlike its domestic counterpart, its wild progenitor, the rock dove Columba livia, has received considerably less attention. Therefore, questions regarding its domestication, evolution, taxonomy, and conservation status remain unresolved. We generated whole-genome sequencing data from 65 historical rock doves that represent all currently recognized subspecies and span the species' original geographic distribution. Our dataset includes 3 specimens from Darwin's collection, and the type specimens of 5 different taxa. We characterized their population structure, genomic diversity, and gene-flow patterns. Our results show that the West African subspecies C. l. gymnocyclus is basal to rock doves and domestic pigeons, and suggest gene flow between the rock dove's sister species, C. rupestris, and the ancestor of rock doves after its split from West African populations. These genomes allowed us to propose a model for the evolution of the rock dove in light of the refugia theory. We propose that rock dove genetic diversity and introgression patterns derive from a history of allopatric cycles and dispersion waves during the Quaternary glacial and interglacial periods. To explore the rock dove domestication history, we combined our new dataset with available genomes from domestic pigeons. Our results point to at least 1 domestication event in the Levant that gave rise to all domestic breeds analysed in this study. Finally, we propose a species-level taxonomic arrangement to reflect the evolutionary history of the West African rock dove populations.
Subjects
Columbidae , Genome , Animals , Columbidae/genetics
ABSTRACT
MOTIVATION: Machine learning (ML) methods are motivated by the need to automate information extraction from large datasets in order to support human users in data-driven tasks. This is an attractive approach for integrative joint analysis of vast amounts of omics data produced in next generation sequencing and other -omics assays. A systematic assessment of the current literature can help to identify key trends and potential gaps in methodology and applications. We surveyed the literature on ML multi-omic data integration and quantitatively explored the goals, techniques and data involved in this field. We were particularly interested in examining how researchers use ML to deal with the volume and complexity of these datasets. RESULTS: Our main finding is that the methods used are those that address the challenges of datasets with few samples and many features. Dimensionality reduction methods are used to reduce the feature count alongside models that can also appropriately handle relatively few samples. Popular techniques include autoencoders, random forests and support vector machines. We also found that the field is heavily influenced by the use of The Cancer Genome Atlas dataset, which is accessible and contains many diverse experiments. AVAILABILITY AND IMPLEMENTATION: All data and processing scripts are available at this GitLab repository: https://gitlab.com/polavieja_lab/ml_multi-omics_review/ or in Zenodo: https://doi.org/10.5281/zenodo.7361807. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subjects
Multiomics , Neoplasms , Humans , Neoplasms/genetics , Machine Learning , Genome
ABSTRACT
Current mitochondrial DNA (mtDNA) haplogroup classification tools map reads to a single reference genome and perform inference based on the detected mutations to this reference. This approach biases haplogroup assignments towards the reference and prohibits accurate calculations of the uncertainty in assignment. We present HaploCart, a probabilistic mtDNA haplogroup classifier which uses a pangenomic reference graph framework together with principles of Bayesian inference. We demonstrate that our approach significantly outperforms available tools by being more robust to lower coverage or incomplete consensus sequences and producing phylogenetically-aware confidence scores that are unbiased towards any haplogroup. HaploCart is available both as a command-line tool and through a user-friendly web interface. The C++ program accepts as input consensus FASTA, FASTQ, or GAM files, and outputs a text file with the haplogroup assignments of the samples along with the level of confidence in the assignments. Our work considerably reduces the amount of data required to obtain a confident mitochondrial haplogroup assignment.
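The Bayesian step described above can be illustrated with a minimal sketch. It is not HaploCart's implementation: the function name `haplogroup_posterior`, the uniform prior, and the toy log-likelihood values are all assumptions made for illustration. It only shows how per-haplogroup likelihoods might be turned into normalized confidence scores.

```python
import math


def haplogroup_posterior(log_likelihoods, priors=None):
    """Turn per-haplogroup log-likelihoods into posterior probabilities.

    Hypothetical helper for illustration only; a uniform prior is
    assumed when none is supplied.
    """
    names = list(log_likelihoods)
    if priors is None:
        priors = {h: 1.0 / len(names) for h in names}
    # Bayes' rule in log space: log posterior = log likelihood + log prior (+ const).
    log_post = {h: log_likelihoods[h] + math.log(priors[h]) for h in names}
    # Subtract the maximum before exponentiating to avoid numerical underflow.
    peak = max(log_post.values())
    weights = {h: math.exp(v - peak) for h, v in log_post.items()}
    total = sum(weights.values())
    return {h: w / total for h, w in weights.items()}


# Toy log-likelihoods for three haplogroup labels (made-up numbers).
example = {"H1": -10.0, "H2": -12.0, "U5": -20.0}
posterior = haplogroup_posterior(example)
best = max(posterior, key=posterior.get)
print(best, round(posterior[best], 3))
```

Reporting the full posterior rather than only the top label is what lets a low-coverage sample yield an honest, spread-out confidence score instead of a single overconfident call.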
Subjects
Mitochondrial DNA , Mitochondria , Humans , Mitochondrial DNA/genetics , Bayes Theorem , Haplotypes/genetics , Mitochondria/genetics , Mutation
ABSTRACT
Preserving genetic diversity and adaptive potential while avoiding inbreeding depression is crucial for the long-term conservation of natural populations. Despite demographic increases, traces of past bottleneck events at the genomic level should be carefully considered for population management. From this perspective, the peninsular Italian wolf is a paradigmatic case. After being on the brink of extinction in the late 1960s, peninsular Italian wolves rebounded and recolonized most of the peninsula aided by conservation measures, including habitat and legal protection. Notwithstanding their demographic recovery, a comprehensive understanding of the genomic consequences of the historical bottleneck in Italian wolves is still lacking. To fill this gap, we sequenced whole genomes of thirteen individuals sampled in the core historical range of the species in Central Italy to conduct population genomic analyses, including a comparison with wolves from two highly-inbred wolf populations (i.e., Scandinavia and Isle Royale). We found that peninsular Italian wolves, despite their recent recovery, still exhibit relatively low genetic diversity, a small effective population size, signatures of inbreeding, and a non-negligible genetic load. Our findings indicate that the peninsular Italian wolf population is still susceptible to bottleneck legacies, which could lead to local inbreeding depression in case of population reduction or fragmentations. This study emphasizes the importance of considering key genetic parameters to design appropriate long-term conservation management plans.
ABSTRACT
Although today the forest cover is continuous in Central Africa, this may have not always been the case, as the scarce fossil record in this region suggests that arid conditions might have significantly reduced tree density during the ice ages. Our aim was to investigate whether the dry ice age periods left a genetic signature on tree species that can be used to infer the date of the past fragmentation of the rainforest. We sequenced reduced representation libraries of 182 samples representing five widespread legume trees and seven outgroups. Phylogenetic analyses identified an early divergent lineage for all species in West Africa (Upper Guinea) and two clades in Central Africa: Lower Guinea-North and Lower Guinea-South. As the structure separating the Northern and Southern clades (congruent across species) cannot be explained by geographic barriers, we tested other hypotheses with demographic model testing using δαδι. The best estimates indicate that the two clades split between the Upper Pliocene and the Pleistocene, a date compatible with forest fragmentation driven by ice age climatic oscillations. Furthermore, we found remarkably older split dates for the shade-tolerant tree species with nonassisted seed dispersal than for light-demanding species with long-distance wind-dispersed seeds. Different recolonization abilities after recurrent cycles of forest fragmentation seem to explain why species with long-distance dispersal show more recent genetic admixture between the two clades than species with limited seed dispersal. Despite their old history, our results depict the African rainforests as a dynamic biome where tree species have expanded relatively recently after the last glaciation.
Subjects
Rainforest , Seed Dispersal/genetics , Trees/genetics , Africa , Genetic Variation , Phylogeny
ABSTRACT
Dogs have been essential to life in the Siberian Arctic for over 9,500 y, and this tight link between people and dogs continues in Siberian communities. Although Arctic Siberian groups such as the Nenets received limited gene flow from neighboring groups, archaeological evidence suggests that metallurgy and new subsistence strategies emerged in Northwest Siberia around 2,000 y ago. It is unclear if the Siberian Arctic dog population was as continuous as the people of the region or if instead admixture occurred, possibly in relation to the influx of material culture from other parts of Eurasia. To address this question, we sequenced and analyzed the genomes of 20 ancient and historical Siberian and Eurasian Steppe dogs. Our analyses indicate that while Siberian dogs were genetically homogenous between 9,500 and 7,000 y ago, later introduction of dogs from the Eurasian Steppe and Europe led to substantial admixture. This is clearly the case in the Iamal-Nenets region (Northwestern Siberia) where dogs from the Iron Age period (~2,000 y ago) possess substantially less ancestry related to European and Steppe dogs than dogs from the medieval period (~1,000 y ago). Combined with findings of nonlocal materials recovered from these archaeological sites, including glass beads and metal items, these results indicate that Northwest Siberian communities were connected to a larger trade network through which they acquired genetically distinctive dogs from other regions. These exchanges were part of a series of major societal changes, including the rise of large-scale reindeer pastoralism ~800 y ago.
Subjects
Animal Distribution , Biological Evolution , Dogs/genetics , Gene Flow , Population Genetics , Genome , Human Migration , Animals , Archaeology , Humans , Siberia
ABSTRACT
Lions are one of the world's most iconic megafauna, yet little is known about their temporal and spatial demographic history and population differentiation. We analyzed a genomic dataset of 20 specimens: two ca. 30,000-y-old cave lions (Panthera leo spelaea), 12 historic lions (Panthera leo leo/Panthera leo melanochaita) that lived between the 15th and 20th centuries outside the current geographic distribution of lions, and 6 present-day lions from Africa and India. We found that cave and modern lions shared an ancestor ca. 500,000 y ago and that the 2 lineages likely did not hybridize following their divergence. Within modern lions, we found 2 main lineages that diverged ca. 70,000 y ago, with clear evidence of subsequent gene flow. Our data also reveal a nearly complete absence of genetic diversity within Indian lions, probably due to well-documented extremely low effective population sizes in the recent past. Our results contribute toward the understanding of the evolutionary history of lions and complement conservation efforts to protect the diversity of this vulnerable species.
Subjects
Molecular Evolution , Lions/genetics , Lions/physiology , Africa , Animals , Gene Flow , Genetic Variation , Genomics , Geography , India , Lions/classification , Male , Phylogeny , X Chromosome
ABSTRACT
The third millennium BCE was a period of major cultural and demographic changes in Europe that signaled the beginning of the Bronze Age. People from the Pontic steppe expanded westward, leading to the formation of the Corded Ware complex and transforming the genetic landscape of Europe. At the time, the Globular Amphora culture (3300-2700 BCE) existed over large parts of Central and Eastern Europe, but little is known about their interaction with neighboring Corded Ware groups and steppe societies. Here we present a detailed study of a Late Neolithic mass grave from southern Poland belonging to the Globular Amphora culture and containing the remains of 15 men, women, and children, all killed by blows to the head. We sequenced their genomes to between 1.1- and 3.9-fold coverage and performed kinship analyses that demonstrate that the individuals belonged to a large extended family. The bodies had been carefully laid out according to kin relationships by someone who evidently knew the deceased. From a population genetic viewpoint, the people from Koszyce are clearly distinct from neighboring Corded Ware groups because of their lack of steppe-related ancestry. Although the reason for the massacre is unknown, it is possible that it was connected with the expansion of Corded Ware groups, which may have resulted in competition for resources and violent conflict. Together with the archaeological evidence, these analyses provide an unprecedented level of insight into the kinship structure and social behavior of a Late Neolithic community.
Subjects
Burial/history , Ancient DNA/analysis , Violence/history , Adolescent , Adult , Archaeology , Child , Preschool Child , Female , Ancient History , Human Migration , Humans , Infant , Male , Middle Aged , Pedigree , Poland , Young Adult
ABSTRACT
Africa's black (Diceros bicornis) and white (Ceratotherium simum) rhinoceros are closely related sister-taxa that evolved highly divergent obligate browsing and grazing feeding strategies. Although their precursor species Diceros praecox and Ceratotherium mauritanicum appear in the fossil record ~5.2 Ma, by 4 Ma both were still mixed feeders, and were even spatiotemporally sympatric at several Pliocene sites in what is today Africa's Rift Valley. Here, we ask whether or not D. praecox and C. mauritanicum were reproductively isolated when they came into Pliocene secondary contact. We sequenced and de novo assembled the first annotated black rhinoceros reference genome and compared it with available genomes of other black and white rhinoceros. We show that ancestral gene flow between D. praecox and C. mauritanicum ceased sometime between 3.3 and 4.1 Ma, despite conventional methods for the detection of gene flow from whole genome data returning false positive signatures of recent interspecific migration due to incomplete lineage sorting. We propose that ongoing Pliocene genetic exchange, for up to 2 My after initial divergence, could have potentially hindered the development of obligate feeding strategies until both species were fully reproductively isolated, but that the more severe and shifting paleoclimate of the early Pleistocene was likely the ultimate driver of ecological specialization in African rhinoceros.
Subjects
Gene Flow , Perissodactyla/genetics , Reproductive Isolation , Animals , Feeding Behavior , Female , Genome , Male , Mutation Rate
ABSTRACT
The protozoan Plasmodium vivax is responsible for 42% of all cases of malaria outside Africa. The parasite is currently largely restricted to tropical and subtropical latitudes in Asia, Oceania, and the Americas. However, it was historically present in most of Europe before being finally eradicated during the second half of the 20th century. The lack of genomic information on the extinct European lineage has prevented a clear understanding of historical population structuring and past migrations of P. vivax. We used medical microscope slides prepared in 1944 from malaria-affected patients from the Ebro Delta in Spain, one of the last footholds of malaria in Europe, to generate a genome of a European P. vivax strain. Population genetics and phylogenetic analyses placed this strain basal to a cluster including samples from the Americas. This genome allowed us to calibrate a genomic mutation rate for P. vivax, and to estimate the mean age of the last common ancestor between European and American strains to the 15th century. This date points to an introduction of the parasite during the European colonization of the Americas. In addition, we found that some known variants for resistance to antimalarial drugs, including chloroquine and sulfadoxine, were already present in this European strain, predating their use. Our results shed light on the evolution of an important human pathogen and illustrate the value of antique medical collections as a resource for retrieving genomic information on pathogens from the past.
Subjects
Vivax Malaria/parasitology , Plasmodium vivax/classification , Plasmodium vivax/genetics , Whole Genome Sequencing/methods , Americas , Asia , Molecular Evolution , Population Genetics , Protozoan Genome , High-Throughput Nucleotide Sequencing , Humans , Oceania , Phylogeny , Phylogeography , Spain
ABSTRACT
Large vertebrates are extremely sensitive to anthropogenic pressure, and their populations are declining fast. The white rhinoceros (Ceratotherium simum) is a paradigmatic case: this African megaherbivore has suffered a remarkable decline in the last 150 years due to human activities. Its subspecies, the northern (NWR) and the southern white rhinoceros (SWR), however, underwent opposite fates: the NWR vanished quickly, while the SWR recovered after the severe decline. Such demographic events are predicted to have an erosive effect at the genomic level, linked to the extirpation of diversity, and increased genetic drift and inbreeding. However, there is little empirical data available to directly reconstruct the subtleties of such processes in light of distinct demographic histories. Therefore, we generated a whole-genome, temporal data set consisting of 52 resequenced white rhinoceros genomes, representing both subspecies at two time windows: before and during/after the bottleneck. Our data reveal previously unknown population structure within both subspecies, as well as quantifiable genomic erosion. Genome-wide heterozygosity decreased significantly by 10% in the NWR and 36% in the SWR, and inbreeding coefficients rose significantly by 11% and 39%, respectively. Despite the remarkable loss of genomic diversity and recent inbreeding it suffered, the only surviving subspecies, the SWR, does not show a significant accumulation of genetic load compared to its historical counterpart. Our data provide empirical support for predictions about the genomic consequences of shrinking populations, and our findings have the potential to inform the conservation efforts of the remaining white rhinoceroses.
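To make the reported percentage changes concrete, the sketch below computes genome-wide heterozygosity as the fraction of called sites that are heterozygous, and then the relative change between two time windows. The genotype lists are invented toy data and the function names are hypothetical; the study's actual estimation pipeline is not described in the abstract and is likely more involved.

```python
def heterozygosity(genotypes):
    """Fraction of called diploid genotypes that are heterozygous.

    `genotypes` is a list of (allele, allele) tuples; None marks a
    site with no call, which is skipped.
    """
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no called genotypes")
    het_sites = sum(1 for a, b in called if a != b)
    return het_sites / len(called)


def relative_change(before, after):
    """Relative change from `before` to `after` (negative means a decrease)."""
    return (after - before) / before


# Toy genotypes at ten sites for one historical and one modern sample.
historical = [(0, 1), (0, 1), (0, 0), (1, 1), (0, 1),
              (0, 1), (0, 0), (0, 1), (1, 1), (0, 0)]
modern = [(0, 1), (0, 0), (0, 0), (1, 1), (0, 1),
          (0, 1), (0, 0), (0, 1), (1, 1), (0, 0)]

h_hist = heterozygosity(historical)
h_mod = heterozygosity(modern)
change = relative_change(h_hist, h_mod)
print(f"{h_hist:.2f} -> {h_mod:.2f} ({change:+.0%})")
```

A figure such as "decreased by 36%" in this framing is a relative change between the historical and modern estimates, not a difference in percentage points.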
Subjects
Anthropogenic Effects , Perissodactyla , Animals , Genomics , Inbreeding , Perissodactyla/genetics
ABSTRACT
Several Arctic marine mammal species are predicted to be negatively impacted by rapid sea ice loss associated with ongoing ocean warming. However, consequences for Arctic whales remain uncertain. To investigate how Arctic whales responded to past climatic fluctuations, we analysed 206 mitochondrial genomes from beluga whales (Delphinapterus leucas) sampled across their circumpolar range, and four nuclear genomes, covering both the Atlantic and the Pacific Arctic region. We found four well-differentiated mitochondrial lineages, which were established before the onset of the last glacial expansion ~110 thousand years ago. Our findings suggested these lineages diverged in allopatry, reflecting isolation of populations during glacial periods when the Arctic sea-shelf was covered by multiyear sea ice. Subsequent population expansion and secondary contact between the Atlantic and Pacific Oceans shaped the current geographic distribution of lineages, and may have facilitated mitochondrial introgression. Our demographic reconstructions based on both mitochondrial and nuclear genomes showed markedly lower population sizes during the Last Glacial Maximum (LGM) compared to the preceding Eemian and current Holocene interglacial periods. Habitat modelling similarly revealed less suitable habitat during the LGM (glacial) than at present (interglacial). Together, our findings suggested the association between climate, population size, and available habitat in belugas. Forecasts for year 2100 showed that beluga habitat will decrease and shift northwards as oceans continue to warm, putatively leading to population declines in some beluga populations. Finally, we identified vulnerable populations which, if extirpated as a consequence of ocean warming, will lead to a substantial decline of species-wide haplotype diversity.
Subjects
Beluga Whale , Animals , Arctic Regions , Beluga Whale/genetics , Demography , Ecosystem , Oceans and Seas , Pacific Ocean , Phylogeography
ABSTRACT
The Caribbean was one of the last parts of the Americas to be settled by humans, but how and when the islands were first occupied remains a matter of debate. Ancient DNA can help answer these questions, but the work has been hampered by poor DNA preservation. We report the genome sequence of a 1,000-year-old Lucayan Taino individual recovered from the site of Preacher's Cave in the Bahamas. We sequenced her genome to 12.4-fold coverage and show that she is genetically most closely related to present-day Arawakan speakers from northern South America, suggesting that the ancestors of the Lucayans originated there. Further, we find no evidence for recent inbreeding or isolation in the ancient genome, suggesting that the Lucayans had a relatively large effective population size. Finally, we show that the Native American components in some present-day Caribbean genomes are closely related to the ancient Taino, demonstrating an element of continuity between precontact populations and present-day Latino populations in the Caribbean.
Subjects
American Indian or Alaska Native/genetics , Human Genome/genetics , Human Migration/statistics & numerical data , Adult , Archaeology , Bahamas , Ancient DNA , Mitochondrial DNA/genetics , Female , Population Genetics , Genomics , Hispanic or Latino/genetics , Ancient History , Human Migration/history , Humans , Male , Paleontology , Phylogeny , Young Adult
ABSTRACT
The Arctic is warming at an unprecedented rate, with unknown consequences for endemic fauna. However, Earth has experienced severe climatic oscillations in the past, and understanding how species responded to them might provide insight into their resilience to near-future climatic predictions. Little is known about the responses of Arctic marine mammals to past climatic shifts, but narwhals (Monodon monoceros) are considered one of the endemic Arctic species most vulnerable to environmental change. Here, we analyse 121 complete mitochondrial genomes from narwhals sampled across their range and use them in combination with species distribution models to elucidate the influence of past and ongoing climatic shifts on their population structure and demographic history. We find low levels of genetic diversity and limited geographic structuring of genetic clades. We show that narwhals experienced a long-term low effective population size, which increased after the Last Glacial Maximum, when the amount of suitable habitat expanded. Similar post-glacial habitat release has been a key driver of population size expansion of other polar marine predators. Our analyses indicate that habitat availability has been critical to the success of narwhals, raising concerns for their fate in an increasingly warming Arctic.