Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
Syst Biol ; 2024 Aug 14.
Artículo en Inglés | MEDLINE | ID: mdl-39140829

RESUMEN

African antelope diversity is a globally unique vestige of a much richer world-wide Pleistocene megafauna. Despite this, the evolutionary processes leading to the prolific radiation of African antelopes are not well understood. Here, we sequenced 145 whole genomes from both subspecies of the waterbuck (Kobus ellipsiprymnus), an African antelope believed to be in the process of speciation. We investigated genetic structure and population divergence and found evidence of a mid-Pleistocene separation on either side of the eastern Great Rift Valley, consistent with vicariance caused by a rain shadow along the so-called 'Kingdon's Line'. However, we also found pervasive evidence of both recent and widespread historical gene flow across the Rift Valley barrier. By inferring the genome-wide landscape of variation among subspecies, we found 14 genomic regions of elevated differentiation, including a locus that may be related to each subspecies' distinctive coat pigmentation pattern. We investigated these regions as candidate speciation islands. However, we observed no significant reduction in gene flow in these regions, nor any indications of selection against hybrids. Altogether, these results suggest a pattern whereby climatically driven vicariance is the most important process driving the African antelope radiation, and suggest that reproductive isolation may not set in until very late in the divergence process. This has a significant impact on taxonomic inference, as many taxa will be in a gray area of ambiguous systematic status, possibly explaining why it has been hard to achieve consensus regarding the species status of many African antelopes. Our analyses demonstrate how population genetics based on low-depth whole genome sequencing can provide new insights that can help resolve how far lineages have gone along the path to speciation.

2.
Mol Ecol ; : e17539, 2024 Oct 07.
Artículo en Inglés | MEDLINE | ID: mdl-39373069

RESUMEN

Impalas are unusual among bovids because they have remained morphologically similar over millions of years-a phenomenon referred to as evolutionary stasis. Here, we sequenced 119 whole genomes from the two extant subspecies of impala, the common (Aepyceros melampus melampus) and black-faced (A. m. petersi) impala. We investigated the evolutionary forces working within the species to explore how they might be associated with its evolutionary stasis as a taxon. Despite being one of the most abundant bovid species, we found low genetic diversity overall, and a phylogeographic signal of spatial expansion from southern to eastern Africa. Contrary to expectations under a scenario of evolutionary stasis, we found pronounced genetic structure between and within the two subspecies with indications of ancient, but not recent, gene flow. Black-faced impala and eastern African common impala populations had more runs of homozygosity than common impala in southern Africa, and, using a proxy for genetic load, we found that natural selection is working less efficiently in these populations compared to the southern African populations. Together with the fossil record, our results are consistent with a fixed-optimum model of evolutionary stasis, in which impalas in the southern African core of the range are able to stay near their evolutionary fitness optimum as a generalist ecotone species, whereas eastern African impalas may struggle to do so due to the effects of genetic drift and reduced adaptation to the local habitat, leading to recurrent local extinction in eastern Africa and re-colonisation from the South.

3.
Mol Ecol ; 33(2): e17205, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-37971141

RESUMEN

Genomic studies of species threatened by extinction are providing crucial information about evolutionary mechanisms and genetic consequences of population declines and bottlenecks. However, to understand how species avoid the extinction vortex, insights can be drawn by studying species that thrive despite past declines. Here, we studied the population genomics of the muskox (Ovibos moschatus), an Ice Age relict that was at the brink of extinction for thousands of years at the end of the Pleistocene yet appears to be thriving today. We analysed 108 whole genomes, including present-day individuals representing the current native range of both muskox subspecies, the white-faced and the barren-ground muskox (O. moschatus wardi and O. moschatus moschatus) and a ~21,000-year-old ancient individual from Siberia. We found that the muskox' demographic history was profoundly shaped by past climate changes and post-glacial re-colonizations. In particular, the white-faced muskox has the lowest genome-wide heterozygosity recorded in an ungulate. Yet, there is no evidence of inbreeding depression in native muskox populations. We hypothesize that this can be explained by the effect of long-term gradual population declines that allowed for purging of strongly deleterious mutations. This study provides insights into how species with a history of population bottlenecks, small population sizes and low genetic diversity survive against all odds.


Asunto(s)
Metagenómica , Resiliencia Psicológica , Humanos , Animales , Recién Nacido , Evolución Biológica , Genómica , Rumiantes/genética , Variación Genética/genética
4.
Mol Biol Evol ; 39(7)2022 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-35779009

RESUMEN

African wild pigs have a contentious evolutionary and biogeographic history. Until recently, desert warthog (Phacochoerus aethiopicus) and common warthog (P. africanus) were considered a single species. Molecular evidence surprisingly suggested they diverged at least 4.4 million years ago, and possibly outside of Africa. We sequenced the first whole-genomes of four desert warthogs and 35 common warthogs from throughout their range. We show that these two species diverged much later than previously estimated, 400,000-1,700,000 years ago depending on assumptions of gene flow. This brings it into agreement with the paleontological record. We found that the common warthog originated in western Africa and subsequently colonized eastern and southern Africa. During this range expansion, the common warthog interbred with the desert warthog, presumably in eastern Africa, underlining this region's importance in African biogeography. We found that immune system-related genes may have adaptively introgressed into common warthogs, indicating that resistance to novel diseases was one of the most potent drivers of evolution as common warthogs expanded their range. Hence, we solve some of the key controversies surrounding warthog evolution and reveal a complex evolutionary history involving range expansion, introgression, and adaptation to new diseases.


Asunto(s)
Resistencia a la Enfermedad , Enfermedades de los Porcinos , África , África Oriental , Animales , Secuencia de Bases , Resistencia a la Enfermedad/genética , Porcinos
5.
Mol Ecol ; 32(8): 1860-1874, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-36651275

RESUMEN

The iconic Cape buffalo has experienced several documented population declines in recent history. These declines have been largely attributed to the late 19th century rinderpest pandemic. However, the effect of the rinderpest pandemic on their genetic diversity remains contentious, and other factors that have potentially affected this diversity include environmental changes during the Pleistocene, range expansions and recent human activity. Motivated by this, we present analyses of whole genome sequencing data from 59 individuals from across the Cape buffalo range to assess present-day levels of genome-wide genetic diversity and what factors have influenced these levels. We found that the Cape buffalo has high average heterozygosity overall (0.40%), with the two southernmost populations having significantly lower heterozygosity levels (0.33% and 0.29%) on par with that of the domesticated water buffalo (0.29%). Interestingly, we found that these lower levels are probably due to recent inbreeding (average fraction of runs of homozygosity 23.7% and 19.9%) rather than factors further back in time during the Pleistocene. Moreover, detailed investigations of recent demographic history show that events across the past three centuries were the main drivers of the exceptional loss of genetic diversity in the southernmost populations, coincident with the onset of colonialism in the southern extreme of the Cape buffalo range. Hence, our results add to the growing body of studies suggesting that multiple recent human-mediated impacts during the colonial period caused massive losses of large mammal abundance in southern Africa.


Asunto(s)
Genética de Población , Peste Bovina , Animales , Humanos , Sudáfrica , Variación Genética , Búfalos/genética , Colonialismo
6.
Mol Ecol ; 30(2): 528-544, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33226701

RESUMEN

Grant's gazelles have recently been proposed to be a species complex comprising three highly divergent mtDNA lineages (Nanger granti, N. notata and N. petersii). The three lineages have nonoverlapping distributions in East Africa, but without any obvious geographical divisions, making them an interesting model for studying the early-stage evolutionary dynamics of allopatric speciation in detail. Here, we use genomic data obtained by restriction site-associated (RAD) sequencing of 106 gazelle individuals to shed light on the evolutionary processes underlying Grant's gazelle divergence, to characterize their genetic structure and to assess the presence of gene flow between the main lineages in the species complex. We date the species divergence to 134,000 years ago, which is recent in evolutionary terms. We find population subdivision within N. granti, which coincides with the previously suggested two subspecies, N. g. granti and N. g. robertsii. Moreover, these two lineages seem to have hybridized in Masai Mara. Perhaps more surprisingly given their extreme genetic differentiation, N. granti and N. petersii also show signs of prolonged admixture in Mkomazi, which we identified as a hybrid population most likely founded by allopatric lineages coming into secondary contact. Despite the admixed composition of this population, elevated X chromosomal differentiation suggests that selection may be shaping the outcome of hybridization in this population. Our results therefore provide detailed insights into the processes of allopatric speciation and secondary contact in a recently radiated species complex.


Asunto(s)
Antílopes , Flujo Génico , África Oriental , Animales , Antílopes/genética , ADN Mitocondrial/genética , Especiación Genética , Hibridación Genética , Filogenia
7.
Eur J Hum Genet ; 32(2): 215-223, 2024 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-37903942

RESUMEN

Perturbation of lipid homoeostasis is a major risk factor for cardiovascular disease (CVD), the leading cause of death worldwide. We aimed to identify genetic variants affecting lipid levels, and thereby risk of CVD, in Greenlanders. Genome-wide association studies (GWAS) of six blood lipids, triglycerides, LDL-cholesterol, HDL-cholesterol, total cholesterol, as well as apolipoproteins A1 and B, were performed in up to 4473 Greenlanders. For genome-wide significant variants, we also tested for associations with additional traits, including CVD events. We identified 11 genome-wide significant loci associated with lipid traits. Most of these loci were already known in Europeans, however, we found a potential causal variant near PCSK9 (rs12117661), which was independent of the known PCSK9 loss-of-function variant (rs11491147). rs12117661 was associated with lower LDL-cholesterol (ßSD(SE) = -0.22 (0.03), p = 6.5 × 10-12) and total cholesterol (-0.17 (0.03), p = 1.1 × 10-8) in the Greenlandic study population. Similar associations were observed in Europeans from the UK Biobank, where the variant was also associated with a lower risk of CVD outcomes. Moreover, rs12117661 was a top eQTL for PCSK9 across tissues in European data from the GTEx portal, and was located in a predicted regulatory element, supporting a possible causal impact on PCSK9 expression. Combined, the 11 GWAS signals explained up to 16.3% of the variance of the lipid traits. This suggests that the genetic architecture of lipid levels in Greenlanders is different from Europeans, with fewer variants explaining the variance.


Asunto(s)
Enfermedades Cardiovasculares , Estudio de Asociación del Genoma Completo , Humanos , Proproteína Convertasa 9/genética , Groenlandia , Triglicéridos/genética , Lípidos/genética , HDL-Colesterol , LDL-Colesterol/genética , LDL-Colesterol/metabolismo , Enfermedades Cardiovasculares/genética , Polimorfismo de Nucleótido Simple
8.
Curr Biol ; 34(7): 1576-1586.e5, 2024 04 08.
Artículo en Inglés | MEDLINE | ID: mdl-38479386

RESUMEN

Strong genetic structure has prompted discussion regarding giraffe taxonomy,1,2,3 including a suggestion to split the giraffe into four species: Northern (Giraffa c. camelopardalis), Reticulated (G. c. reticulata), Masai (G. c. tippelskirchi), and Southern giraffes (G. c. giraffa).4,5,6 However, their evolutionary history is not yet fully resolved, as previous studies used a simple bifurcating model and did not explore the presence or extent of gene flow between lineages. We therefore inferred a model that incorporates various evolutionary processes to assess the drivers of contemporary giraffe diversity. We analyzed whole-genome sequencing data from 90 wild giraffes from 29 localities across their current distribution. The most basal divergence was dated to 280 kya. Genetic differentiation, FST, among major lineages ranged between 0.28 and 0.62, and we found significant levels of ancient gene flow between them. In particular, several analyses suggested that the Reticulated lineage evolved through admixture, with almost equal contribution from the Northern lineage and an ancestral lineage related to Masai and Southern giraffes. These new results highlight a scenario of strong differentiation despite gene flow, providing further context for the interpretation of giraffe diversity and the process of speciation in general. They also illustrate that conservation measures need to target various lineages and sublineages and that separate management strategies are needed to conserve giraffe diversity effectively. Given local extinctions and recent dramatic declines in many giraffe populations, this improved understanding of giraffe evolutionary history is relevant for conservation interventions, including reintroductions and reinforcements of existing populations.


Asunto(s)
Jirafas , Animales , Jirafas/genética , Rumiantes/genética , Evolución Biológica , Filogenia , Flujo Genético
9.
Nat Commun ; 15(1): 2921, 2024 Apr 12.
Artículo en Inglés | MEDLINE | ID: mdl-38609362

RESUMEN

The blue wildebeest (Connochaetes taurinus) is a keystone species in savanna ecosystems from southern to eastern Africa, and is well known for its spectacular migrations and locally extreme abundance. In contrast, the black wildebeest (C. gnou) is endemic to southern Africa, barely escaped extinction in the 1900s and is feared to be in danger of genetic swamping from the blue wildebeest. Despite the ecological importance of the wildebeest, there is a lack of understanding of how its unique migratory ecology has affected its gene flow, genetic structure and phylogeography. Here, we analyze whole genomes from 121 blue and 22 black wildebeest across the genus' range. We find discrete genetic structure consistent with the morphologically defined subspecies. Unexpectedly, our analyses reveal no signs of recent interspecific admixture, but rather a late Pleistocene introgression of black wildebeest into the southern blue wildebeest populations. Finally, we find that migratory blue wildebeest populations exhibit a combination of long-range panmixia, higher genetic diversity and lower inbreeding levels compared to neighboring populations whose migration has recently been disrupted. These findings provide crucial insights into the evolutionary history of the wildebeest, and tangible genetic evidence for the negative effects of anthropogenic activities on highly migratory ungulates.


Asunto(s)
Antílopes , Animales , Antílopes/genética , Ecosistema , África Oriental , África Austral , Efectos Antropogénicos
10.
Nat Commun ; 15(1): 172, 2024 Jan 03.
Artículo en Inglés | MEDLINE | ID: mdl-38172616

RESUMEN

Several African mammals exhibit a phylogeographic pattern where closely related taxa are split between West/Central and East/Southern Africa, but their evolutionary relationships and histories remain controversial. Bushpigs (Potamochoerus larvatus) and red river hogs (P. porcus) are recognised as separate species due to morphological distinctions, a perceived lack of interbreeding at contact, and putatively old divergence times, but historically, they were considered conspecific. Moreover, the presence of Malagasy bushpigs as the sole large terrestrial mammal shared with the African mainland raises intriguing questions about its origin and arrival in Madagascar. Analyses of 67 whole genomes revealed a genetic continuum between the two species, with putative signatures of historical gene flow, variable FST values, and a recent divergence time (<500,000 years). Thus, our study challenges key arguments for splitting Potamochoerus into two species and suggests their speciation might be incomplete. Our findings also indicate that Malagasy bushpigs diverged from southern African populations and underwent a limited bottleneck 1000-5000 years ago, concurrent with human arrival in Madagascar. These results shed light on the evolutionary history of an iconic and widespread African mammal and provide insight into the longstanding biogeographic puzzle surrounding the bushpig's presence in Madagascar.


Asunto(s)
Mamíferos , Humanos , Animales , Porcinos , Madagascar , Filogenia , Porosidad , Filogeografía , Mamíferos/genética
11.
Mol Ecol Resour ; 23(7): 1604-1619, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37400991

RESUMEN

The genome of recently admixed individuals or hybrids has characteristic genetic patterns that can be used to learn about their recent admixture history. One of these are patterns of interancestry heterozygosity, which can be inferred from SNP data from either called genotypes or genotype likelihoods, without the need for information on genomic location. This makes them applicable to a wide range of data that are often used in evolutionary and conservation genomic studies, such as low-depth sequencing mapped to scaffolds and reduced representation sequencing. Here we implement maximum likelihood estimation of interancestry heterozygosity patterns using two complementary models. We furthermore develop apoh (Admixture Pedigrees of Hybrids), a software that uses estimates of paired ancestry proportions to detect recently admixed individuals or hybrids, and to suggest possible admixture pedigrees. It furthermore calculates several hybrid indices that make it easier to identify and rank possible admixture pedigrees that could give rise to the estimated patterns. We implemented apoh both as a command line tool and as a Graphical User Interface that allows the user to automatically and interactively explore, rank and visualize compatible recent admixture pedigrees, and calculate the different summary indices. We validate the performance of the method using admixed family trios from the 1000 Genomes Project. In addition, we show its applicability on identifying recent hybrids from RAD-seq data of Grant's gazelle (Nanger granti and Nanger petersii) and whole genome low-depth data of waterbuck (Kobus ellipsiprymnus) which shows complex admixture of up to four populations.


Asunto(s)
Genética de Población , Genoma , Humanos , Linaje , Genoma/genética , Genotipo , Programas Informáticos
12.
Genetics ; 225(2)2023 Oct 04.
Artículo en Inglés | MEDLINE | ID: mdl-37611212

RESUMEN

Principal component analysis (PCA) is commonly used in genetics to infer and visualize population structure and admixture between populations. PCA is often interpreted in a way similar to inferred admixture proportions, where it is assumed that individuals belong to one of several possible populations or are admixed between these populations. We propose a new method to assess the statistical fit of PCA (interpreted as a model spanned by the top principal components) and to show that violations of the PCA assumptions affect the fit. Our method uses the chosen top principal components to predict the genotypes. By assessing the covariance (and the correlation) of the residuals (the differences between observed and predicted genotypes), we are able to detect violation of the model assumptions. Based on simulations and genome-wide human data, we show that our assessment of fit can be used to guide the interpretation of the data and to pinpoint individuals that are not well represented by the chosen principal components. Our method works equally on other similar models, such as the admixture model, where the mean of the data is represented by linear matrix decomposition.

13.
Mol Ecol Resour ; 22(2): 458-467, 2022 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-34431216

RESUMEN

Being able to assign sex to individuals and identify autosomal and sex-linked scaffolds are essential in most population genomic analyses. Non-model organisms often have genome assemblies at scaffold-level and lack characterization of sex-linked scaffolds. Previous methods to identify sex and sex-linked scaffolds have relied on synteny between the non-model organism and a closely related species or prior knowledge about the sex of the samples to identify sex-linked scaffolds. In the latter case, the difference in depth of coverage between the autosomes and the sex chromosomes are used. Here, we present "sex assignment through coverage" (SATC), a method to assign sex to samples and identify sex-linked scaffolds from next generation sequencing (NGS) data. The method works for species with a homogametic/heterogametic sex determination system and only requires a scaffold-level reference assembly and sampling of both sexes with whole genome sequencing (WGS) data. We use the sequencing depth distribution across scaffolds to jointly identify: (i) male and female individuals, and (ii) sex-linked scaffolds. This is achieved through projecting the scaffold depths into a low-dimensional space using principal component analysis (PCA) and subsequent Gaussian mixture clustering. We demonstrate the applicability of our method using data from five mammal species and a bird species complex. The method is freely available at https://github.com/popgenDK/SATC as R code and a graphical user interface (GUI).


Asunto(s)
Genoma , Genómica , Cromosomas Sexuales , Análisis para Determinación del Sexo , Animales , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Masculino , Cromosomas Sexuales/genética , Sintenía
14.
Genetics ; 222(4)2022 11 30.
Artículo en Inglés | MEDLINE | ID: mdl-36173322

RESUMEN

The site frequency spectrum is an important summary statistic in population genetics used for inference on demographic history and selection. However, estimation of the site frequency spectrum from called genotypes introduces bias when working with low-coverage sequencing data. Methods exist for addressing this issue but sometimes suffer from 2 problems. First, they can have very high computational demands, to the point that it may not be possible to run estimation for genome-scale data. Second, existing methods are prone to overfitting, especially for multidimensional site frequency spectrum estimation. In this article, we present a stochastic expectation-maximization algorithm for inferring the site frequency spectrum from NGS data that address these challenges. We show that this algorithm greatly reduces runtime and enables estimation with constant, trivial RAM usage. Furthermore, the algorithm reduces overfitting and thereby improves downstream inference. An implementation is available at github.com/malthesr/winsfs.


Asunto(s)
Algoritmos , Genética de Población , Genotipo , Genoma , Sesgo , Secuenciación de Nucleótidos de Alto Rendimiento/métodos
15.
G3 (Bethesda) ; 11(8)2021 08 07.
Artículo en Inglés | MEDLINE | ID: mdl-34015083

RESUMEN

Estimation of relatedness between pairs of individuals is important in many genetic research areas. When estimating relatedness, it is important to account for admixture if this is present. However, the methods that can account for admixture are all based on genotype data as input, which is a problem for low-depth next-generation sequencing (NGS) data from which genotypes are called with high uncertainty. Here, we present a software tool, NGSremix, for maximum likelihood estimation of relatedness between pairs of admixed individuals from low-depth NGS data, which takes the uncertainty of the genotypes into account via genotype likelihoods. Using both simulated and real NGS data for admixed individuals with an average depth of 4x or below we show that our method works well and clearly outperforms all the commonly used state-of-the-art relatedness estimation methods PLINK, KING, relateAdmix, and ngsRelate that all perform quite poorly. Hence, NGSremix is a useful new tool for estimating relatedness in admixed populations from low-depth NGS data. NGSremix is implemented in C/C++ in a multi-threaded software and is freely available on Github https://github.com/KHanghoj/NGSremix.


Asunto(s)
Genética de Población , Secuenciación de Nucleótidos de Alto Rendimiento , Genotipo , Humanos , Polimorfismo de Nucleótido Simple , Probabilidad , Programas Informáticos
16.
Mol Ecol Resour ; 21(4): 1085-1097, 2021 May.
Artículo en Inglés | MEDLINE | ID: mdl-33434329

RESUMEN

Genotyping-by-sequencing methods such as RADseq are popular for generating genomic and population-scale data sets from a diverse range of organisms. These often lack a usable reference genome, restricting users to RADseq specific software for processing. However, these come with limitations compared to generic next generation sequencing (NGS) toolkits. Here, we describe and test a simple pipeline for reference-free RADseq data processing that blends de novo elements from STACKS with the full suite of state-of-the art NGS tools. Specifically, we use the de novo RADseq assembly employed by STACKS to create a catalogue of RAD loci that serves as a reference for read mapping, variant calling and site filters. Using RADseq data from 28 zebra sequenced to ~8x depth-of-coverage we evaluate our approach by comparing the site frequency spectra (SFS) to those from alternative pipelines. Most pipelines yielded similar SFS at 8x depth, but only a genotype likelihood based pipeline performed similarly at low sequencing depth (2-4x). We compared the RADseq SFS with medium-depth (~13x) shotgun sequencing of eight overlapping samples, revealing that the RADseq SFS was persistently slightly skewed towards rare and invariant alleles. Using simulations and human data we confirm that this is expected when there is allelic dropout (AD) in the RADseq data. AD in the RADseq data caused a heterozygosity deficit of ~16%, which dropped to ~5% after filtering AD. Hence, AD was the most important source of bias in our RADseq data.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN , Programas Informáticos , Animales , Equidae/genética , Genómica , Humanos , Funciones de Verosimilitud , Pérdida de Heterocigocidad , Polimorfismo de Nucleótido Simple
17.
Curr Biol ; 31(9): 1862-1871.e5, 2021 05 10.
Artículo en Inglés | MEDLINE | ID: mdl-33636121

RESUMEN

Large carnivores are generally sensitive to ecosystem changes because their specialized diet and position at the top of the trophic pyramid is associated with small population sizes. Accordingly, low genetic diversity at the whole-genome level has been reported for all big cat species, including the widely distributed leopard. However, all previous whole-genome analyses of leopards are based on the Far Eastern Amur leopards that live at the extremity of the species' distribution and therefore are not necessarily representative of the whole species. We sequenced 53 whole genomes of African leopards. Strikingly, we found that the genomic diversity in the African leopard is 2- to 5-fold higher than in other big cats, including the Amur leopard, likely because of an exceptionally high effective population size maintained by the African leopard throughout the Pleistocene. Furthermore, we detected ongoing gene flow and very low population differentiation within African leopards compared with those of other big cats. We corroborated this by showing a complete absence of an otherwise ubiquitous equatorial forest barrier to gene flow. This sets the leopard apart from most other widely distributed large African mammals, including lions. These results revise our understanding of trophic sensitivity and highlight the remarkable resilience of the African leopard, likely because of its extraordinary habitat versatility and broad dietary niche.


Asunto(s)
Ecosistema , Variación Genética , Panthera/anatomía & histología , Panthera/genética , África , Animales , Femenino , Flujo Génico , Masculino , Panthera/clasificación , Densidad de Población
18.
Mol Ecol Resour ; 20(4): 936-949, 2020 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-32323416

RESUMEN

Model based methods for genetic clustering of individuals, such as those implemented in structure or ADMIXTURE, allow the user to infer individual ancestries and study population structure. The underlying model makes several assumptions about the demographic history that shaped the analysed genetic data. One assumption is that all individuals are a result of K homogeneous ancestral populations that are all well represented in the data, while another assumption is that no drift happened after the admixture event. The histories of many real world populations do not conform to that model, and in that case taking the inferred admixture proportions at face value might be misleading. We propose a method to evaluate the fit of admixture models based on estimating the correlation of the residual difference between the true genotypes and the genotypes predicted by the model. When the model assumptions are not violated, the residuals from a pair of individuals are not correlated. In the case of a bad fitting admixture model, individuals with similar demographic histories have a positive correlation of their residuals. Using simulated and real data, we show how the method is able to detect a bad fit of inferred admixture proportions due to using an insufficient number of clusters K or to demographic histories that deviate significantly from the admixture model assumptions, such as admixture from ghost populations, drift after admixture events and nondiscrete ancestral populations. We have implemented the method as an open source software that can be applied to both unphased genotypes and low depth sequencing data.


Asunto(s)
Pruebas Genéticas/métodos , Modelos Genéticos , Simulación por Computador , Genética de Población/métodos , Genotipo , Humanos , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA